Abstract
We establish that the Dirichlet problem for linear growth functionals on \({\text {BD}}\), the functions of bounded deformation, gives rise to the same unconditional Sobolev and partial \({\text {C}}^{1,\alpha }\)-regularity theory as presently available for the full gradient Dirichlet problem on \({\text {BV}}\). Functions of bounded deformation play an important role in, for example plasticity, however, by Ornstein’s non-inequality, contain \({\text {BV}}\) as a proper subspace. Thus, techniques to establish regularity by full gradient methods for variational problems on BV do not apply here. In particular, applying to all generalised minima (that is, minima of a suitably relaxed problem) despite their non-uniqueness and reaching the ellipticity ranges known from the \({\text {BV}}\)-case, this paper extends previous Sobolev regularity results by Gmeineder and Kristensen (in J Calc Var 58:56, 2019) in an optimal way.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
A variety of physically relevant variational problems that describe the displacements of bodies subject to external forces are posed in the space \({\text {BD}}\) of functions of bounded deformation, see [9, 40, 52, 53, 77, 79, 80] for overviews and applications in plasticity. For a given open set \(\Omega \subset {\mathbb {R}}^{n}\), this space consists of all \(u\in {\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})\) such that the distributional symmetric gradient \(\varepsilon (u):=\frac{1}{2}({\text {D}}\!u+{\text {D}}\!u^{\top })\) is a finite, matrix-valued Radon measure on \(\Omega \). By Ornstein’s Non-Inequality [24, 50, 51, 65], there exists no constant \(c>0\) such that \(\Vert D\varphi \Vert _{{\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n\times n})}\leqq c\Vert \varepsilon (\varphi )\Vert _{{\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n\times n})}\) holds for all \(\varphi \in {\text {C}}_{c}^{\infty }(\Omega ;{\mathbb {R}}^{n})\). In consequence, \({\text {BD}}(\Omega )\) is in fact larger than \({\text {BV}}(\Omega ;{\mathbb {R}}^{n})\), and the full distributional gradients of \({\text {BD}}\)-maps in general do not need to exist as (locally) finite \({\mathbb {R}}^{n\times n}\)-valued Radon measures. However, by the coerciveness considerations outlined below, this space displays the natural function space setup for a vast class of variational integrals. For minima of such functionals, the present paper aims to develop a regularity theory which—from a Sobolev regularity and partial Hölder continuity perspective—essentially yields the same results which are presently known for the Dirichlet problem on\({\text {BV}}\).
This task can be viewed as a borderline case of a theory having emerged over the past decades. Namely, considering variational integrals
over suitable Dirichlet classes, an abundance of criteria for improved regularity of minima is available subject to convexity, smoothness and the growth behaviour of \(g:{\mathbb {R}}^{N\times n}\rightarrow {\mathbb {R}}\). It is only possible to give an incomplete list of the wealth of contributions, and instead we refer to Mingione [62, 63] and the references therein for more detail. When linear growth functionals are considered—that is, \(c_{1}|z|-\gamma \leqq g(z)\leqq c_{2}(1+|z|)\) for some \(c_{1},c_{2},\gamma >0\) and all \(z\in {\mathbb {R}}^{N\times n}\)—then compactness considerations lead to the study of minima of a suitably relaxed problem on \({\text {BV}}\), cf. [14, 18, 19, 42]. In both linear and superlinear growth regimes, these contributions crucially utilise at various steps that the full gradients of minimising sequences are uniformly bounded in some \({\text {L}}^{p}\)-space, \(p\geqq 1\). When (1.1) is modified to act on the symmetric gradients exclusively, convexity and \(1<p<\infty \)-growth of g still allow one to work on \({\text {W}}^{1,p}\) by Korn’s inequality. Also, in the borderline case of \(L\log L\)-growth integrands as considered in the seminal works by Fuchs and Seregin [39, 40], one may essentially still work on \({\text {W}}^{1,1}\) (cf. Section 2.2.2). In the linear growth, symmetric gradient situation, however, Ornstein’s Non-Inequality neither allows to a priori consider \({\text {W}}^{1,1}\)- or \({\text {BV}}\)-regular minima nor to employ the usual full-gradient techniques. A key question in this setting thus is to which extent the results from corresponding full gradient theory on\({\text {BV}}\) continue to hold for the Dirichlet problem on \({\text {BD}}\), too.
1.1 Aim and Scope
Toward a unifying regularity theory for the Dirichlet problem on \({\text {BD}}\), we begin by giving the underlying setup first. Let \(\Omega \subset {\mathbb {R}}^{n}\) be open and bounded with Lipschitz boundary \(\partial \Omega \). We consider (generalised) minima of variational principles
where \({\mathscr {D}}_{u_{0}}\) is a suitable Dirichlet class. As a key feature, we suppose that the convex integrand \(f:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) is of linear growth, by which we understand that there exist constants \(c_{1},c_{2},\gamma >0\) such that there holds
In this situation, we put \({\text {LD}}(\Omega ):=\{v\in {\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n}):\;\varepsilon (v)\in {\text {L}}^{1}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\}\) to be endowed with the canonical norm \(\Vert v\Vert _{{\text {LD}}(\Omega )}:=\Vert v\Vert _{{\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})}+\Vert \varepsilon (v)\Vert _{{\text {L}}^{1}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})}\), and define \({\text {LD}}_{0}(\Omega )\) as the closure of \({\text {C}}_{c}^{\infty }(\Omega ;{\mathbb {R}}^{n})\) with respect to \(\Vert \cdot \Vert _{{\text {LD}}(\Omega )}\). With this terminology, we pick \(u_{0}\in {\text {LD}}(\Omega )\) and set \({\mathscr {D}}_{u_{0}}:=u_{0}+{\text {LD}}_{0}(\Omega )\). Subject to (LG), F is bounded below on \({\mathscr {D}}_{u_{0}}\) and minimising sequences are bounded in \({\text {LD}}(\Omega )\); note that this is not necessarily the case in \({\text {W}}^{1,1}(\Omega ;{\mathbb {R}}^{n})\). By non-reflexivity of \({\text {LD}}(\Omega )\) and possible concentration effects, minimising sequences do not need to be weakly relatively compact in \({\text {LD}}(\Omega )\) but can be shown to be weak*-relatively compact in\({\text {BD}}(\Omega )\) (cf. Section 2 for the requisite background terminology). As a routine consequence, for F to be defined for \({\text {BD}}\)-maps, it must be suitably relaxed. For \(u,v\in {\text {BD}}_{{\text {loc}}}(\Omega )\) and an open Lipschitz subset \(\omega \subseteq \Omega \) we put
Following the by now classical works [42, 49], \({\overline{F}}_{u_{0}}[u]:={\overline{F}}_{u_{0}}[u;\Omega ]\) then coincides with the weak*-relaxation (or weak*-Lebesgue–Serrin extension) of F to \({\text {BD}}(\Omega )\) subject to the Dirichlet constraint \(u|_{\partial \Omega }=u_{0}\). Here, for \(u\in {\text {BD}}(\Omega )\) we denote the Lebesgue–Radon–Nikodým decomposition of \({\text {E}}\!u\)Footnote 1 into its absolutely continuous and singular parts for \({\mathscr {L}}^{n}\). Moreover, \(f^{\infty }(z):=\lim _{t\searrow 0}tf(z/t)\) denotes the recession function of f, capturing the integrand’s behaviour at infinity. Consequently, we call a map \(u\in {\text {BD}}(\Omega )\) a generalised minimiser if \({\overline{F}}_{u_{0}}[u]\leqq {\overline{F}}_{u_{0}}[v]\) for all \(v\in {\text {BD}}(\Omega )\). Similarly, we call \(u\in {\text {BD}}_{{\text {loc}}}(\Omega )\) a local generalised minimiser if \({\overline{F}}_{u}[u;\omega ]\leqq {\overline{F}}_{u}[v;\omega ]\) for all open subsets \(\omega \Subset \Omega \) with Lipschitz boundary \(\partial \omega \) and all \(v\in {\text {BD}}_{{\text {loc}}}(\Omega )\). Subject to the Dirichlet datum \(u_{0}\), the set of all generalised minima is denoted \({\text {GM}}(F;u_{0})\) and, similarly, the set of all local generalised minima is denoted \({\text {GM}}_{{\text {loc}}}(F)\). As a consequence of [46, Section 5], generalised minimisers always exist in this framework. For future reference, we remark that even if f is strictly convex, generalised minima are not unique in general; see Section 4.1 for more detail.
In view of the main theme of the paper, we shall focus on higher Sobolev and partial regularity for generalised minima of the variational principle (1.2), even leading to novel results in the radially symmetric case \(f={\widetilde{f}}(|{\cdot }|)\). The corresponding results crucially rely on the degenerate elliptic behaviour of the integrands f, being roughly depicted in Figure 1, and let us retrieve what is unconditionally known for the Dirichlet problem on \({\text {BV}}\). As such, we particularly obtain criteria for the full gradients of generalised minima to exist as locally finite Radon measures. To explain why the results given below are close to optimal, we briefly pause to introduce the relevant ellipticity scale.
1.2 \({\text {W}}_{{\text {loc}}}^{1,1}\)-Regularity of Minima
As it is customary in the linear growth context and motivated by Bernstein’s genre [16, 42, 75] and the conditions considered by Ladyzhenskaya and Ural’ceva [57], a natural scale of \({\text {C}}^{2}\)-integrands is given by those \(f:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) that satisfy for some \(a>1\) and \(0<\lambda \leqq \Lambda <\infty \)
For such integrands, (1.4) precisely describes the degeneration of the ellipticity ratio of \(f''\). From a more systematic viewpoint, this scale has been studied by Bildhauer, Fuchs and Mingione [17,18,19, 38] in the (p, q)-growth or \({\text {BV}}\)-context, respectively, under the name of \(\mu \)-ellipticity, where \(\mu =a\) in our terminology; also see [42]. Note that \(a=1\) is excluded here as then the corresponding integrands are not of linear growth. Even though convex integrands f with (LG) have the same growth behaviour from above and below, this is not the case on the level of second derivatives. To some extent, such problems thus have some resemblance with (p, q)-growth type problems. Higher integrability of the minimisers’ gradients can only be expected when p and q are not too far apart or additional hypotheses are imposed, see the seminal work [34] by Esposito, Leonetti and Mingione (also cf. [62, Theorem 6.2] and Carozza, Kristensen and Passarelli di Napoli [22]). More precisely, for suitably regular, convex (p, q)-type problems the critical exponent ratio to yield \({\text {W}}_{{\text {loc}}}^{1,q}\)-regular minima was determined in [34] as
a threshold ratio getting in line with others identified earlier in a slightly different context by Marcellini [59, 60]. Beyond this threshold, one usually imposes additional hypotheses—such as local boundedness, cf. [22]—on minima in order to obtain regularity results, and such conditions in fact can be justified for a variety of situations, so for instance by maximum principles or, in the radial situation, Moser-type \({\text {L}}^{\infty }\)-bounds.
This distinction of ellipticity regimes also enters in the \({\text {BV}}\)-theory for full gradient functionals. In fact, it is known from [14, 18] that if \(1<a<1+\frac{2}{n}\), then generalised minima of the corresponding full gradient functionals belong to some \({\text {W}}_{{\text {loc}}}^{1,p}\) with \(p>1\) whereas in the regime \(1+\frac{2}{n}\leqq a \leqq 3\), the only \({\text {W}}^{1,1}\)-regularity results [14, 18] are subject to the additional \({\text {L}}_{{\text {loc}}}^{\infty }\)-hypothesis on the generalised minima. For variational principles of the form (1.2) subject to (1.4), a first result has been given by Kristensen and the author [46] by passing to fractional estimates. While striving for the optimal ellipticity \(1<a<1+\frac{2}{n}\), the method as employed therein only yields the \({\text {W}}_{{\text {loc}}}^{1,1}\)-regularity for \(1<a<1+\frac{1}{n}\), revealing a crucial ellipticity gap of size \(\frac{1}{n}\). The first main result of this paper precisely closes this gap:
Theorem 1.1
(Universal \({\text {W}}_{{\text {loc}}}^{1,1}\)-regularity estimates) Let \(u_{0}\in {\text {LD}}(\Omega )\) and suppose that \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) satisfies (LG) and (1.4) with \(1<a<1+\frac{2}{n}\). If
-
(a)
\(n=2\), then every generalised minimiser \(u\in {\text {GM}}(F;u_{0})\) is of class \({\text {LD}}(\Omega )\cap {\text {W}}_{{\text {loc}}}^{1,q}(\Omega ;{\mathbb {R}}^{n})\) for any\(1\leqq q < \infty \). More precisely, u has locally exponentially integrable gradients in the following sense: there exists \(c=c(a,c_{1},c_{2},\gamma ,\lambda ,\Lambda )>0\) such that for any \(x_{0}\in \Omega \) and \(0<r<1\) with \({\text {B}}(x_{0},5r)\subset \Omega \) it holds that
(1.6) -
(b)
\(n\geqq 3\), then every generalised minimiser \(u\in {\text {GM}}(F;u_{0})\) is of class \({\text {LD}}(\Omega )\cap {\text {W}}_{{\text {loc}}}^{1,q}(\Omega ;{\mathbb {R}}^{n})\) for \(q=\frac{2-a}{n-2}n\). More precisely, there exists \(c=c(n,a,c_{1},c_{2},\gamma ,\lambda ,\Lambda )>0\) such that for any \(x_{0}\in \Omega \) and \(0<r<1\) with \({\text {B}}(x_{0},5r)\subset \Omega \) it holds that
(1.7)
Theorem 1.1 thus gives exactly the same Sobolev regularity in the\({\text {BD}}\)-situation as is presently known for the autonomous Dirichlet problem on\({\text {BV}}\). As mentioned above, for the autonomous Dirichlet problem on \({\text {BV}}\) it is possible to establish \({\text {W}}_{{\text {loc}}}^{1,L\log ^{2}L}\)-regularity of locally bounded generalised minima for the wider ellipticity range \(1+\frac{2}{n}\leqq a \leqq 3\); note that for \(a>3\), no \({\text {W}}^{1,1}\)-regularity results are available at presentFootnote 2 While, in principle, the strategy underlying Theorem 1.1 can be modified to work in the \({\text {L}}_{{\text {loc}}}^{\infty }\)-constrained case, too, no method is known to us that would provide locally bounded generalised minima at all. In fact, whereas maximum principles and Moser-type \({\text {L}}_{{\text {loc}}}^{\infty }\)-bounds can be employed in the full gradient setting subject to specific structural conditions on the integrands (cf. [14, Theorem 1.11, Theorem D.1–3]), the symmetric gradient seems to destroy the impact of any such good structural hypotheses (so for example radial dependence on the arguments). In order not to produce a possibly vacuous result, we thus stick to the ellipticity range \(1<a<1+\frac{2}{n}\) for which the additional local boundedness is not required. Deferring the precise discussion to Section 4, let us now briefly outline the underlying chief obstructions that make Theorem 1.1 considerably harder to obtain than its BV-analogue.
To establish the regularity assertions of Theorem 1.1, one might consider a vanishing viscosity sequence and then derive uniform second order estimates. Essentially inspired by the foundational works of Seregin [71,72,73,74], in the \({\text {BV}}\)-case a difference quotient approach yields the requisite estimates as a consequence of the fact that the full gradients of the single viscosity approximations are uniformly bounded in \({\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n\times n})\); cf. [18, 19]. Within the framework of Theorem 1.1, however, the latter boundedness cannot be assumed and \({\text {L}}^{1}\)-estimates on the full gradients must be avoided throughout. On the other hand, generalised minima are in general non-unique—despite strict convexity of the integrands f. Hence, even if it were applicable, the vanishing viscosity approach would only apply to one particular generalised minimiser. The claimed universal regularity estimates (that is, for all generalised minima) thus require another argument.
Directly working on the primal problem, we modify and extend the Ekeland viscosity approximation scheme as introduced by Beck and Schmidt [14] in the \({\text {BV}}\)-context and generalised to the \({\text {BD}}\)-situation by Kristensen and the author [46]; see [1, 58] for the first applications of the Ekeland variational principle in the regularity context. Here, on the one hand, the Ekeland-type approximations must be strong enough for the (perturbed) Euler–Lagrange equations to permit a splitting strategy, thereby implying the requisite second order estimates for the corresponding almost-minima. Simultaneously, they must be weak enough to be treatable by the a priori information on the minimising sequences. By our arguments below—and contrary to the \({\text {W}}^{-1,1}\)-perturbations in the \({\text {BV}}\)-context [14]—the correct perturbation space now turns out to be \({\text {W}}^{-2,1}\) (see Section 2.2.3 for the definition). Without the aforementioned splitting strategy, in turn inspired by Seregin et al. [39, 74], we would be bound to argue as in [46], and then the desired ellipticity range \(1<a<1+\frac{2}{n}\) would not be reached. By the degenerate elliptic behaviour of the integrands, non-uniqueness of generalised minima and the overall lack of Korn’s inequality, the proof of Theorem 1.1 requires to overcome both technical and conceptual difficulties and is given in Section 4 below.
Once the presence of the singular parts \({\text {E}}\!^{s}u\) is ruled out for all\(u\in {\text {GM}}(F;u_{0})\), the boundary integrals in (1.3) are identified as the only source of non-uniqueness. This admits to apply more general principles (to be established in the Appendix, Section 8, with emphasis on the two-dimensional case) to draw conclusions on the structure of \({\text {GM}}(F;u_{0})\), cf. Section 4.6.
1.3 Partial \({\text {C}}^{1,\alpha }\)-Regularity of Minima
The second part of this paper is devoted to the partial (Hölder) regularity of generalised minima of F. We note that, essentially because the minimisation of F constitutes a vectorial problem, full Hölder continuity in general is not to be expected; see [41, 43, 63, 64] and the references therein. To streamline terminology, in this paper we say that a map \(v\in {\text {L}}_{{\text {loc}}}^{1}(\Omega ;{\mathbb {R}}^{n})\) is partially regular if there exists a relatively open subset \(\Omega _{u}\subset \Omega \) such that v is of class \({\text {C}}^{1,\alpha }\) in a neighbourhood of any of the elements of \(\Omega _{u}\) for any \(0<\alpha <1\).
There is an extensive literature on the topic of partial regularity and proof strategies, most notably the (indirect) blow-up method with roots in De Giorgi’s work [27] and the \({\mathcal {A}}\)-harmonic approximation method with roots in Almgren’s and Allard’s work in geometric measure theory [4, 5]. These proof strategies have been adapted to the setting of functionals of the type (1.2) with \(\varepsilon \) replaced by the full gradient, see [1, 2, 29, 30, 35, 62] for an incomplete list. For instance, even in the convex full-gradient linear growth case, indirect methods such as blow-up are difficult to implement by the relatively weak compactness properties of \({\text {BV}}\)as long as no additional Sobolev regularity is available. Appealing to Theorem 1.1, this is for example the case in the very degenerate regime \(a\geqq 1+\frac{2}{n}\). On the other hand, should an integrand degenerate completely for large values of the argument, one might still aim for a local-in-phase-space regularity result (in the terminology of Schmidt [70]).
To establish such a regularity theorem, and in turn being able to cover all degenerate ellipticities, we make use of a direct strategy using mollifications as comparison maps. Since, by Jensen’s inequality, mollifications can be suitably controlled by convex functions, this method is particularly designed for convex problems. Originally employed by Anzellotti and Giaquinta [10] in the full gradient context (also see the related result by Schmidt [69] for the model integrands \({\text {m}}_{p}(\cdot )=(1+|{\cdot }|^{p})^{\frac{1}{p}}\), \(p\ne 2\)), functionals (1.3) require a different treatment. First, now the decay of the comparison maps must appear as a consequence of a careful linearisation and hereafter Korn’s inequality in \({\text {L}}^{2}\). More importantly, the comparison argument forces us to control V-function type distances from a given generalised minimiser to its mollifications by the symmetric gradients only. While this is a consequence of the fundamental theorem of calculus in the \({\text {BV}}\)-context, the requisite estimates now must be accessed without appealing to the full gradients. This motivates the derivation of a novel family of convolution-type Poincaré inequalities in Section 5, which might be of independent interest. Lastly, the estimates of Section 5 necessitate a refined construction of good annuli in the partial regularity proof, where the key parts of the comparison are performed. A combination of these tools in Section 6 then yields an \(\varepsilon \)-regularity result (cf. Corollary 6.7), and implies the following second main result of the paper:
Theorem 1.2
(Local-in-phase-space regularity) Let \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) be convex and satisfy (LG). Given \(u_{0}\in {\text {LD}}(\Omega )\), let \(u\in {\text {GM}}(F;u_{0})\). If \((x_{0},z_{0})\in \Omega \times {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) is such that
and \(f''(z_{0})\) is positive definite, then there holds \(u\in {\text {C}}^{1,\alpha }(U;{\mathbb {R}}^{n})\) for a suitable open neighbourhood U of \(x_{0}\) for all \(0<\alpha <1\). As a consequence, if \(f''\) is positive definite everywhere on \({\mathbb {R}}_{{\text {sym}}}^{n\times n}\), then the singular set\(\Sigma _{u}\) of points in whose neighourhood u is not of class \({\text {C}}^{1,\alpha }\) for any \(0<\alpha <1\) satisfies \({\mathscr {L}}^{n}(\Sigma _{u})=0\), is relatively closed and is given by
Similarly as to \({\text {BV}}\)-theory, the importance of the previous theorem is manifested by its minimal assumptions regarding locality and (degenerate) ellipicity; in fact, no global uniform strong convexity needs to be imposed on f in order to yield the corresponding partial \({\text {C}}^{1,\alpha }\)-regularity result. Recalling the a-ellipticity scale (1.4), Theorem 1.2 thus particularly complements Theorem 1.1 in the very degenerate ellipticity regime \(1+\frac{2}{n}\leqq a <\infty \), cf. Figure 1. As a routine matter, however, strengthening the ellipticity to \(1<a<\frac{n}{n-1}\), Theorem 1.1 can be invoked to yield bounds on \(\dim _{{\mathscr {H}}}(\Sigma _{u})\)—cf. Corollary 4.6. We moreover note that the previous theorem equally proves interesting for radially symmetric integrands. Indeed, techniques to arrive at full\({\text {C}}^{1,\alpha }\)-regularity results in the full gradient setting (cf. Uhlenbeck [81], Ural’ceva [82] or Beck and Schmidt [15] in the \({\text {BV}}\)-context) are hard to be implemented: The symmetric gradient seems to destroy the beneficial structure of the corresponding Euler–Lagrange equations. As such, Theorem 1.2 seems hard to be generalised to the model integrands \({\text {m}}_{p}\) (revealing p-Laplacean type behaviour at the origin) for \(p\ne 2\), cf. Section 3 for a discussion. Finally, recalling the aim of a regularity result in the very degenerate ellipticity regime, Theorem 1.2 proves independent of the recent companion theorem [45] for strongly symmetric quasiconvex integrals by the author. Whereas the main difficulties in [45] stem from the weakened convexity notion, its application to convex integrands only yields a partial regularity theorem for a-elliptic integrands, \(1<a\leqq 3\). A discussion of these matters, together with possible generalisations of Theorems 1.1 and 1.2 is given in Section 7.
1.4 Organisation of the Paper
In Section 2 we fix notation, record basic definitions and auxiliary estimates. After a discussion of sample integrands in Section 3, we provide the proof of Theorem 1.1 and selected implications in Section 4. Section 5 provides convolution-type Poincaré inequalities to crucially enter the proof of Theorem 1.2 in Section 6. Section 7 discusses generalisations of the results of the paper, and the appendices, Sections 8 and 9, comprise selected uniqueness assertions and proofs of auxiliary results.
2 Preliminaries
2.1 General Notation and Background
We briefly comment on the notation used throughout. By \({\mathbb {R}}_{{\text {sym}}}^{n\times n}\) or \({\mathbb {R}}_{{\text {skew}}}^{n\times n}\) we denote the symmetric or skew-symmetric \((n\times n)\)-matrices with real entries. All finite dimensional vector spaces are equipped with the euclidean (or, in the matrix case, Frobenius) norm \(|{\cdot }|\), and the inner product on such spaces is denoted \(\langle \cdot ,\cdot \rangle \). Given \(a,b\in {\mathbb {R}}^{n}\), the symmetric tensor product is given by \(a\odot b :=\frac{1}{2}(ab^{{\mathsf {T}}}+ba^{{\mathsf {T}}})\). Given \(x_{0}\in {\mathbb {R}}^{n}\) and \(r>0\), the open ball of radius \(r>0\) centered at \(x_{0}\in {\mathbb {R}}^{n}\) is denoted \({\text {B}}(x_{0},r):=\{x\in {\mathbb {R}}^{n}:\;|x-x_{0}|<r\}\). For \(0<t<s<\infty \), we denote \({\mathcal {A}}(x_{0};t,s):={\text {B}}(x_{0},s){\setminus }{\text {B}}(x_{0},t)\) the annulus centered at \(x_{0}\) having outer and inner radii s and t, respectively. To distinguish from balls in matrix space, we write \({\mathbb {B}}(z,r):=\{y\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}:\;|y-z|<r\}\) for \(z\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) and \(r>0\). Cubes Q in \({\mathbb {R}}^{n}\) are tacitly assumed to be non-degenerate, and we denote by \(\ell (Q)\) their sidelengths. The n-dimensional Lebesgue and \((n-1)\)-dimensional Hausdorff measure are denoted \({\mathscr {L}}^{n}\) and \({\mathscr {H}}^{n-1}\), respectively. Accordingly, the Hausdorff dimension of a Borel set \(A\in {\mathscr {B}}({\mathbb {R}}^{n})\) is denoted \(\dim _{{\mathscr {H}}}(A)\). For \(u\in {\text {L}}_{{\text {loc}}}^{1}({\mathbb {R}}^{n};{\mathbb {R}}^{m})\) and an open set \(U\subset {\mathbb {R}}^{n}\) with \({\mathscr {L}}^{n}(U)<\infty \), we use the shorthand whereas, if \(U={\text {B}}(x,r)\) is ball, we abbreviate \((u)_{x,r}:=(u)_{{\text {B}}(x,r)}\). Moreover, for a given finite dimensional real vector space V, we denote \({\mathscr {M}}_{({\text {loc}})}(\Omega ;V)\) the V-valued (locally) finite Radon measures on (the open set) \(\Omega \). For \(\mu \in {\mathscr {M}}(\Omega ;V)\), its Lebesgue–Radon–Nikodým decomposition is given by \(\mu =\mu ^{a}+ \frac{\mathrm {d}\mu }{\mathrm {d}|\mu ^{s}|}|\mu ^{s}|\), where \(\mu ^{a}\ll {\mathscr {L}}^{n}\) and \(\mu ^{s}\bot {\mathscr {L}}^{n}\).
By \(c,C>0\) we denote generic constants whose value might change from line to line, and shall only be specified if their precise value is required.
2.2 Function Spaces and Integral Operators
In this section we give an overview of the requisite function spaces on which the main part is based. This comprises functions of bounded deformation, to be discussed in Section 2.2.1, as well as Orlicz and negative Sobolev spaces to be introduced and discussed in Sections 2.2.2 and 2.2.3.
2.2.1 Functions of Bounded Deformation
Let \(\Omega \subset {\mathbb {R}}^{n}\) be open and bounded. We then define \({\text {BD}}(\Omega )\) as the space of all \(u\in {\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})\) for which the total deformation
is finite; note that by writing \({\text {E}}\!u\) we indicate that the symmetric distributional gradient of u is a measure whereas by \(\varepsilon (u)\) we tacitly understand that it is representable by an \({\text {L}}^{1}\)-map. This space has been introduced in [26, 79] and studied from various perspectives in [6, 9, 11, 52, 53, 77]; unless stated otherwise, all of the following can be traced back to these references. Given \(u\in {\text {BD}}(\Omega )\), the Lebesgue–Radon–Nikodým decomposition of \({\text {E}}\!u\) reads as . Here, \({\mathscr {E}}u\) takes the rôle of the symmetric part of the approximate gradient (cf. [7] for this terminology).
Let \(u,u_{1},u_{2},\ldots \in {\text {BD}}(\Omega )\). We say that \(u_{k}{\mathop {\rightharpoonup }\limits ^{*}}u\) if and only if \(u_{k}\rightarrow u\) in \({\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})\) and \({\text {E}}\!u_{k}{\mathop {\rightharpoonup }\limits ^{*}} {\text {E}}\!u\) in \({\mathscr {M}}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\). If \(u_{k}{\mathop {\rightharpoonup }\limits ^{*}} u\) as just defined and \(|\!{\text {E}}\!u_{k}|(\Omega )\rightarrow |\!{\text {E}}\!u|(\Omega )\), then we say that \((u_{k})\) converges (symmetric) strictly to u. If, moreover, \(\sqrt{1+|{\text {E}}\!u_{k}|^{2}}(\Omega )\rightarrow \sqrt{1+|{\text {E}}\!u|^{2}}(\Omega )\) with
then we say that \((u_{k})\) converges (symmetric) area-strictly to u. These notions are usually reserved for the \({\text {BV}}\)-context, but as we deal with the \({\text {BD}}\)-situation exclusively we shall often omit the supplementary symmetric and simply speak of strict and area-strict convergence.
Now let \(\Omega \) have Lipschitz boundary \(\partial \Omega \). Both \({\text {LD}}(\Omega )\) and \({\text {BD}}(\Omega )\) then have trace space \({\text {L}}^{1}(\partial \Omega ;{\mathbb {R}}^{n})\); however, note that the trace operator onto \({\text {L}}^{1}(\partial \Omega ;{\mathbb {R}}^{n})\) is not continuous with respect to weak*-convergence on \({\text {BD}}(\Omega )\). In this case, continuity can only be achieved when \({\text {BD}}(\Omega )\) is equipped with strict convergence. Moreover, as \(\Omega \) has Lipschitz boundary, any \(u\in {\text {BD}}(\Omega )\) can be extended by zero to the entire \({\mathbb {R}}^{n}\) so that the trivial extension \({\overline{u}}\) again belongs to \({\text {BD}}({\mathbb {R}}^{n})\) and we have
where \(\nu _{\partial \Omega }\) is the outward unit normal to \(\partial \Omega \). Also, we have the Gauß-Green formula
for all \(u\in {\text {BD}}(\Omega )\) and all \(\varphi \in {\text {C}}^{1}({\overline{\Omega }};{\mathbb {R}}_{{\text {sym}}}^{n\times n})\); here, \({\text {div}}\) denotes the row-wise divergence. For latter applications, the following approximation result will turn out to be particularly useful:
Lemma 2.1
Let \(\Omega \subset {\mathbb {R}}^{n}\) be an open and bounded Lipschitz domain. Then for any \(u\in {\text {BD}}(\Omega )\) and any \(u_{0}\in {\text {LD}}(\Omega )\) there exists a sequence \((u_{k})\subset u_{0}+{\text {C}}_{c}^{\infty }(\Omega ;{\mathbb {R}}^{n})\) such that \(u_{k}\rightarrow u\) in \({\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})\) and
2.2.2 Korn- and Poincaré Inequalities in Lebesgue and Orlicz Spaces
To transfer integrability from \(\varepsilon (u)\) to the full gradients in a flexible space scale, we recall here Korn-type inequalities in Orlicz spaces; our notation is mainly taken from the recent work of Cianchi [23], cf. Acerbi and Mingione [3] and Breit and Diening [20] for related results.
Let \(A:[0,\infty )\rightarrow [0,\infty )\) be a Young function; by this we understand that \(A(t)=\int \nolimits _{0}^{t}a(\tau )\mathrm {d}\tau \) for \(t\geqq 0\), where \(a:[0,\infty )\rightarrow [0,\infty ]\) is non-decreasing, left-continuous and being neither identical to 0 nor \(\infty \). We then denote \({\text {L}}^{A}(\Omega ;{\mathbb {R}}^{m})\) the linear space of all measurable maps \(u:\Omega \rightarrow {\mathbb {R}}^{m}\) such that the Luxembourg norm
is finite. We then define \(E^{1}A(\Omega )\) as the space of all \(u\in {\text {L}}^{A}(\Omega ;{\mathbb {R}}^{n})\) such that the distributional symmetric gradient belongs to \({\text {L}}^{A}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\). As examples, if \(A(t)=|t|\), then \(E^{1}A({\mathbb {R}}^{n})={\text {LD}}({\mathbb {R}}^{n})\), if \(A(t)=|t|^{p}\) for \(1<p<\infty \), then \(E^{1}A({\mathbb {R}}^{n})={\text {W}}^{1,p}({\mathbb {R}}^{n};{\mathbb {R}}^{n})\). It is worth noting that the Young function \(A(t):=t\log (1+t)\) displays a borderline case: For \(\alpha \geqq 0\), the general conclusion
persists if and only if \(\alpha \geqq 1\); hence, briefly recalling the \(L\log L\)-setup mentioned in the introduction, variational problems with symmetric gradients belonging to \(L\log L\) are essentially dealt with in \({\text {W}}^{1,1}\). Namely, by the Smith representation [76] to be used in a different context later on, \(u=(u^{1},\ldots ,u^{n})\in {\text {C}}_{c}^{\infty }({\mathbb {R}}^{n};{\mathbb {R}}^{n})\) can be retrieved from \(\varepsilon (u)=(\varepsilon _{ij}(u))_{i,j=1}^{n}\) via
for all \(k\in \{1,\ldots ,n\}\), where \(K_{ij}(x):=x_{i}x_{j}/|x|^{n}\) for \(x\in {\mathbb {R}}^{n}{\setminus }\{0\}\). The convolutions here are understood in the Cauchy principal value sense, and so the map \(\Phi :\varepsilon (u)\mapsto \nabla u\) displays a singular integral of convolution type satisfying the usual Hörmander condition. Then (2.3) follows from the theory of singular integrals on Orlicz spaces, cf. [23]. For the what follows, let us remind the reader of the space of rigid deformations
which, for open and connected \(\Omega \), is precisely the nullspace of \(\varepsilon \). Since elements of \({\mathscr {R}}(\Omega )\) are polynomials, we shall often identify \({\mathscr {R}}(\Omega )\) with \({\mathscr {R}}({\mathbb {R}}^{n})\).
Lemma 2.2
([23, Theorem 3.3, Corollary 3.4, Example 3.11]) Let \(\Omega \subset {\mathbb {R}}^{n}\) be an open, bounded and connected Lipschitz domain. Then the following holds:
-
(a)
For each \(1<p<\infty \) there exists \(c=c(p,n,\Omega )>0\) such that
$$\begin{aligned} \inf _{\pi \in {\mathscr {R}}(\Omega )}\Vert \nabla (v-\pi )\Vert _{{\text {L}}^{p}(\Omega ;{\mathbb {R}}^{n\times n})}\leqq c\Vert \varepsilon (v)\Vert _{{\text {L}}^{p}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})}\quad \text {for all}\; v\in {\text {W}}^{1,p}(\Omega ;{\mathbb {R}}^{n}). \end{aligned}$$Moreover, for all \(v\in {\text {W}}^{1,p}(\Omega ;{\mathbb {R}}^{n})\) there holds
$$\begin{aligned} \Vert \nabla v - (\nabla v)_{\Omega }\Vert _{{\text {L}}^{p}(\Omega ;{\mathbb {R}}^{n\times n})}\leqq c\Vert \varepsilon (v)-(\varepsilon (v))_{\Omega }\Vert _{{\text {L}}^{p}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})}. \end{aligned}$$ -
(b)
For each \(\beta >0\) there exists \(c=c(\beta ,n,\Omega )>0\) such that
$$\begin{aligned}&\inf _{\pi \in {\mathscr {R}}(\Omega )}\Vert \nabla (v-\pi )\Vert _{\exp {\text {L}}^{\frac{\beta }{\beta +1}}(\Omega ;{\mathbb {R}}^{n\times n})}\\&\qquad \leqq c\Vert \varepsilon (v)\Vert _{\exp {\text {L}}^{\beta }(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})}\qquad \text {for all}\;v\in E^{1}\exp {\text {L}}^{\beta }(\Omega ), \end{aligned}$$where \(\exp {\text {L}}^{\beta }(\Omega )\) is the Orlicz space corresponding to \(A(t):=\exp (t^{\beta })\).
In the sequel, we gather some instrumental results on certain projection operators and augment (2.4) by a decomposition result due to Reshetnyak [67]. Note that, since \({\mathscr {R}}({\text {B}}(0,1))\) is a finite dimensional vector space, all norms are equivalent on\({\mathscr {R}}({\text {B}}(0,1))\). Thus, by scaling, we find that for each \(1\leqq q < \infty \) there exists a constant \(c(n,q)>0\) such that for all \(x_{0}\in {\mathbb {R}}^{n}\) and \(r>0\) it holds that
for all \(\pi \in {\mathscr {R}}({\text {B}}(x_{0},r))\). The same inequality holds true with the obvious modifications for if \(q=\infty \) on the left-hand side. Moreover, there exists a bounded linear projection operator \(\Pi _{{\text {B}}(x_{0},r)}:{\text {L}}^{1}({\text {B}}(x_{0},r);{\mathbb {R}}^{n})\ni u\mapsto \pi _{u}\in {\mathscr {R}}({\text {B}}(x_{0},r))\) satisfying
for all \(u\in {\text {L}}^{q}({\text {B}}(x_{0},r);{\mathbb {R}}^{n})\) and each \(1\leqq q <\infty \); see the appendix, Section 9.1, for an elementary proof. A similar result holds for cubes Q instead of balls, and we shall refer to this property as \({\text {L}}^{q}\)-stability of \(\Pi _{{\text {B}}(x_{0},r)}\) or \(\Pi _{Q}\), respectively. In a routine manner, the foregoing now yields the next lemma which should be well-known, but is hard to be found in the following form:
Lemma 2.3
(Projections in Poincaré- and Korn-type inequalities) Let \(1\leqq p < \infty \), \(x_{0}\in {\mathbb {R}}^{n}\) and \(r>0\). For each \(1\leqq q\leqq p\) there exists a constant \(c=c(n,q)>0\) such that for all \(u\in {\text {W}}^{1,p}({\text {B}}(x_{0},r);{\mathbb {R}}^{n})\) there exists \(\pi _{u}\in {\mathscr {R}}({\text {B}}(x_{0},r))\) such that
In particular, the map \(\Pi _{{\text {B}}(x_{0},r)} u\mapsto \pi _{u}\) is independent of q. The same holds true if we set \(q=1\) and replace \({\text {W}}^{1,1}({\text {B}}(x_{0},r);{\mathbb {R}}^{n})\) by \({\text {LD}}({\text {B}}(x_{0},r))\). Moreover, if \(1<q\leqq p\), then there exists a constant \(c=c(n,q)>0\) such that for all \(u\in {\text {W}}^{1,p}({\text {B}}(x_{0},r);{\mathbb {R}}^{n})\) it holds that (with the same \(\pi _{u}\) as above)
Moreover, the map \(\Pi _{{\text {B}}(x_{0},r)}:u\mapsto \pi _{u}\) is \({\text {L}}^{q}\)-stable for each \(1\leqq q \leqq p\) in the above sense.
Clearly, a similar version holds for cubes. The former lemma is a consequence of the following result due to Reshetnyak [67] which we shall need independently. As we will exclusively use it for cubes, we directly state it in the following form:
Lemma 2.4
(Reshetnyak, [67]) For any open, non-empty cube \(Q\subset {\mathbb {R}}^{n}\), there exists a projection \({\widetilde{\Pi }}_{Q}:{\text {C}}^{\infty }(Q;{\mathbb {R}}^{n})\cap {\text {LD}}(Q)\rightarrow {\mathscr {R}}(Q)\) and an operator \(T_{Q}:({\text {C}}^{\infty }\cap {\text {L}}^{1})(Q;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\rightarrow {\text {L}}^{1}(Q;{\mathbb {R}}^{n})\) such that for any \(v\in {\text {LD}}(Q)\) there holds
Moreover, the operator \(T_{Q}\) is of the form
where \(R_{Q}:Q\times Q\rightarrow {\mathscr {L}}({\mathbb {R}}_{{\text {sym}}}^{n\times n};{\mathbb {R}}^{n})\) satisfies \(|R_{Q}(x,y)|\leqq c/|x-y|^{n-1}\) for all \(x,y\in Q\) with \(c=c(n)>0\).
2.2.3 Negative Sobolev Spaces
The viscosity approximation strategy to be set up in Section 4 shall require certain negative Sobolev spaces in a crucial manner. As shall become clear later, we have to go beyond the space \({\text {W}}^{-1,1}\) as introduced in [14]. Given \(k\in {\mathbb {N}}\), we define the space \({\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})\) as follows:
The linear space \({\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})\) is canonically endowed with the norm
the infimum ranging over all representations \(T=\sum _{|\alpha |\leqq k}\partial ^{\alpha }T_{\alpha }\) with \(T_{\alpha }\in {\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})\). Similar as for \({\text {W}}^{-1,1}(\Omega ;{\mathbb {R}}^{n})\) as discussed in [14], \({\text {W}}^{-k,1}\) is not approachable by duality. We collect its most important properties in the following lemma:
Lemma 2.5
Let \(\Omega \subset {\mathbb {R}}^{n}\) be open and let \(k\in {\mathbb {N}}\) be given. Then the following holds:
-
(a)
\(({\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n}),\Vert \cdot \Vert _{{\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})})\) is a Banach space.
-
(b)
For every \(u\in {\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})\) and every \(\beta \in {\mathbb {N}}_{0}^{n}\) with \(|\beta |\leqq k\) it holds that
$$\begin{aligned} \Vert \partial ^{\beta }u\Vert _{{\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})}\leqq \Vert u\Vert _{{\text {W}}^{|\beta |-k,1}(\Omega ;{\mathbb {R}}^{n})}. \end{aligned}$$
Proof
In view of (a), we closely follow [14] and consider the mapping \(\Phi :{\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})^{N} \ni (T_{\alpha })_{|\alpha |\leqq k} \mapsto \sum _{|\alpha |\leqq k}\partial ^{\alpha }T_{\alpha }\in {\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})\), where \(N:=\#\{\alpha \in {\mathbb {N}}_{0}^{n}:\;|\alpha |\leqq k\}\). By definition of \({\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})\), \(\Phi \) is a bounded linear operator and thus \(\ker (\Phi )\) is a Banach space in itself. By definition of the quotient norm, the canonical quotient map \(\Psi :{\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})^{N}/\ker (\Phi )\rightarrow {\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})\) is surjective and isometric. Thus, as \(\ker (\Phi )\) is Banach, so is \({\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})^{N}/\ker (\Phi )\) and eventually, as the isometric image of a Banach space, \(({\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n}),\Vert \cdot \Vert _{{\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})})\). For (b), let \(\varepsilon >0\) and choose \((T_{\alpha })_{\alpha }\in {\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})^{N}\) such that \(u=\sum _{|\alpha |\leqq k-|\beta |}\partial ^{\alpha }T_{\alpha }\) and
On the other hand, \(\partial ^{\beta }u=\sum _{|\alpha |\leqq k-|\beta |}\partial ^{\alpha +\beta }T_{\alpha }=:\sum _{|\gamma |\leqq k}\partial ^{\gamma }S_{\gamma }\), where \(S_{\gamma }=T_{\alpha }\) if \(\gamma =\alpha +\beta \) for some \(\alpha \) with \(|\alpha |\leqq k-|\beta |\) and \(S_{\gamma }=0\) otherwise. Therefore,
and we then send \(\varepsilon \searrow 0\) to conclude the proof. \(\quad \square \)
Next, a lower semicontinuity result in the spirit of [46, Lemma 3.2], [14, Lemma 2.6]:
Lemma 2.6
Let \(1<q<\infty \), \(k\in {\mathbb {N}}\) be given and let \(\Omega \) be open and bounded with Lipschitz boundary \(\partial \Omega \). Suppose that \({\mathfrak {f}}:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) is a convex function that satisfies \(c^{-1}|z|^{q}-d\leqq {\mathfrak {f}}(z)\leqq c(1+|z|^{q})\) for some \(c,d>0\) and all \(z\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\). Then, for every \(u_{0}\in {\text {W}}^{1,q}(\Omega ;{\mathbb {R}}^{n})\), the functional
is lower semicontinuous for the norm topology on \({\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})\).
Proof
Let \(g,g_{1},g_{2},\ldots \in {\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})\) be such that \(g_{m}\rightarrow g\) with respect to the norm topology on \({\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})\). If \(\liminf _{m\rightarrow \infty }{\mathcal {F}}[g_{m}]=+\infty \), there is nothing to prove. Hence assume without loss of generality that \(\lim _{j\rightarrow \infty }{\mathcal {F}}[g_{m(j)}]=\liminf _{m\rightarrow \infty }{\mathcal {F}}[g_{m}]<\infty \). Then necessarily \(g_{m(j)}\in {\mathscr {D}}_{u_{0}}\) for all sufficiently large indices j and, since \(c^{-1}|z|^{q}-d\leqq {\mathfrak {f}}(z)\) for all \(z\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\), we obtain that \((\varepsilon (g_{m(j)}))\) is bounded in \({\text {L}}^{q}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\). Since \(g_{m(j)}\in {\mathscr {D}}_{u_{0}}\) and \(q>1\), Korn’s inequality in \({\text {W}}_{0}^{1,q}(\Omega ;{\mathbb {R}}^{n})\) implies that \((g_{m(j)})\) is uniformly bounded in \({\text {W}}^{1,q}(\Omega ;{\mathbb {R}}^{n})\). Since \(1<q<\infty \), there exists a subsequence \((g_{m(j(i))})\subset (g_{m(j)})\) which converges weakly in \({\text {W}}^{1,q}(\Omega ;{\mathbb {R}}^{n})\) to some \({\widetilde{g}}\in {\mathscr {D}}_{u_{0}}\) (note that \({\mathscr {D}}_{u_{0}}\) is weakly closed in \({\text {W}}^{1,q}(\Omega ;{\mathbb {R}}^{n})\)). By the Rellich–Kondrachov theorem, we can moreover assume that \(g_{m(j(i))}\rightarrow {\widetilde{g}}\) strongly in \({\text {L}}^{q}(\Omega ;{\mathbb {R}}^{n})\). Then, since \({\text {L}}^{q}(\Omega ;{\mathbb {R}}^{n})\hookrightarrow {\text {W}}^{-k,1}(\Omega ;{\mathbb {R}}^{n})\) by Lemma 2.5(b),
and thus \(g={\widetilde{g}}\). By standard results on lower semicontinuity of convex variational integrals of superlinear growth (or, alternatively, Reshetnyak’s lower semicontinuity theorem, Theorem 2.9 below) \(\varepsilon (g_{m(j(i))}){\mathscr {L}}^{n}{\mathop {\rightharpoonup }\limits ^{*}}\varepsilon (g){\mathscr {L}}^{n}\) as \(i\rightarrow \infty \) thus yields
The proof is complete. \(\quad \square \)
2.3 The Ekeland Variational Principle
In this section we recall a variant of the Ekeland variational principle [33] that is suitable for our purposes. The version which we state here is a merger of [43, Theorem 5.6, Remark 5.5].
Proposition 2.7
Let (V, d) be a complete metric space and let \({\mathcal {F}}:V\rightarrow {\mathbb {R}}\cup \{\infty \}\) be a lower semicontinuous function (for the metric topology) which is bounded from below and takes a finite value at some point. Suppose that, for some \(u\in V\) and some \(\varepsilon >0\), there holds \({\mathcal {F}}[u]\leqq \inf {\mathcal {F}}[V]+\varepsilon \). Then there exists \(v\in V\) such that
-
(a)
\(d(u,v)\leqq \sqrt{\varepsilon }\),
-
(b)
\({\mathcal {F}}[v]\leqq {\mathcal {F}}[u]\),
-
(c)
for all \(w\in V\) there holds \({\mathcal {F}}[v]\leqq {\mathcal {F}}[w] + \sqrt{\varepsilon }d(v,w)\).
2.4 Functions of Measures and Convolutions
In this section we collect background facts on linear growth integrands and functionals of the form (1.2). We begin with
Lemma 2.8
Suppose that \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) is convex and satisfies (LG) with \(c_{1},c_{2},\gamma >0\). Then f is Lipschitz with \({\text {Lip}}(f)\leqq c_{2}\).
The proof of the preceding lemma evolves in the same way as [43, Lemma 5.2]; the reader might notice that for the conclusion of Lemma 2.8 it is sufficient that f is symmetric rank-one convex—so convex with respect to directions \(a\odot b\), \(a,b\in {\mathbb {R}}^{n}\)—and satisfies (LG). As to (lower semi)continuity, we shall mostly rely on the following theorem due to Reshetnyak [66] (see [7, 8, 66] for more information on functions of measures):
Theorem 2.9
(Reshetnyak (lower semi-)continuity) Let V be a finite dimensional real vector space and let \((\nu _{j})\) be a sequence in \({\mathscr {M}}(\Omega ;V)\) that converges in the weak*-sense to some \(\nu \in {\mathscr {M}}(\Omega ;V)\). Moreover, assume that all of \(\nu ,\nu _{1},\nu _{2},\ldots \) take values in some closed convex cone \(K\subset V\). Then the following holds:
-
(a)
If \(g :K \rightarrow {\mathbb {R}}_{\geqq 0}\cup \{+\infty \}\) is lower semicontinuous, convex and 1-homogeneous, then it holds that
$$\begin{aligned} \int \nolimits _{\Omega } g\Big (\frac{\mathrm {d}\nu }{\mathrm {d}|\nu |}\Big )\mathrm {d}|\nu | \leqq \liminf _{j\rightarrow \infty }\int \nolimits _{\Omega }g\Big (\frac{\mathrm {d}\nu _{j}}{\mathrm {d}|\nu _{j}|}\Big )\mathrm {d}|\nu _{j}|. \end{aligned}$$ -
(b)
If \(g :K \rightarrow {\mathbb {R}}_{\geqq 0}\cup \{+\infty \}\) is continuous, 1-homogeneous and if \((\nu _{j})\) converges strictly to \(\nu \) (in the sense that \(\nu _{j}{\mathop {\rightharpoonup }\limits ^{*}}\nu \) and \(|\nu _{j}|(\Omega )\rightarrow |\nu |(\Omega )\)), then it holds that
$$\begin{aligned} \int \nolimits _{\Omega }g\Big (\frac{\mathrm {d}\nu }{\mathrm {d}|\nu |}\Big )\mathrm {d}|\nu | = \lim _{j\rightarrow \infty }\int \nolimits _{\Omega }g\Big (\frac{\mathrm {d}\nu _{j}}{\mathrm {d}|\nu _{j}|}\Big )\mathrm {d}|\nu _{j}|. \end{aligned}$$
Given a lower semicontinuous, convex function \(h:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}_{\geqq 0}\), we put \(V:={\mathbb {R}}\times {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) and introduce the linear perspective integrand\(h^{\#}:{\mathbb {R}}_{\geqq 0}\times {\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\cup \{+\infty \}\) by
where \(h^{\infty }(\xi )=\lim _{t\searrow 0}th(\frac{\xi }{t})\) so that \(h^{\#}\) is positively 1-homogeneous. Also, if h has linear growth, then \(h^{\#}<\infty \). We put \(K:={\mathbb {R}}_{\geqq 0}\times {\mathbb {R}}_{{\text {sym}}}^{n\times n}\). For \(\Omega \subset {\mathbb {R}}^{n}\) open and \(\mu \in {\mathscr {M}}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\), we put \(\nu :=({\mathscr {L}}^{n},\mu )\in {\mathscr {M}}(\Omega ;K)\) and define for \(A\in {\mathscr {B}}(\Omega )\)
In particular, if \(u,u_{1},u_{2},\ldots \in {\text {BD}}(\Omega )\) are such that \(u_{j}\rightarrow u\) symmetric area-strictly in \({\text {BD}}(\Omega )\) and \(f:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}_{\geqq 0}\) satisfies (LG), then \(f[{\text {E}}\!u_{j}](\Omega )\rightarrow f[{\text {E}}\!u](\Omega )\).
For \(\mu \in {\mathscr {M}}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\) and \(\xi _{0}\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\), we use the convention
As for \({\text {L}}_{{\text {loc}}}^{1}\)-maps, we define the average of \(\mu \in {\mathscr {M}}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\) over \({\text {B}}(x_{0},r)\subset \Omega \) by
By the Lebesgue differentiation theorem for Radon measures, \({\mathscr {L}}^{n}\)-almost everywhere \(x_{0}\in {\mathbb {R}}^{n}\) is a Lebesgue point for \(\mu \) in the sense that there exists \(\xi _{0}\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) such that
The Jensen inequality here takes the following form, cf. [69, Lemma 4.12]: If \(h:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}_{\geqq 0}\) is convex, then
For future applications in Section 5 and 6, we call a compactly supported, radial function \(\rho :{\mathbb {R}}^{n}\rightarrow [0,1]\) a standard mollifier provided \(\Vert \rho \Vert _{{\text {L}}^{1}({\mathbb {R}}^{n})}=1\), \({\text {spt}}(\rho )\subset \overline{{\text {B}}(0,1)}\) and \(\rho \) is of class \({\text {C}}^{\infty }\) in \({\text {B}}(0,1)\). Given \(\varepsilon >0\), we then define the \(\varepsilon \)-rescaled variant by \(\rho _{\varepsilon }(x):=\varepsilon ^{-n}\rho (\tfrac{x}{\varepsilon })\). As a consequence of (2.14), whenever \(\mu \in {\mathscr {M}}_{{\text {loc}}}({\mathbb {R}}^{n};{\mathbb {R}}_{{\text {sym}}}^{n\times n})\) and \(\varepsilon >0\),
Below, we shall particularly work with the following two choices \(\rho ^{(1)},\rho ^{(2)}:{\mathbb {R}}^{n}\rightarrow {\mathbb {R}}\):
where \(\gamma _{n}\) is adjusted in a way such that \(\Vert \rho ^{(2)}\Vert _{{\text {L}}^{1}({\mathbb {R}}^{n})}=1\). Given \(u\in {\text {L}}_{{\text {loc}}}^{1}(\Omega ;{\mathbb {R}}^{n})\) and \(\mu \in {\mathscr {M}}_{{\text {loc}}}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\), we put
for \(\varepsilon >0\). Upon straightforward modification, the proof of [10, Lemma 5.2] then implies
Lemma 2.10
Let \(\mu \in {\mathscr {M}}_{{\text {loc}}}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\) and let \(x_{0}\in \Omega \), \(R>0\) be such that \({\text {B}}(x_{0},R)\Subset \Omega \). Moreover, let \(\varepsilon >0\) satisfy \(\varepsilon <\frac{R}{2}\). Then for any convex integrand \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n};{\mathbb {R}}_{\geqq 0})\) with (LG) the following holds:
-
(a)
If \(0<t_{1}<t_{2}<R-2\varepsilon \), then there exists \(t\in (t_{1},t_{2})\) such that
$$\begin{aligned} f[\mu _{\varepsilon ,\varepsilon }]({\text {B}}(x_{0},t))-f[\mu ]({\text {B}}(x_{0},t))\leqq \frac{4\varepsilon }{t_{2}-t_{1}}f[\mu ]({\text {B}}(x_{0},R)). \end{aligned}$$ -
(b)
If \(R/2<t_{1}<t_{2}<R-2\varepsilon \) and \(0<r<R/4\), then there exist \(r'\in (r,2r)\) and \(t'\in (t_{1},t_{2})\) such that, adopting the annulus notation of Section 2.1,
$$\begin{aligned} f[\mu _{\varepsilon ,\varepsilon }]({\mathcal {A}}(x_{0},r',t'))-f[\mu ]({\mathcal {A}}(x_{0},r',t'))\leqq 4\varepsilon \left( \frac{1}{t_{2}-t_{1}}+\frac{1}{r}\right) f[\mu ]({\text {B}}(x_{0},R)). \end{aligned}$$
2.5 Estimates on V-Functions and Shifted Integrands
We now collect estimates on auxiliary V-functions to be dealt with later. To this end, we define for \(z\in {\mathbb {R}}^{m}\) the auxiliary reference integrand
The functions V will help to define our excess quantity later on, and we record
Lemma 2.11
For every \(m\in {\mathbb {N}}\), all \(z,z'\in {\mathbb {R}}^{m}\) and \(t\geqq 0\) the following holds:
-
(a)
\(V(tz)\leqq 4\max \{t,t^{2}\}V(z)\),
-
(b)
\(V(z+z')\leqq 2(V(z)+V(z'))\),
-
(c)
\((\sqrt{2}-1)\min \{|z|,|z|^{2}\}\leqq V(z)\leqq \min \{|z|,|z|^{2}\}\),
-
(d)
and for every \(\ell >0\) there exists a constant \(c=c(\ell )>0\) such that if \(|z|\leqq \ell \), then
$$\begin{aligned} \tfrac{1}{c}|z|^{2}\leqq V(z) \leqq c|z|^{2}. \end{aligned}$$
All assertions (b)–(d) are contained in [47, Section 2.4, Equation (2.4)], [10, Proposition 2.5], easily implying (a). We conclude this preliminary section with estimates on shifted integrands. To this end, let \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) be an integrand satisfying (LG). Given \(a\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\), we define the shifted or linearised integrands \(f_{a}:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) by
We state the next lemma in a form that is directly applicable to our future objectives:
Lemma 2.12
Let \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n};{\mathbb {R}}_{\geqq 0})\) be convex and satisfy (LG). Moreover, let \(\xi _{0}\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) and \(0<\varrho _{\xi _{0}}<1\) be such that
Then for all matrices \(a\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) with \({\mathbb {B}}(a,\tfrac{\varrho _{\xi _{0}}}{2})\subset {\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}})\) the following holds:
-
(a)
\(f_{a}\) is convex with \(f_{a}(0)=0\) and \(f'_{a}(0)=0\). Moreover, \(f_{a}\geqq 0\).
-
(b)
For all \(\xi \in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) we have, with \(c(\tfrac{\varrho _{\xi _{0}}}{2})>0\) as in Lemma 2.11(d)
$$\begin{aligned} m_{\xi _{0},\varrho _{\xi _{0}}}\Big (\frac{\varrho _{\xi _{0}}}{2}\Big )^{2}V(\xi ) \leqq f_{a}(\xi ) \leqq \Big (c(\tfrac{\varrho _{\xi _{0}}}{2})\sup _{{\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}})}|f''| +\frac{16{\text {Lip}}(f)}{(\sqrt{2}-1)\varrho _{\xi _{0}}}\Big )V(\xi ). \end{aligned}$$
The elementary proof of the preceding lemma is deferred to the appendix, Section 9.2.
3 Examples of Integrands and Limitations
In this quick intermediate section we present and discuss several sample integrands that underline the applicability and limitations of the main results of the present paper. Here, a scale of integrands \((\Phi _{a})_{1<a<\infty }\) is given by
Then, essentially by [19, Example 3.9 and 4.17], \(\Phi _{a}\) is a-elliptic and notb-elliptic for any \(1<b<a\). Such integrands are covered by Theorem 1.1 for if \(1<a<1+\frac{2}{n}\), and by Theorem 1.2 for all \(1<a<\infty \). The latter theorem particularly includes the example of the area integrand \(E(\xi ):=\sqrt{1+|\xi |^{2}}\), being 3-elliptic; recall that we dispense with 3-elliptic integrands in the framework of Theorem 1.1 as we do not have justification of generalised minima belonging to \({\text {L}}_{{\text {loc}}}^{\infty }\) for such integrands—a condition which is usually required for Sobolev regularity in the full gradient situation, too.
An intermediate class of integrands is given by \(({\text {M}}_{p})_{1\leqq p < \infty }\) defined by
These integrands are \(a=3\)-elliptic for if \(p=1\), and \(a=p+1\)-elliptic for if \(p>1\), cf. [14, Section 3.1]. However, integrands that indeed fall outside the scope of the paper are the linear growth integrands \(({\text {m}}_{p})_{1<p<\infty }\) given by \({\text {m}}_{p}(\xi ):=(1+|\xi |^{p})^{\frac{1}{p}}\) unless \(p=2\); cf. Schmidt [69, 70, p. 7] for the proof. In fact, if \(1<p<2\), then \({\text {m}}''_{p}(z)\) blows up as \(|z|\searrow 0\) and if \(2<p<\infty \), then \({\text {m}}''_{p}(0)=0\). In these situations, Theorem 1.2 applies only if \(u\in {\text {GM}}(F;u_{0})\) satisfies \(\inf _{\Omega }|\varepsilon (u)|>0\). Namely, if \(|z|\searrow 0\), then \({\text {m}}_{p}\) exhibits the behaviour of the p-Dirichlet energies and, as to partial regularity, forces to employ a p-harmonic comparison strategy. Whereas this does work well in the full gradient case [69] following the works of Duzaar and Mingione [31, 32], the requisite comparison estimates in the symmetric gradient context seem to be not available at present.
4 Local \(\mathbf{W}^{\mathbf{1,1}}\)-Regularity and the Proof of Theorem 1.1
In this section we establish the \({\text {W}}_{{\text {loc}}}^{1,1}\)–regularity result asserted by Theorem 1.1. Here we employ a refined version of a vanishing viscosity approach, to be set up in Section 4.2, with the ultimate objective to obtain suitable second order estimates in Section 4.4. In Section 4.5 we then establish Theorem 1.1 and collect selected implications in Section 4.6, thereby completing the lower three regularity assertions gathered in Figure 1.
4.1 Strategy and Obstructions
We start by clarifying the underlying obstructions first, thereby motivating the particular setup of the proof. For f is convex, the higher Sobolev regularity of Theorem 1.1 is usually accessed through the Euler–Lagrange system satisfied by \(u\in {\text {GM}}(F;u_{0})\). On the other hand, as \({\text {E}}\!u\) is a finite \({\mathbb {R}}_{{\text {sym}}}^{n\times n}\)-valued Radon measure, the relevant Euler–Lagrange system needs to be understood in the sense of Anzellotti [8], containing the gradient of the positively homogeneous recession function \(f^{\infty }\). Note that \(f^{\infty }\) essentially ignores the specific ellipticity of f (for example, with the integrands \({\text {m}}_{p}\) from the previous section, \({\text {m}}_{p}^{\infty }=|{\cdot }|\) for all\(1 \leqq p <\infty \)), and hence it is difficult to extract the relevant higher integrability as long as the presence of \({\text {E}}\!^{s}u\) is not ruled out per se. Equally, this also explains why directly working on the minima is in fact a useful device for the partial regularity to be addressed in Section 6; we here essentially restrict ourselves to neighbourhoods of points where \({\text {E}}\!^{s}u\) is assumed to vanish, cf. Theorem 1.2.
To overcome this issue in view of higher Sobolev regularity, one is led to consider good minimising sequences, usually obtained by a vanishing viscosity approach, and derive the requisite compactness estimates. As it is common in the case of degenerate p-growth functionals with \(1<p<2\), the original functionals are stabilised by adding quadratic Dirichlet energies \(\frac{1}{j}\Vert \nabla v\Vert _{{\text {L}}^{2}}^{2}\). The minima \(v_{j}\) of the correspondingly perturbed functionals then are proven to converge (up to a subsequence) to a minimiser v of the original functional, and uniform regularity restimates on the \(v_{j}\)’s carry over to v. When the p-growth integrand f (for \(p>1\)) is strongly convex, hence strictly convex, minima are unique and so this method in particular leads to the requisite regularity estimates for all minima. In the linear growth setting, cf. (LG), the recession function \(f^{\infty }\) is positively 1-homogeneous and thus never strictly convex despite possible a-ellipticity (and hereafter strict convexity) of f. Since, by the representation (1.3) of the weak*-relaxed functional \({\overline{F}}_{u_{0}}[-;\Omega ]\), the recession function acts on the singular part \({\text {E}}\!^{s}u\) of \(u\in {\text {GM}}(F;u_{0})\) exclusively, even strict convexity of f does not imply uniqueness of generalised minima—which cannot be expected in general anyway, compare the counterexamples in the more classical \({\text {BV}}\)-case [37, 68]. A vanishing viscosity approach as outlined above thus is only able to yield Sobolev regularity for at most one generalised minimiser as long as generalised minima are not known to belong to\({\text {LD}}_{({\text {loc}})}\).
In particular, based on this approach, we cannot rule out the existence of other, more irregular generalised minima. A similar issue has been encountered by Beck and Schmidt [14] in the \({\text {BV}}\)-setting and by Kristensen and the author [46]. To circumvent this issue, we adapt and extend the modified vanishing viscosity approaches outlined in [14, 46]. Effectively, we start from an arbitrary given generalised minimiser \(u\in {\text {GM}}(F;u_{0})\) and construct a suitable minimising sequence \((v_{j})\) that converges to u in the weak*-sense in \({\text {BD}}(\Omega )\). To do so, we consider an extension of a perturbed version of F to a suitable negative Sobolev space for whose topology the perturbed functional turns out lower semicontinuous. Then Ekeland’s variational principle provides us with an ’almost minimiser’ of the perturbed functional, cf. Section 4.2. Such almost minimisers satisfy Euler–Lagrange differential inequalities which make elliptic estimates available. Finally, these almost minimisers are shown to converge in the weak*-sense to the given generalised minimiser, and uniform regularity estimates will eventually inherit to the latter.
In the setting of functionals on \({\text {BV}}\) as considered in [14], perturbations in \({\text {W}}^{-1,1}\) are sufficient. This is due to the fact that the full gradients of generalised minima are a priori known to exist as finite Radon measures. As discussed at length in [44, 46], the implementation of the underlying difference quotient approach in the setup of functionals (1.2) leads to terms of the form
where \(\rho :\Omega \rightarrow [0,1]\) is a localisation function and, given \(v:{\mathbb {R}}^{n}\rightarrow {\mathbb {R}}^{n}\), \(h\ne 0\) and \(s\in \{1,\ldots ,n\}\),
denotes the difference quotient of v. Here, \((u_{j})\) is a suitable minimising sequence converging to u in the weak*-sense. In the \({\text {BV}}\)-setting (in which case the symmetric gradients in the definition of T are replaced by the full ones), the term T can be controlled by \(\Vert \nabla u_{j}\Vert _{{\text {L}}^{1}}\). As T is a priori not controllable by Ornstein’s Non-Inequality in the \({\text {BD}}\)-situation, Kristensen and the author [46] employ fractional estimates in order to avoid the appearance of T, simultaneously perturbing in the space \(({\text {W}}_{0}^{1,\infty })^{*}(\Omega ;{\mathbb {R}}^{n})\). The latter method, being based on the embedding \({\text {BD}}(\Omega )\hookrightarrow {\text {W}}^{s,n/(n-1+s)}(\Omega ;{\mathbb {R}}^{n})\) for \(0<s<1\) then yields weighted Nikolskiĭ estimates (and thus \({\text {W}}^{\alpha ,1}\)-estimates for some suitable \(0<\alpha <1\)) for the symmetric gradients of generalised minima. However, this only yields the smaller range of ellipticities \(1<a<1+\frac{1}{n}\). Still, since \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\), generalised minima should be expected to satisfy a differentiable Euler–Lagrange equation and hence the use of fractional methods does not give the expected optimal ellipticity range \(1<a<1+\frac{2}{n}\). In order to obtain the latter, it seems that we are bound to obtain uniform weighted second order estimates in the spirit of Bildhauer [19, Lemma 4.19] or Beck and Schmidt [14, Lemma 5.2]. Unlike the full gradient case, the requisite second estimates do not come out by a plain difference quotient approach but a fine analysis of the identities provided by suitably weakly perturbed Euler–Lagrange systems, see Theorem 4.3 below. In view of this aim, it turns out that the suitable perturbation space is \({\text {W}}^{-2,1}(\Omega ;{\mathbb {R}}^{n})\) (cf. Section 2.2.3), and we can now turn to the precise implementation of the approximation argument.
4.2 Viscosity Approximations
We now set up the Ekeland-type viscosity approximation scheme, and hereafter suppose that \(f\in {\text {C}}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) is convex with (LG) and \(u_{0}\in {\text {LD}}(\Omega )\). For ease of notation, we write \({\overline{F}}:={\overline{F}}_{u_{0}}[-;\Omega ]\) in the sequel. Let \(u\in {\text {GM}}(F;u_{0})\) be arbitrary. By smooth approximation in the (symmetric) area-strict topology, Lemma 2.1, we find a sequence \((u_{j})\subset {\mathscr {D}}_{u_{0}}:=u_{0}+{\text {LD}}_{0}(\Omega )\) such that
By Theorem 2.9 ff. and hereafter continuity of \(w\mapsto f[{\text {E}}\!w](\Omega )\) for the symmetric area-strict metric, \((u_{j})\) is a minimising sequence for \({\overline{F}}\), and we have \(F[u_{j}]={\overline{F}}[u_{j}]\rightarrow {\overline{F}}[u]=\min {\overline{F}}[{\text {BD}}(\Omega )]\). Passing to a non-relabeled subsequence, we may thus assume
Since the trace operator \({\text {Tr}}:{\text {LD}}({\mathbb {R}}^{n}{\setminus }{\overline{\Omega }})\rightarrow {\text {L}}^{1}(\partial \Omega ;{\mathbb {R}}^{n})\) is surjective, we find a compactly supported extension \({\overline{u}}_{0}\in {\text {LD}}({\mathbb {R}}^{n})\) of \(u_{0}\). After a routine mollification of \({\overline{u}}_{0}\), we obtain \(u_{j}^{\partial \Omega }\in {\text {W}}^{1,2}(\Omega ;{\mathbb {R}}^{n})\) such that
where \({\text {Lip}}(f)\) is the Lipschitz constant of f (cf. Lemma 2.8). We then put \({\mathscr {D}}_{j}:=u_{j}^{\partial \Omega }+{\text {W}}_{0}^{1,2}(\Omega ;{\mathbb {R}}^{n})\subset {\text {W}}^{1,2}(\Omega ;{\mathbb {R}}^{n})\). Since \(u_{j}-u_{0}\in {\text {LD}}_{0}(\Omega )\), we find \({\widetilde{u}}_{j}\in {\mathscr {D}}_{j}\) such that
from which it follows that
Since \({\text {W}}_{0}^{1,2}(\Omega ;{\mathbb {R}}^{n})\subset {\text {LD}}_{0}(\Omega )\), we find for arbitrary \(\varphi \in {\text {W}}_{0}^{1,2}(\Omega ;{\mathbb {R}}^{n})\) that
At this stage, we infimise the previous overall inequality over all \(\varphi \in {\text {W}}_{0}^{1,2}(\Omega ;{\mathbb {R}}^{n})\) to obtain
Then, since \(\min {\overline{F}}[{\text {BD}}(\Omega )]=\inf F[{\mathscr {D}}_{u_{0}}]\), we deduce that
We consequently introduce the quantities \(A_{j}\) and the integrands \(f_{j}:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) via
for \(\xi \in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\). In order to employ the Ekeland variational principle with respect to sufficiently weak perturbations, we extend the integral functionals corresponding to \(f_{j}\) to \({\text {W}}^{-2,1}(\Omega ;{\mathbb {R}}^{n})\) by
For each \(j\in {\mathbb {N}}\), the functional \(F_{j}\) is not identically \(+\infty \) on \({\text {W}}^{-2,1}(\Omega ;{\mathbb {R}}^{n})\). The latter space is Banach by Lemma 2.5 (a) and, by Lemma 2.6 with \({\mathfrak {f}}=f_{j}\), \(q=2\) and \(k=2\), \(F_{j}\) is lower semicontinuous with respect to the norm topology on \({\text {W}}^{-2,1}(\Omega ;{\mathbb {R}}^{n})\). Moreover, we record
having used the very definition of \(F_{j}\) in the ultimate step. Therefore, Ekeland’s variational principle, Proposition 2.7, provides us with \(v_{j}\in {\text {W}}^{-2,1}(\Omega ;{\mathbb {R}}^{n})\) such that
We extract from (4.10) some routine information by testing with \(w={\widetilde{u}}_{j}\):
The latter quantity is finite and so, by the very definition of \(F_{j}\), \(v_{j}\in {\mathscr {D}}_{j}\subset {\text {W}}^{1,2}(\Omega ;{\mathbb {R}}^{n})\). Moreover, as \(v_{j}-u_{j}^{\partial \Omega }\in {\text {W}}_{0}^{1,2}(\Omega ;{\mathbb {R}}^{n})\subset {\text {LD}}_{0}(\Omega )\),
For latter purposes, we record the perturbed Euler–Lagrange equation
This inequality can be obtained by testing (4.10\(_{2}\)) with \(w=v_{j}\pm \theta \varphi \) for \(\theta >0\), \(\varphi \in {\text {W}}_{c}^{1,2}(\Omega ;{\mathbb {R}}^{n})\), dividing the resulting inequalities by \(\theta \) and then sending \(\theta \searrow 0\). Moreover, by the linear growth hypothesis (LG) and \(\tfrac{1}{c}=\min \{\frac{1}{2},c_{1}\}\), we infer from (4.11) that
holds for all \(j\in {\mathbb {N}}\). Finally, we note that due to Poincaré’s inequality on \({\text {LD}}_{0}(\Omega )\) we obtain
where \(C>0\) is the constant appearing in the requisite Poincaré inequality. We finally record
Lemma 4.1
The sequence \((v_{j})\) as constructed in (4.10) possesses a subsequence \((v_{j(l)})\subset (v_{j})\) such that
where \(u\in {\text {GM}}(F;u_{0})\) is the generalised minimiser fixed in the beginning of the section.
Proof
By (4.14) and (4.15) we conclude that \((v_{j})\) is uniformly bounded in \({\text {BD}}(\Omega )\), and thus possesses a subsequence \((v_{j(l)})\subset (v_{j})\) such that \(v_{j(l)}{\mathop {\rightharpoonup }\limits ^{*}} v\) in \({\text {BD}}(\Omega )\) as \(l\rightarrow \infty \) for some \(v\in {\text {BD}}(\Omega )\). Since \({\text {L}}^{1}(\Omega ;{\mathbb {R}}^{n})\hookrightarrow {\text {W}}^{-2,1}(\Omega ;{\mathbb {R}}^{n})\) by Lemma 2.5(b), \(v_{j(l)}\rightarrow v\) in \({\text {W}}^{-2,1}(\Omega ;{\mathbb {R}}^{n})\). On the other hand, (4.2), (4.5) and (4.10) imply that \(v_{j(l)}\rightarrow u\) in \({\text {W}}^{-2,1}(\Omega ;{\mathbb {R}}^{n})\). Hence \(u=v\), and the proof is complete. \(\quad \square \)
4.3 Preliminary Regularity Estimates
To justify the manipulations on the perturbed Euler–Lagrange equations satisfied by the \(v_{j}\)’s, we now derive non-uniform regularity estimates. Since (4.13) do not display elliptic differential equations (but differential inequalities), the corresponding higher differentiability assertions need to be approached slightly more carefully than for plain viscosity methods.
Lemma 4.2
Let \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) satisfy (LG) and, for some \(\Lambda \in (0,\infty )\), the bound
Define \(v_{j}\) for \(j\in {\mathbb {N}}\) by (4.10). Then it holds that \(v_{j}\in {\text {W}}_{{\text {loc}}}^{2,2}(\Omega ;{\mathbb {R}}^{n})\).
Proof
Let \(x_{0}\in \Omega \) and \(0<r<R<{\text {dist}}(x_{0},\partial \Omega )\). Also, let \(s\in \{1,\ldots ,n\}\), \(0<h<\frac{1}{2}({\text {dist}}(x_{0},\partial \Omega )-R)\) and pick \(\rho \in {\text {C}}_{c}^{\infty }(\Omega ;[0,1])\) be such that \(\mathbb {1}_{{\text {B}}(x_{0},r)}\leqq \rho \leqq \mathbb {1}_{{\text {B}}(x_{0},R)}\). We test the perturbed Euler–Lagrange equation (4.13) with \(\varphi :=\Delta _{s,-h}(\rho ^{2}\Delta _{s,h}v_{j})\in {\text {W}}_{c}^{1,2}(\Omega ;{\mathbb {R}}^{n})\). In consequence, integration by parts for difference quotients yields
We define for \({\mathscr {L}}^{n}\)-almost everywhere \(x\in {\text {B}}(x_{0},R)\) bilinear forms \({\mathscr {B}}_{j,s,h}(x):{\mathbb {R}}_{{\text {sym}}}^{n\times n}\times {\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) by
Then we note that, because of (4.16) and the definition of \(f_{j}\),
for all \(\xi \in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\), independently from s, h and x. Thus each \({\mathscr {B}}_{j,s,h}(x)\) is an elliptic bilinear form itself and a suitable version of Young’s inequality is available. With this notation, we infer from (4.17) by expanding the terms on the left and regrouping
Absorbing term \({\mathbf {II}}\) into \({\mathbf {I}}\), we obtain
and thus need to give bounds on \({\mathbf {III}}\) and \({\mathbf {IV}}\). As a consequence of (4.18), we immediately obtain
which is finite due to \(v_{j}\in {\text {W}}^{1,2}(\Omega ;{\mathbb {R}}^{n})\). As to term \({\mathbf {IV}}\), we use Lemma 2.5(b) to find by \({\text {L}}^{q}(\Omega ;{\mathbb {R}}^{n})\hookrightarrow {\text {W}}^{-2,1}(\Omega ;{\mathbb {R}}^{n})\) for some \(1<q<2\):
where \(c(\Omega ,n,j,q)\geqq 1\) and \(c(\Omega ,q)>0\) are constants. Here we used Korn’s inequality in \({\text {W}}_{0}^{1,q}(\Omega ;{\mathbb {R}}^{n})\) in the second and Young’s inequality in the penultimate step. The second term on the very right hand side of inequality (4.20) consequently is absorbed into the very left hand side of (4.19), and then we obtain \(\sup _{|h|<\frac{1}{2}({\text {dist}}(x_{0},\partial \Omega )-R)}{\mathbf {I}}<\infty \). Thus, \((\Delta _{s,h}\varepsilon (v_{j}))_{h}\) is uniformly bounded in \({\text {L}}^{2}({\text {B}}(x_{0},r);{\mathbb {R}}_{{\text {sym}}}^{n\times n})\) and hence \(\partial _{s}\varepsilon (v_{j})\) exists in \({\text {L}}^{2}({\text {B}}(x_{0},r);{\mathbb {R}}_{{\text {sym}}}^{n\times n})\) for each \(s\in \{1,\ldots ,n\}\). As a consequence, \(\partial _{s}v_{j}\in {\text {W}}^{1,2}({\text {B}}(x_{0},r);{\mathbb {R}}^{n})\) by Korn’s inequality. By arbitrariness of \(s\in \{1,\ldots ,n\}\), \(x_{0}\in \Omega \) and \(R>0\) sufficiently small, we thus obtain \(v_{j}\in {\text {W}}_{{\text {loc}}}^{2,2}(\Omega ;{\mathbb {R}}^{n})\). The proof is complete. \(\quad \square \)
4.4 Uniform Second Order Estimates
We now turn to uniform estimates (in \(j\in {\mathbb {N}}\)) for the viscosity approximating sequence \((v_{j})\). The following result is a key ingredient in the proof of Theorem 1.1, and we single it out as a theorem on its own right:
Theorem 4.3
Let \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) satisfy (LG). Moreover, suppose that for some \(\Lambda >0\) there holds
Moreover, let \((v_{j})\) be the viscosity approximation sequence constructed in the previous subsection. Then there exists a constant \(c=c(\Lambda ,c_{1},c_{2},\gamma ,n)>0\) such that for every \(x_{0}\in \Omega \), \(0<r<1\) with \({\text {B}}(x_{0},2r)\Subset \Omega \) and all \(j\in {\mathbb {N}}\) there holds
In what follows, it is customary to introduce the shorthand notation
We begin by collecting the properties of \(\sigma _{j}\):
Lemma 4.4
Let the integrand \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) satisfy (LG) and (4.21) and define \(v_{j}\) by (4.10). Then for all \(\ell \in \{1,\ldots ,n\}\) and \(\varphi \in {\text {W}}_{c}^{1,2}(\Omega ;{\mathbb {R}}^{n})\) we have
Proof
By Lemma 4.2, \(v_{j}\in {\text {W}}_{{\text {loc}}}^{2,2}(\Omega ;{\mathbb {R}}^{n})\). We note that \(\partial _{\ell }\sigma _{j} = f''_{j}(\varepsilon (v_{j}))\partial _{\ell }\varepsilon (v_{j})\), and since \(\sup _{z\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}}|f''_{j}(z)|<\infty \), \(\sigma _{j}\in {\text {W}}_{{\text {loc}}}^{1,2}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\). Let \(\varphi \in {\text {C}}_{c}^{\infty }(\Omega ;{\mathbb {R}}^{n})\). Then \(\partial _{\ell }\varphi \) is an admissible competitor in (4.13) and so, since \(\sigma _{j}\in {\text {W}}_{{\text {loc}}}^{1,2}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\),
Here, the last estimate is valid by Lemma 2.5 (b). Then the case of general \({\text {W}}_{c}^{1,2}(\Omega ;{\mathbb {R}}^{n})\)-maps \(\varphi \) follows by routine smooth approximation and \({\text {W}}^{1,2}(\Omega ;{\mathbb {R}}^{n})\hookrightarrow {\text {W}}^{-1,1}(\Omega ;{\mathbb {R}}^{n})\). \(\quad \square \)
We now come to the
Proof of Theorem 4.3
We divide the proof into three steps, and fix \(j\in {\mathbb {N}}\) throughout.
Step 1. Modified perturbed Euler–Lagrange equations. To establish (4.22), we shall use the weak Euler–Lagrange equation (4.23) from Lemma 4.4 satisfied by \(\sigma _{j}\). Let \(k\in \{1,\ldots ,n\}\) and let \(x_{0}\in \Omega ,\,0<r<1\) be such that \({\text {B}}(x_{0},2r)\Subset \Omega \). We choose a cut-off function \(\rho \in {\text {C}}_{c}^{\infty }(\Omega ;[0,1])\) such that \(\mathbb {1}_{{\text {B}}(x_{0},r)}\leqq \rho \leqq \mathbb {1}_{{\text {B}}(x_{0},2r)}\) and \(|\nabla ^{k}\rho |\leqq \big (\frac{2}{r}\big )^{k}\) for \(k\in \{1,2,3\}\). Without loss of generality, the interior \({\text {B}}'\) of \({\text {spt}}(\rho )\) is a ball, too.
Then, since \(v_{j}\in {\text {W}}_{{\text {loc}}}^{2,2}(\Omega ;{\mathbb {R}}^{n})\) by Lemma 4.2, we obtain that \(\varphi :=\rho ^{2}\partial _{k}(v_{j}-a_{j})=:\rho ^{2}\partial _{k}w_{j}\) belongs to \({\text {W}}_{c}^{1,2}(\Omega ;{\mathbb {R}}^{n})\) and hence qualifies as a competitor map in (4.23). Here, \(a_{j}\in {\mathscr {R}}(\Omega )\) is a rigid deformation to be specified later on, and \(w_{j}\) is defined in the obvious manner. We write \(A=(A^{im})_{i,m=1}^{n}\) for an \((n\times n)\)–matrix A and denote the l–th component of a vector \(u\in {\mathbb {R}}^{n}\) by \(u^{(l)}\). Then applying (4.23) to \(\ell =k\) and summing over \(k\in \{1,\ldots ,n\}\) yields by virtue of Lemma 2.5 (b)
where the left-hand sum is taken over all indices \(k,i,m\in \{1,\ldots ,n\}\). Towards (4.22), we note that
whereby it suffices to estimate the right hand side in view of (4.24). From (4.24) we deduce
Step 2. Estimating the terms\({\mathbf {I}},{\mathbf {II}}\)and\({\mathbf {III}}\). Ad \({\mathbf {I}}\) and \({\mathbf {III}}\). Let us note that, since the indices i, m run over all numbers \(1,\ldots ,n\) and \(\sigma _{j}(x)\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) for \({\mathscr {L}}^{n}\)-almost everywhere \(x\in \Omega \), we have \({\mathbf {I}}={\mathbf {III}}\). Moreover, we note that the artificial terms leading to the appearance of \({\mathbf {II}}\) are just introduced to have the symmetric gradient appearing, that is, terms which are conveniently controllable. In consequence, defining \({}_{j}\Theta _{k} := ({}_{j}\Theta _{k}^{im})_{i,m=1}^{n}\) and \({}_{j}{\widetilde{\Theta }}_{k} := ({}_{j}{\widetilde{\Theta }}_{k}^{im})_{i,m=1}^{n}\) with
we find by \(\sigma _{j}(x)\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) for \({\mathscr {L}}^{n}\)-almost everywhere \(x\in \Omega \) and the definition of the Frobenius inner product on \({\mathbb {R}}^{n\times n}\)
We now employ the definition of \(\sigma _{j}\) and \({\mathcal {A}}_{j}[\varepsilon (v_{j});\cdot ,\cdot ]\). Then we obtain, applying the Cauchy–Schwarz inequality to the bilinear forms \({\mathcal {A}}_{j}[\varepsilon (v_{j});\cdot ,\cdot ]\):
Appealing to (4.21) and recalling \(|\nabla \rho |\leqq \frac{2}{r}\), we then further estimate
Ad \({\mathbf {II}}\). By symmetry of \(\sigma _{j}\), that is, \(\sigma _{j}^{im}(x)=\sigma _{j}^{mi}(x)\) for all \(i,m\in \{1,\ldots ,n\}\), \(j\in {\mathbb {N}}\) and for \({\mathscr {L}}^{n}\)-almost everywhere \(x\in \Omega \), and a permutation of indices, it suffices to estimate the term
with an obvious definition of \({\mathbf {VI}}\). Integrating by parts twice yields
where \({\mathbf {VI}}_{1},\ldots ,{\mathbf {VI}}_{4}\) are defined in the obvious manner. Note that, by the \({\text {W}}_{{\text {loc}}}^{2,2}\)–regularity of \(v_{j}\) and the \({\text {W}}_{{\text {loc}}}^{1,2}\)-regularity of \(\sigma _{j}\), this is a valid computation. The crucial point in this calculation is that the only derivatives that apply to \(w_{j}\) appear in the form \(\partial _{k}w_{j}^{(k)}\) (and are decoupled from the (i, m)–components), and summation over \(k\in \{1,\ldots ,n\}\) corresponds to taking the divergence of \(w_{j}\). We define \(\psi _{j,k}:=(\psi _{j,k}^{(i)})_{i=1}^{n}:=((\partial _{ik}\rho ^{2})w_{j}^{(k)})_{i=1}^{n}\in {\text {W}}_{c}^{1,2}(\Omega ;{\mathbb {R}}^{n})\). Then, with \({\text {div}}(\sigma _{j})\) denoting the row-wise divergence, we obtain
Here we used Lemma 2.5(b) in the penultimate inequality. The term \({\mathbf {VI}}_{3}\) is treated similarly, now defining \({\widetilde{\psi }}_{j,k} := ((\partial _{i}\rho ^{2})\partial _{k}w_{j}^{(k)})_{i=1}^{n}\in {\text {W}}_{c}^{1,2}(\Omega ;{\mathbb {R}}^{n})\) as \(w_{j}\in {\text {W}}_{{\text {loc}}}^{2,2}(\Omega ;{\mathbb {R}}^{n})\) by Lemma 4.2. Then we estimate analogously
At this stage, note that by repeated use of Lemma 2.5(b),
Hence, we obtain (by possibly enlarging the constant \(c(n)>0\) from the estimation of \(|{\mathbf {VI}}_{1}|\))
We turn to the estimation of \({\mathbf {VI}}_{2}\) and \({\mathbf {VI}}_{4}\). We recall that we still have the freedom to choose the rigid deformations \(a_{j}\) as they appear in the definition of \(w_{j}\). As \({\text {spt}}(\rho )={\text {B}}'\) is a ballFootnote 3, we find a constant \(C({\text {B}}')>0\) such that for every \(v\in {\text {W}}^{1,2}({\text {B}}';{\mathbb {R}}^{n})\) there exists \(a\in {\mathscr {R}}({\mathbb {R}}^{n})\) such that
It is important that for each such v we can choose one rigid deformation a to make both inequalities work, and by Lemma 2.3, this is in fact possible. Accordingly, we choose for each \(j\in {\mathbb {N}}\) some \(a_{j}\in {\mathscr {R}}({\mathbb {R}}^{n})\) such that inequality (4.32) holds with v being replaced by \(v_{j}\) and with a being replaced by \(a_{j}\). Turning to \({\mathbf {VI}}_{2}\), we go back to the definition of \(\sigma _{j}\) and thereby obtain by virtue of Young’s inequality and the above Poincaré inequalities (4.32) that
As to \({\mathbf {VI}}_{4}\), we note that since \((\varepsilon (v_{j}))\) is uniformly bounded in \({\text {L}}^{1}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\) by (4.14), so is \(({\text {div}}(v_{j}))\) in \({\text {L}}^{1}(\Omega )\). We then estimate, using the pointwise bound \(|{\text {div}}(w_{j})|\leqq |\varepsilon (v_{j})|\) and (4.14),
By our choice of \(a_{j}\) and (4.14), \({\mathbf {VII}}\) can now be estimated by
Step 3. Conclusion. We now gather estimates and start from (4.25) to find
whence we absorb \({\mathbf {V}}_{1}\) into the left side of the previous inequality. We then succesively combine (4.27), (4.30)–(4.36) with (4.14) to obtain via \(0<r<1\)
where we track constants to find that \(c=c({\text {Lip}}(f),\Lambda ,n,\gamma ,c_{1})>0\). Since \({\text {Lip}}(f)\) only depends on \(c_{2},\gamma \) by Lemma 2.8, this immediately gives (4.22) by (4.25), and the proof is hereby complete. \(\quad \square \)
4.5 Proof of Theorem 1.1
Based on Theorem 4.3, we can proceed to the proof of Theorem 1.1. It needs to be noted that the second order estimate given in (4.22) is the decisive ingredient which we lacked in [46], and in the following we demonstrate how (4.22) leads to a Sobolev regularity improvement. Here, we are led by the ideas exposed in [14, 18] for the gradient case.
Proof of Theorem 1.1
Let \(u\in {\text {GM}}(F;u_{0})\) be given and let \({\text {B}}(x_{0},5r)\subset \Omega \) be an open ball. In this situation, u is a local generalised minimiser, which in particular implies that \({\overline{F}}_{u}[u;{\text {B}}(x_{0},5r)]\leqq {\overline{F}}_{u}[v;{\text {B}}(x_{0},5r)]\) for all \(v\in {\text {BD}}({\text {B}}(x_{0},5r))\). We now denote \((v_{j})\) the specific Ekeland viscosity approximation sequence as constructed in (4.10)ff., with \(\Omega \) being replaced by \({\text {B}}(x_{0},5r)\) and \(u_{0}\) being replaced by \(u|_{{\text {B}}(x_{0},5r)}\). Lemma 4.1 then implies that there exists a subsequence \((v_{j(l)})\subset (v_{j})\) such that \(v_{j(l)}{\mathop {\rightharpoonup }\limits ^{*}} u\) in \({\text {BD}}({\text {B}}(x_{0},5r))\) as \(l\rightarrow \infty \).
We begin with \(n\geqq 3\). Since in particular \(1<a<2\) in the present situation, we introduce the auxiliary convex function \( V_{a}(\xi ):=(1+|\xi |^{2})^{\frac{2-a}{4}}\), \(\xi \in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\). Recalling \((v_{j(l)})\subset {\text {W}}_{{\text {loc}}}^{2,2}({\text {B}}(x_{0},5r);{\mathbb {R}}^{n})\) from Lemma 4.2 and differentiating \(V_{a}(\varepsilon (v_{j(l)}))\), we obtain for all \(k\in \{1,\ldots ,n\}\) that
Therefore, we find by the previous inequality, the lower bound in (1.4) and Theorem 4.3:
where \(c=c(n,a,\lambda ,\Lambda ,\gamma ,c_{1},c_{2})>0\). Here, the first estimate is valid by the scaled Sobolev inequality, whereas we have used \(\sqrt{1+t^{2}}\leqq 1+t\) for the ultimate one. As a consequence of the definition of \(V_{a}\), we find
This yields local uniform boundedness of \((\varepsilon (v_{j(i)}))\) in \({\text {L}}^{q}({\text {B}}(x_{0},r);{\mathbb {R}}_{{\text {sym}}}^{n\times n})\) for \(q=\frac{2-a}{n-2}n\), and the latter number satisfies \(q>1\) if and only if \(1<a<1+\frac{2}{n}\), which is the standing assumption of Theorem 1.1. Let us note in advance that (4.12) implies that \(\lim _{l\rightarrow \infty }{\mathbf {II}}_{l}=0\), whereas \(\lim _{l\rightarrow \infty }{\mathbf {III}}_{l}=0\) holds trivially. Now consider the function \(\Psi _{q}(t):=|t|^{q}\) for \(q>1\). Then \(\Psi _{q}^{\infty }(t)=\infty \) for if \(|t|>0\). Since , we obtain as a consequence of Reshetnyak’s theorem, Lemma 2.9, and the notation adopted in (2.11) afterwards with \(\nu =({\mathscr {L}}^{n},{\text {E}}\!u)\),
Since the very right hand side is finite, we conclude that \({\text {E}}\!^{s}u\) vanishes on \({\text {B}}(x_{0},r)\). By arbitrariness of \({\text {B}}(x_{0},r)\), we moreover infer that \({\text {E}}\!^{s}u\equiv 0\) in \(\Omega \) and so \(u\in {\text {LD}}(\Omega )\) together with \(\varepsilon (u)={\mathscr {E}}u\). Moreover, by Korn’s inequality, \(\nabla u\in {\text {L}}^{q}({\text {B}}(x_{0},r);{\mathbb {R}}^{n\times n})\). To obtain the precise form of (1.7), we choose a rigid deformation \(\pi _{u}\in {\mathscr {R}}({\text {B}}(x_{0},r))\) such that
Dividing the previous inequality by \(r^{\frac{n}{q}}=r^{\frac{n-2}{2-a}}\), we obtain
This is (1.7) and the proof is complete for if \(n\geqq 3\). Now let \(n=2\). As above, \((V_{a}(\varepsilon (v_{j(l)})))\) is locally uniformly bounded in \({\text {W}}^{1,2}({\text {B}}(x_{0},5r);{\mathbb {R}}^{n})\) and thus, using Trudinger’s embedding \({\text {W}}^{1,n}(\Omega )\hookrightarrow \exp {\text {L}}^{\frac{n}{n-1}}(\Omega )\), (4.37) equally yields
Working with \(\Psi (t)=\exp (t^{\frac{n}{n-1}\frac{2-a}{2}})=\exp (t^{2-a})\) instead of \(\Psi _{q}\) from above, we similarly conclude that \(u\in E^{1}\exp {\text {L}}^{2-a}({\text {B}}(x_{0},r))\). We then employ Cianchi’s inequality from Lemma 2.2(b) with \(\beta =2-a(>0)\) and hereafter \(\frac{\beta }{\beta +1}=\frac{2-a}{3-a}\). In consequence,
and the proof is complete.\(\quad \square \)
4.6 Selected Implications
We now collect some consequences of the results established above and particularly improve the results from [46]. We begin by strengthening [46, Corollary 3.8], justifying the second arrow from below in Figure 1.
Corollary 4.5
(Existence of second derivatives) Let \(n\geqq 2\) and suppose that \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) satisfies (LG) and (1.4) for some \(1<a<\frac{n}{n-1}\). Then there holds \({\text {GM}}_{{\text {loc}}}(F)\subset {\text {W}}_{{\text {loc}}}^{2,q}(\Omega ;{\mathbb {R}}^{n})\) for any \(1<q<2\) if \( n=2\) and for any \(1<q\leqq n\frac{2-a}{n-a}\) if \(n\geqq 3\).
Proof
Let \(u\in {\text {GM}}_{{\text {loc}}}(F)\) and let \({\text {B}}(x_{0},r)\Subset \Omega \) be an open ball. Since \(\frac{n}{n-1}\leqq 1+\frac{2}{n}\), by Theorem 1.1 we have \({\text {GM}}_{{\text {loc}}}(F)\subset {\text {W}}_{{\text {loc}}}^{1,p}(\Omega ;{\mathbb {R}}^{n})\) for any \(1<p<\infty \) if \(n=2\) and any \(1<p\leqq \frac{2-a}{n-2}n\) if \(n\geqq 3\). Let \((v_{j})\) be the Ekeland viscosity approximation sequence constructed in (4.10) with \(\Omega \) being replaced by \({\text {B}}(x_{0},r)\) and \(u_{0}\) being replaced by \(u|_{{\text {B}}(x_{0},r)}\). Then we record, using Young’s inequality with exponents \(\frac{2}{q}\) and \(\frac{2}{2-q}\) for some \(1\leqq q<2\),
The first term is uniformly controlled by Theorem 4.3. If \(n=2\), then the second term is uniformly bounded in \(j\in {\mathbb {N}}\) regardless of \(1\leqq q <2\) as \(\sup _{j\in {\mathbb {N}}}\Vert \varepsilon (v_{j})\Vert _{{\text {L}}^{p}({\text {B}}(x_{0},r);{\mathbb {R}}_{{\text {sym}}}^{n\times n})}<\infty \) for all \(1<p<\infty \). If \(n\geqq 3\), the second term is uniformly bounded in \(j\in {\mathbb {N}}\) if
Note that \({\widetilde{q}}(n)> 1\) if and only if \(1< a < \frac{n}{n-1}\). Hence, \((v_{j})\) is locally uniformly bounded in \({\text {W}}^{2,q}\) for \(1<q\leqq \frac{n(2-a)}{n-a}\). From here the result follows in the same way as in the proof of Theorem 1.1, again using Korn’s inequality. \(\quad \square \)
Compared with [46], we have now established that for the ellipticity regime \(1<a<\frac{n}{n-1}\), all generalised minima possess second derivatives in some \({\text {L}}_{{\text {loc}}}^{q}\), \(q>1\). By standard results on Lebesgue points of Sobolev functions, we then have
Corollary 4.6
(Singular set bounds) Let \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) satisfy (LG) and (1.4) for some \(1<a<1+\frac{2}{n}\). For a given map \(v\in {\text {BD}}_{{\text {loc}}}(\Omega )\), put
If \(n\geqq 2\) and \(1<a<\frac{n}{n-1}\), then any\(u\in {\text {GM}}_{{\text {loc}}}(F)\) satisfies \(\dim _{{\mathscr {H}}}(\Sigma _{u})\leqq n\frac{n-2}{n-a}\).
We conclude this section by describing the structure of \({\text {GM}}(F;u_{0})\) and begin with
Corollary 4.7
(Uniqueness modulo elements of\({\mathscr {R}}(\Omega )\)) Let \(\Omega \subset {\mathbb {R}}^{n}\) be an open, bounded and connected set with Lipschitz boundary and \(u_{0}\in {\text {LD}}(\Omega )\). In the situation of Theorem 1.1, generalised minimisers are unique up to rigid deformations, that is,
Proof
By Theorem 1.1, \({\text {GM}}(F;u_{0})\subset {\text {LD}}(\Omega )\). Now suppose that \(u,v\in {\text {GM}}(F;u_{0})\) are two generalised minima such that \(\varepsilon (u)\ne \varepsilon (v)\)\({\mathscr {L}}^{n}\)-almost everywhere. Then, by strict convexity of f and convexity of \(f^{\infty }\),
For \(u,v\in {\text {GM}}(F;u_{0})\), the right-hand side equals \(\min {\overline{F}}_{u_{0}}[{\text {BD}}(\Omega )]\) which consequently yields a contradiction. Hence, \(\varepsilon (u-v)=0\)\({\mathscr {L}}^{n}\)-almost everywhere, and since \(\Omega \) is connected, this implies that \(u=v+\pi \) for some \(\pi \in {\mathscr {R}}(\Omega )\). \(\quad \square \)
As in the \({\text {BV}}\)-case, Corollary 4.7 cannot be improved to yield full uniqueness. To this end, it is important to require a suitable variant of strict convexity on the recession function \(f^{\infty }\); note that \(f^{\infty }\) is positively 1-homogeneous and hence not strictly convex. In this respect, the relevant concept is as follows (also see [70, Section 4.5]): We say that a function \(g:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) has strictly convex sublevel sets provided for each \(t\in {\mathbb {R}}\) the set \(\Gamma _{t}(g):=\{z\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}:\;g(z)<t\}\) is bounded, convex and if \(z_{1},z_{2}\in \partial \Gamma _{t}(g)\), then \(\lambda z_{1}+(1-\lambda )z_{2}\notin \partial \Gamma _{t}(g)\) for any \(0<\lambda <1\).
Corollary 4.8
(Uniqueness and structure of \({\text {GM}}(F;u_{0})\)) Let \(\Omega \subset {\mathbb {R}}^{n}\) be an open, bounded, connected set with Lipschitz boundary such that for any fixed \(a\in {\mathbb {R}}\) there holds
In the situation of Corollary 4.7, suppose that the map \(f_{\nu }^{\infty }:{\mathbb {R}}^{n}\ni z\mapsto f^{\infty }(z\odot \nu )\) has strictly convex sublevel sets for all \(\nu \in {\mathbb {R}}^{n}{\setminus }\{0\}\). Then there exists a generalised minimiser \(u\in {\text {GM}}(F;u_{0})\) and a rigid deformation \(\pi \in {\mathscr {R}}(\Omega )\) such that
Finally, if there exists a generalised minimiser u which attains the boundary values \({\text {Tr}}_{\partial \Omega }(u_{0})\)\({\mathscr {H}}^{n-1}\)-almost everywhere on \(\partial \Omega \), then \({\text {GM}}(F;u_{0})=\{u\}\).
The condition on \(f_{\nu }^{\infty }\) to have strictly convex sublevel sets is satisfied if, for example, f is spherically symmetric, ruling out that \((f^{\infty })^{-1}(\{1\})\) contains any line segments of positive length. Corollary 4.8 follows from Corollary 4.7 similarly as in the \({\text {BV}}\)-case, cf. [14, Theorem 1.16], but is technically more demanding; for the reader’s convenience, the appendix A, Section 8, includes the precise reasoning with emphasis on the two-dimensional case.
5 A Family of Convolution-Type Poincaré Inequalities
Approaching Theorem 1.2, we pause to provide a family of convolution inequalities to instrumentally enter the partial regularity proof below. We believe that the result might be of independent interest, and thus state selected versions thereof in the end of the section.
Proposition 5.1
Let \(\lambda >1\) and let \(V(z):=\sqrt{1+|z|^{2}}-1\) be the auxiliary reference integrand as usual. Then there exists a constant \(c=c(n,\lambda )>0\) such that the following holds: For every open and bounded Lipschitz domain \(\Omega \subset {\mathbb {R}}^{n}\), \(u\in {\text {BD}}_{{\text {loc}}}({\mathbb {R}}^{n})\) and numbers \(\varepsilon ,L>0\) it holds that
where \(\rho :{\mathbb {R}}^{n}\rightarrow {\mathbb {R}}_{\geqq 0}\) is an arbitrary standard mollifier in the sense of Section 2.4.
Before passing to the proof of the preceding proposition, let us remark that (5.1) cannot be established as in the full gradient case, cf. [10, Lemma 5.3]. Namely, if we wish to obtain (5.1) for \(u\in {\text {BV}}({\mathbb {R}}^{n};{\mathbb {R}}^{N})\) with the symmetric gradient on the right-hand side being replaced by the full gradient, it suffices to invoke the fundamental theorem of calculus in conjunction with Jensen’s inequality. In view of (5.1), Ornstein’s Non-Inequality forces us to avoid the appearance of the full gradient on the right-hand side. Upon localisation, a slightly weaker result can be readily obtained by invoking the Smith representation formula (2.4) and then arguing as in the full gradient case, so that we may conclude that for any \(0<\beta <1\) (but not for \(\beta =1\)) there exists \(C=C(\beta ,\text {diam}(\Omega ))>0\) with
for all \(u\in {\text {BD}}(\Omega )\) and \(L>0\)Footnote 5. However, this is neither optimal nor good enough for deriving the requisite decay estimate in Section 6; see the proof of Proposition 6.4 and Remark 6.6 afterwards.
Proof of Proposition 5.1
The proof consists in four main steps. After giving the geometric setup in a first step, we establish a preliminary Poincaré-type inequality involving the reference integrand V in the second step. Then we globalise by a covering argument with respect to cubes having edgepoints contained in a certain lattice, depending on the parameters \(\varepsilon \) and \(\lambda \). Lastly, we smoothly approximate to conclude the full claim. \(\quad \square \)
Step 1. Preliminaries. Let \(\lambda >1\) be given. Let \(\Omega \) be as in the proposition and denote, for \(t>0\), \(N_{t}(\Omega ):=\{x\in {\mathbb {R}}^{n}:\;{\text {dist}}(x,\Omega )<t\}\) the t-neighbourhood of \(\Omega \). We put \(\ell :=\lceil \frac{1}{\lambda -1}\rceil +1\in {\mathbb {N}}\) so that \(\frac{1}{\lambda -1}<\ell \), and define \(\varepsilon _{\lambda }:=\frac{\varepsilon }{\ell }\).
We now consider the lattice \(\Gamma _{\varepsilon _{\lambda }}:=\varepsilon _{\lambda }{\mathbb {Z}}^{n}\) and denote \({\mathcal {Q}}_{\varepsilon _{\lambda }}\) the collection of all open cubes of sidelength \(\varepsilon _{\lambda }\) and edge points contained in \(\Gamma _{\varepsilon _{\lambda }}\). Given \(Q\in {\mathcal {Q}}_{\varepsilon _{\lambda }}\), we denote \({\widetilde{Q}}\) the cube which has the same center as Q and sides parallel to those of Q but sidelength \((2\ell +1)\varepsilon _{\lambda }\). Then \({\widetilde{Q}}\) has all its edge points equally contained in \(\Gamma _{\varepsilon _{\lambda }}\), \(N_{\varepsilon }(Q)=Q+{\text {B}}(0,\varepsilon )\subset {\widetilde{Q}}\), and can be written as the union of \({\mathscr {N}}={\mathscr {N}}(\lambda ,n)\in {\mathbb {N}}\) cubes from \({\mathcal {Q}}_{\varepsilon _{\lambda }}\); for notational convenience, we denote these cubes \(Q^{(i)}\), \(i=1,\ldots ,{\mathscr {N}}\), and arrange that for all \(Q\in {\mathcal {Q}}_{\varepsilon _{\lambda }}\), the relative positioning of \(Q^{(i)}\) to Q is the same—see Figure 2 for this setup. Moreover, if \(Q\in {\mathcal {Q}}_{\varepsilon _{\lambda }}\) satisfies \(Q\cap \Omega \ne \emptyset \), then we have \({\widetilde{Q}}\subset N_{\lambda \sqrt{n}\varepsilon }(\Omega )\). In fact, in this case there exists \(x_{0}\in Q\cap \Omega \) and thus for any \(z\in {\widetilde{Q}}\) we have \({\text {dist}}(z,\Omega ) \leqq |x_{0}-z|\). By the geometry of \({\widetilde{Q}}\) (see Figure 2), it is clear that \(|x_{0}-z|\) does not exceed
and hence \({{\,\mathrm{dist}\,}}(z,\Omega )<\lambda \sqrt{n}\varepsilon \) so that \(z\in N_{\lambda \sqrt{n}\varepsilon }(\Omega )\). Summarising, for every \(Q\in {\mathcal {Q}}_{\varepsilon _{\lambda }}\) with \(Q\cap \Omega \ne \emptyset \), we have \({\widetilde{Q}}=\bigcup _{i=1}^{{\mathscr {N}}}Q^{(j)}\subset N_{\lambda \sqrt{n}\varepsilon }(\Omega )\).
Step 2. A Poincaré-type inequality for the reference integrandV. In a second step, we claim that there exists a constant \(c=c(n)>0\) such that for every open cube \(Q\subset {\mathbb {R}}^{n}\), every \(L>0\) and every \(u\in {\text {C}}^{\infty }({\mathbb {R}}^{n};{\mathbb {R}}^{n})\) there holds
Here, \({\widetilde{\Pi }}_{Q}u\) denotes the rigid deformation determined by Proposition 2.4. It is crucial for this inequality to be available in this very form, and so we provide the details. Thus let \(u\in {\text {C}}^{\infty }({\mathbb {R}}^{n};{\mathbb {R}}^{n})\) and employ the representation from Lemma 2.4: There exists \({\widetilde{\Pi }}_{Q}u\in {\mathscr {R}}(Q)\) such that for all \(x\in Q\) it holds that
where \(|R_{Q}(x,y)|\leqq C_{R}|x-y|^{1-n}\) for all \(x,y\in Q\), \(x\ne y\), with a constant \(C_{R}=C_{R}(n)>0\).
Let \(x\in Q\). We define a measure \(\mu _{x}:{\mathscr {B}}(Q)\rightarrow {\mathbb {R}}_{\geqq 0}\) by putting \(\mu _{x}(A):=\int \nolimits _{A}C_{R}|x-y|^{1-n}\mathrm {d}y\) for \(A\in {\mathscr {B}}(Q)\). Since \(|x-y|<\sqrt{n}\ell (Q)\) for all \(x,y\in Q\),
We also need a remark on the upper bound. Namely, if \(x\in Q\), then \(Q\subset {\text {B}}(x,\sqrt{n}\ell (Q))\) independently of x. Thus, with \(\omega _{n}={\mathscr {L}}^{n}({\text {B}}(0,1))\),
In conclusion, there exists \(c=c(n)>0\) such that
for all cubes Q and \(x\in Q\). Now, \(\mu _{x}/\mu _{x}(Q)\) is a probability measure on \({\mathscr {B}}(Q)\) for every \(x\in Q\). In consequence, as \(|u-{\widetilde{\Pi }}_{Q}u|\leqq |T_{Q}[\varepsilon (u)]|\) pointwisely in Q and \(V:{\mathbb {R}}_{\geqq 0}\rightarrow {\mathbb {R}}_{\geqq 0}\) is monotone, we estimate by Jensen’s inequality
Tracking the dependencies of constants, \(c=c(n)>0\), thereby establishing (5.3).
Step 3. Inequality (5.1) for\({\text {C}}^{\infty }\)-maps. As a main feature of the symmetric gradient operator, let us note that as first order polynomials, all elements \(\pi \in {\mathscr {R}}({\mathbb {R}}^{n})\) of its nullspace are harmonic. Thus they satisfy the mean value property and, as a consequence, convolution with standard mollifiers locally turns out to be the identity on the rigid deformations, cf. [36, Chapter 2.2.3, Theorem 6]. For any \(Q\in {\mathcal {Q}}_{\varepsilon _{\lambda }}\), we recall the definition of the cube \({\widetilde{Q}}\) from step 1. Then (5.3) holds true with Q and \({\widetilde{\Pi }}_{Q}u\) being replaced by \({\widetilde{Q}}\) and \({\widetilde{\Pi }}_{{\widetilde{Q}}}u\), respectively. We then obtain, using Lemma 2.11(i) in the third step, that
See Fig. 3 for the geometric setup. At this stage, we use Jensen’s and Young’s inequalities to conclude that for any \(Q\in {\mathcal {Q}}_{\varepsilon _{\lambda }}\) there holds
where \(c=c(\lambda ,n)\) (note that \(\ell \) only depends on \(\lambda \)). Note that, for any fixed \(j\in \{1,\ldots ,{\mathscr {N}}\}\) and all \(Q,Q'\in {\mathcal {Q}}_{\varepsilon _{\lambda }}\) with \(Q\ne Q'\), \(Q^{(j)}\cap Q'^{(j)}=\emptyset \). On the other hand, by step 1, for any fixed \(j\in \{1,\ldots ,n\}\), \(\bigcup _{Q\in {\mathcal {Q}}_{\varepsilon _{\lambda }},\;Q\cap \Omega \ne \emptyset }Q^{(j)}\subset \bigcup _{Q\in {\mathcal {Q}}_{\varepsilon _{\lambda }},\,Q\cap \Omega \ne \emptyset }{\widetilde{Q}}\subset N_{\lambda \sqrt{n}\varepsilon }(\Omega )\). Therefore,
Consequently, we obtain by \(Q\subset {\widetilde{Q}}\) and (5.5) in the first step that
Since \({\mathscr {N}}={\mathscr {N}}(\lambda ,n)\), \(c=c(\lambda ,n)\) in the previous estimation, and (5.1) follows for \(u\in {\text {C}}^{\infty }({\mathbb {R}}^{n};{\mathbb {R}}^{n})\).
Step 4. Passage to the general case. Let \(u\in {\text {BD}}_{{\text {loc}}}({\mathbb {R}}^{n})\). By localisation, it is no loss of generality to assume \(u\in {\text {BD}}({\mathbb {R}}^{n})\). Let \(\eta \in {\text {C}}_{c}^{\infty }({\text {B}}(0,1);[0,1])\) be a standard mollifier. We put \(u_{k}:=\eta _{1/k}*u\), so that, by passing to a non-relabeled subsequence, \(u_{k}\rightarrow u\)\({\mathscr {L}}^{n}\)-almost everywhere in \({\mathbb {R}}^{n}\). This yields by Fatou’s lemma for all \(\varepsilon >0\) that
where we used inequality (5.1) for smooth maps in the second and Jensen’s inequality in the third step. This is the inequality claimed in the proposition and the proof of (5.1) for \(u\in {\text {BD}}_{{\text {loc}}}({\mathbb {R}}^{n})\) is hereby complete. \(\quad \square \)
For consistency, let us note that if the right hand side of (5.1) is zero, then u must coincide with a rigid deformation on each of the connected components of \(U+{\text {B}}(0,\lambda \sqrt{n}\varepsilon )\) and so on those of U; in consequence, it must coincide with its mollification on each of these connected components and hence the left hand side is zero indeed.
Corollary 5.2
Let \(\lambda >1\) and let \(V(z):=\sqrt{1+|z|^{2}}-1\) be the auxiliary reference integrand as usual. Then there exists a constant \(c=c(n,\lambda )>0\) such that the following holds: For every open and bounded Lipschitz domain \(\Omega \subset {\mathbb {R}}^{n}\), \(u\in {\text {BD}}_{{\text {loc}}}({\mathbb {R}}^{n})\) and numbers \(\varepsilon ,L>0\) there holds
where \(\rho ,\eta :{\mathbb {R}}^{n}\rightarrow {\mathbb {R}}_{\geqq 0}\) are arbitrary standard mollifiers in the sense of Section 2.4.
Proof
Denote the left-hand side of (5.7) by \({\mathbf {I}}\). We use Lemma 2.11(b) and Jensen’s inequality to obtain
where again \(c=c(\lambda ,n)\). The proof is complete. \(\quad \square \)
We conclude this section by discussing a particular borderline case in the spirit of (5.1), for simplicity stated on the entire \({\mathbb {R}}^{n}\):
Corollary 5.3
(Sobolev–Poincaré inequality in convolution form) For any \(1\leqq p \leqq \frac{n}{n-1}\) there exists a constant \(c=c(n,p)>0\) with the following property: For every \(u\in {\text {BD}}({\mathbb {R}}^{n})\) and \(\varepsilon >0\) there holds
where \(\rho :{\mathbb {R}}^{n}\rightarrow {\mathbb {R}}_{\geqq 0}\) is an arbitrary standard mollifier in the sense of Section 2.4.
Proof
Let \(u\in {\text {LD}}({\mathbb {R}}^{n})\). By the Strauss inequality [78] and Poincaré’s Inequality, Lemma 2.3, there exists a constant \(c=c(n,p)>0\) such that \(\Vert u-{\widetilde{\Pi }}_{Q}u\Vert _{{\text {L}}^{p}(Q;{\mathbb {R}}^{n})}\leqq c\ell (Q)^{\frac{p-pn+n}{p}}\Vert \varepsilon (u)\Vert _{{\text {L}}^{1}(Q;{\mathbb {R}}_{{\text {sym}}}^{n\times n})}\) for all \(u\in {\text {LD}}({\mathbb {R}}^{n})\) and cubes \(Q\subset {\mathbb {R}}^{n}\). We argue as in the proof of Proposition 5.1, but now work with the lattice \(\Gamma _{\varepsilon }=\varepsilon {\mathbb {Z}}^{n}\) and, for \(Q\in {\mathcal {Q}}_{\varepsilon }\), define \({\widetilde{Q}}\) to be the cube with the same center as Q but \((2n+1)\)-times its sidelength. This yields
and from here the conclusion follows by smooth approximation as above. \(\quad \square \)
Following the scheme of proof, other inequalities can equally be obtained, so, for example, by replacing the \({\text {L}}^{p}\)-norm on the left-hand side of (5.8) by Sobolev–Slobodeckjiĭ (use \({\text {BD}}({\mathbb {R}}^{n})\hookrightarrow {\text {W}}^{s,\frac{n}{n-1+s}}({\mathbb {R}}^{n};{\mathbb {R}}^{n})\), \(0<s<1\), cf. [46]) or Triebel–Lizorkin seminorms.
6 Partial \(\mathbf{C}^{\mathbf{1,\alpha }}\)-Regularity and the Proof of Theorem 1.2
In this section we provide the proof of the second main result of this paper, Theorem 1.2, allowing for possibly very degenerate ellipticities.
6.1 Outline of the Proof and Setup
In order to reach the full degenerate elliptic regime which Theorem 1.2 applies to, we employ a direct comparison strategy that uses mollifications of generalised minima as comparison maps. A direct strategy here is suggested by both the very weak compactness properties of \({\text {BD}}\) and the general lack of higher integrability of generalised minima in the very degenerate ellipticity regime (for example, if \(1+\frac{2}{n}\leqq a <\infty \)). Comparison methods of this type, originally employed in [10] for the full gradient case, consequently require to control V-function-type distances of generalised minima to their mollifications. This is where the convolution-type Poincaré inequalities of the previous section enter crucially. More precisely, we proceed as follows:
-
(i)
Section 6.2.1: Estimates for comparison maps. By linearisation, Proposition 6.1 establishes that if a \({\text {C}}^{1,\alpha }\)-Hölder continuous function satisfies a certain smallness condition and has symmetric gradient close to some carefully chosen reference point, then it almost enjoys the typical decay for linear systems. For the linearised integrands, full gradient estimates are available by Korn’s inequality in \({\text {L}}^{2}\).
-
(ii)
Section 6.2.2: Smoothing and selection of good radii. To construct the requisite \({\text {C}}^{1,\alpha }\)-comparison maps for step (i), we carefully mollify the given generalised minimiser and demonstrate that, under suitable smallness assumptions, the mollification parameters can be chosen such that the comparison estimates from (i) become available, cf. Lemma 6.2 and Corollary 6.3.
-
(iii)
Section 6.2.3: Comparison estimates and decay. Here we give the aforementioned comparison argument and employ minimality to deduce a preliminary decay estimate for generalised minima, cf. Proposition 6.4. To control the emerging terms, the comparison will be essentially reduced to good annuli where the relevant differences can be dealt with conveniently. The construction of such annuli hinges on Lemma 2.10, giving control over the symmetric gradients, whereas Corollary 5.2 allows to suitably bound lower order terms.
These steps lead to an \(\varepsilon \)-regularity result, Corollary 6.7, finally implying Theorem 1.2; cf. Section 6.3. We now introduce the requisite terminology for the proof below: Given \(x_{0}\in \Omega \) and \(R>0\) such that \({\text {B}}(x_{0},R)\Subset \Omega \), we define for \(u\in {\text {BD}}_{{\text {loc}}}(\Omega )\) two excess quantities by
where the mean values in the definition of \(\Phi ,{\widetilde{\Phi }}\) are taken with respect to \({\mathscr {L}}^{n}\), cf. (2.12).
6.2 Preliminary Decay Estimates
After the preparations of the previous section, we now carry out the steps (i), (ii) and (iii) as outlined in Section 6.1 above.
6.2.1 Estimates for Comparison Maps
Let \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) satisfy (LG) and let \(0<\alpha <1\). Throughout this paragraph, we fix \(\xi _{0}\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\), a radius \(0<\varrho _{\xi _{0}}<1\) and assume that \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) satisfies
for some \(0<\lambda \leqq \Lambda <\infty \). Moreover, we suppose that there exists a bounded and non-decreasing function \(\omega _{\xi _{0},\varrho _{\xi _{0}}}:{\mathbb {R}}_{\geqq 0}\rightarrow {\mathbb {R}}_{\geqq 0}\) with \(\lim _{t\searrow 0}\omega _{\xi _{0},\varrho _{\xi _{0}}}(t)=0\) such that
Finally, for \(0<r<R\) and \(x_{0}\in \Omega \) with \({\text {B}}(x_{0},R)\Subset \Omega \) and \(v\in {\text {C}}^{1,\alpha }(\overline{{\text {B}}(x_{0},r)};{\mathbb {R}}^{n})\) we put
The deviation \({\text {dev}}_{\alpha }\) captures how far v is away from minimising F on \({\text {C}}^{1,\alpha }(\overline{{\text {B}}(x_{0},r)};{\mathbb {R}}^{n})\) for its own boundary values. Conversely, \({\mathbf {t}}_{\alpha ,\xi _{0}}\) will prove instrumental to find the mentioned smallness condition which is necessary to infer the decay estimate of the Hölder continuous comparison maps. We have
Proposition 6.1
Let \(f,\alpha ,\xi _{0},\varrho _{\xi _{0}},\lambda ,\Lambda \) and \(\omega _{\xi _{0},\varrho _{\xi _{0}}}\) be as above. Then there exists \(1\leqq c_{{\text {comp}}}=c_{{\text {comp}}}(n,\lambda ,\Lambda )<\infty \) such that the following holds: If \(v\in {\text {C}}^{1,\alpha }(\overline{{\text {B}}(x_{0},R/2)};{\mathbb {R}}^{n})\) satisfies \({\mathbf {t}}_{\alpha ,\xi _{0}}(v;x_{0},R/2)<\varrho _{\xi _{0}}/c_{{\text {comp}}}\), then there exists a bounded, non-decreasing function \(\vartheta :{\mathbb {R}}_{\geqq 0}\rightarrow {\mathbb {R}}_{\geqq 0}\) with \(\lim _{t\searrow 0}\vartheta (t)=0\), only depending on \(n,\lambda ,\Lambda \) and \(\omega _{\xi _{0},\varrho _{\xi _{0}}}\), and a constant \(c=c(n,\lambda ,\Lambda )>0\) such that for all \(0<r<R/2\) we have
The preceding proposition essentially follows by reduction to the full gradient case as a consequence of Korn’s inequality. For the reader’s convenience, it is established in the Appendix, Section 9.3, together with the requisite estimates for linear systems.
6.2.2 Smoothing and Selection of Good Radii
In this section we concentrate on step (ii) and establish the required adjusting of the smoothing parameters. The following lemma and its corollary closely follow [10, Lemma 4.2] but with a slight change in the relevant constants. Here and in all of what follows, we choose and fix a constant \(\lambda _{{\text {con}}}>1\) for latter application of the convolution inequality from Proposition 5.1; for instance, \(\lambda _{{\text {con}}}:=1+\frac{1}{1000}\) will do.
Lemma 6.2
Let \(u\in {\text {BD}}_{{\text {loc}}}({\mathbb {R}}^{n})\), \(x_{0}\in {\mathbb {R}}^{n}\), \(r>0\) and put \(\xi _{0}:=({\text {E}}\!u)_{x_{0},r}\). Moreover, suppose that \({\widetilde{\Phi }}(u;x_{0},r)<1\), where \({\widetilde{\Phi }}\) is defined by (6.1). Then for each \(0<\alpha <1\) there exists \(c=c(n,\alpha )>0\) such that if
then the mollification \(u_{\varepsilon ,\varepsilon }\) of u [cf. (2.15)] satisfies
Proof
First observe that, as a consequence of elementary estimates for convolutions, we obtain with a constant \(c=c(n)>0\)
In fact, for \(x\in {\text {B}}(x_{0},\frac{r}{2})\) we have \(|\varepsilon (u_{\varepsilon ,\varepsilon })(x)-\xi _{0}| = |\rho _{\varepsilon }^{(2)}*(\varepsilon (u_{\varepsilon })-\xi _{0})(x)|\) and thus
On the other hand, for any radially symmetric standard mollifier \(\eta :{\text {B}}(0,1)\rightarrow [0,1]\) there exists a constant \(c_{\eta }>0\) such that for all \(g\in {\text {L}}^{1}({\mathbb {R}}^{n};{\mathbb {R}}_{{\text {sym}}}^{n\times n})\) and \(\delta >0\) there holds
which can be established by straightforward computation. Therefore, with \(c=c(n)>0\),
In consequence, adding (6.8) and (6.10) yields (6.7), and in order to arrive at the claimed estimate, we must give an estimate for \(\sup _{{\text {B}}(x_{0},r/2+\varepsilon )}|\varepsilon (u_{\varepsilon })-\xi _{0}|\). As \(\varepsilon \) is adjusted by (6.5) and thus \({\text {B}}(x,\varepsilon )\subset {\text {B}}(x_{0},r)\) for all \(x\in {\text {B}}(x_{0},\frac{r}{2}+\varepsilon )\), we obtain by Jensen’s inequality
for all such x, where \({\widetilde{\ell }}=(48\sqrt{n}\lambda _{{\text {con}}})^{n}\). Here, the ultimate estimate is due to our assumption \({\widetilde{\Phi }}(u;x_{0},r)<1\). By Lemma 2.11(d) with \(\ell =\sqrt{{\widetilde{\ell }}^{2}+2{\widetilde{\ell }}}\) and using (6.11), we obtain for all \(x\in {\text {B}}(x_{0},\frac{r}{2}+\varepsilon )\) with a constant \(c(n)>0\) (as our choice of \(\ell \) only depends on n)
Now, by (6.7), the specific choice of \(\varepsilon \) by (6.5), (6.12) and since \({\widetilde{\Phi }}(u;x_{0},r)<1\),
This is (6.6), and the proof is complete. \(\quad \square \)
Working from (6.6), Jensen’s inequality in conjunction with Lemma 2.11(d) then yields
Corollary 6.3
In the situation and adopting the terminology of Lemma 6.2, we have
6.2.3 Comparison Estimates and Decay
In this section, we let \(u\in {\text {GM}}(F;u_{0})\) be a generalised minimiser, where f satisfies the requirements of Theorem 1.2. Throughout, let \(x_{0}\in \Omega \) and \(R>0\) with \({\text {B}}(x_{0},R)\Subset \Omega \) be given. We put \(\xi _{0}:=({\text {E}}\!u)_{x_{0},R}\) and let \(\varrho _{\xi _{0}}>0\). For \(a\in {\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}})\) we recall from (2.16) the shifted integrand \(f_{a}:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) defined by
Given a map \(w:{\text {B}}(x_{0},R)\rightarrow {\mathbb {R}}^{n}\), we then define \({\widetilde{w}}_{a}:{\text {B}}(x_{0},R)\rightarrow {\mathbb {R}}^{n}\) by
Proposition 6.4
(Preliminary decay estimate) Let \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) be a convex function with (LG). Also, suppose that \(\xi _{0}\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\), \(0<\varrho _{\xi _{0}}<1\) are such that the following hold:
-
(a)
There exists a bounded and non-decreasing function \(\omega _{\xi _{0},\varrho _{\xi _{0}}}:{\mathbb {R}}_{\geqq 0}\rightarrow {\mathbb {R}}_{\geqq 0}\) with
$$\begin{aligned} \begin{aligned}&\lim _{t\searrow 0}\omega _{\xi _{0},\varrho _{\xi _{0}}}(t)=0,\\&|f''(\xi )-f''(\xi _{0})|\leqq \omega _{\xi _{0},\varrho _{\xi _{0}}}(|\xi -\xi _{0}|)\;\;\;\text {for all}\;\xi \in {\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}}). \end{aligned} \end{aligned}$$(6.15) -
(b)
\(m_{\xi _{0},\varrho _{\xi _{0}}}:=\min \{\lambda (z)\;\text {smallest eigenvalue of}\;f''(z):\;z\in \overline{{\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}})}\}>0.\)
Then there exist constants \(\Theta =\Theta (\varrho _{\xi _{0}},n,\alpha )\in (0,1)\) and
such that
imply that
holds for all \(0<r<R/4\). Here we have set \(v:=u_{\varepsilon ,\varepsilon }\) [cf. (2.15)] where
Proof
The comparison argument underlying the proof consists of three ingredients: Lemma 2.10 and Proposition 5.1, both expressing properties of (the symmetric gradients of) generic \({\text {BD}}_{{\text {loc}}}\)-maps, and generalised local minimality of u.
Step 1. Preliminaries. Let \(0<r<R/4\) and put \(a:=(\varepsilon (v))_{x_{0},r}\). By (6.18) and since \(t\mapsto {\mathbf {t}}_{\alpha ,\xi _{0}}(v;x_{0},t)\) is non-decreasing, we have by Lemma 6.2 and \(\xi _{0}=({\text {E}}\!u)_{x_{0},R}\)
where we can assume without loss of generality that \(c(n,\alpha )>1\). From here we deduce
and put, with \(c(n,\alpha )>0\) as in the preceding inequality,
With this choice of \(\Theta \), \({\widetilde{\Phi }}(u;x_{0},R)<\Theta \) implies \(|\xi _{0}-a|<\varrho _{\xi _{0}}/2\) by virtue of (6.19). Hence \({\mathbb {B}}(a,\frac{\varrho _{\xi _{0}}}{2})\subset {\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}})\), and so (6.15\(_{2}\)) continues to hold in \({\mathbb {B}}(a,\frac{\varrho _{\xi _{0}}}{2})\). Lemma 2.12(b) moreover implies that there exists \(c_{0}=c_{0}(\xi _{0},\varrho _{\xi _{0}},m_{\xi _{0},\varrho _{\xi _{0}}},{\text {Lip}}(f),\sup _{{\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}})}|f''|))>1\) such that
Step 2. Selection of good radii. For this proof, we put for \(w\in {\text {BD}}_{{\text {loc}}}(\Omega )\) with slight abuse of notation
whenever \(\omega \Subset \Omega \) has Lipschitz boundary \(\partial \omega \). By Lemma 2.12(a), \(f_{a}\geqq 0\) and so \({\overline{F}}_{a}\geqq 0\). To employ the comparison argument in step 3 from below, we require a suitable bound on the difference \({\overline{F}}_{a}[{\widetilde{v}}_{a};{\mathcal {A}}]-{\overline{F}}_{a}[{\widetilde{u}}_{a};{\mathcal {A}}]\) in terms of the excess \({\widetilde{\Phi }}(u;x_{0},R)\), \({\mathcal {A}}\subset {\text {B}}(x_{0},R)\) denoting an annulus. This task can, in general, only be achieved on certain annuli \({\mathcal {A}}\), and we proceed by constructing the latter. We define an exit index as
Then, by (6.16) and (6.20), \(N\geqq 15/({\widetilde{\Phi }}(u;x_{0},R))^{\frac{1}{2n+8\alpha }}\). We then put, for \(k\in \{1,\ldots ,8N\}\),
so that \(\kappa _{k}\in [\frac{5}{8}R,\frac{7}{8}R]\). By our choice (6.18) of \(\varepsilon \), we have \(R-2\varepsilon >\tfrac{7}{8}R\). Also, by Lemma 2.12(b), \(f_{a}\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) is of linear growth. Since moreover \(r<\tfrac{R}{4}\), Lemma 2.10(b) is applicable and yields that for any \(k\in \{1,\ldots ,N\}\) there exist \(t_{k}\in (\kappa _{8k-1},\kappa _{8k})\) and \(r_{k}\in (r,2r)\) with
Now, recalling the choice (6.18) of \(\varepsilon \), for \(k=1,\ldots ,N\) the annuli
are pairwise disjoint and contained in \({\text {B}}(x_{0},R)\). Let us address this point in detail: By our choice of \(\varepsilon \), disjointness of \({\mathcal {A}}_{k}\) and \({\mathcal {A}}_{k+1}\) is equivalent to
Now note that by construction, \(t_{k+1}-t_{k}>\kappa _{8k+7}-\kappa _{8k}=\frac{7}{500}R({\widetilde{\Phi }}(u;x_{0},R))^{\frac{1}{2n+8\alpha }}\), and so the last inequality of (6.26) is certainly satisfied provided \(\frac{72}{500}>({\widetilde{\Phi }}(u;x_{0},R))^{\frac{1}{2n+8\alpha }}\), which in turn follows from (6.20). Now, succesively employing (6.25), \(t_{N}\leqq \kappa _{8N}\leqq \tfrac{7}{8}R\), (6.18) and (6.20), we similarly arrive at \(s_{N}+2\lambda _{{\text {con}}}\sqrt{n}\varepsilon <R\). Thus, \({\mathcal {A}}_{k}\subset {\text {B}}(x_{0},R)\) for all \(k\in \{1,\ldots ,N\}\).
By pairwise disjointness of the \({\mathcal {A}}_{k}\)’s and \({\mathcal {A}}_{k}\subset {\text {B}}(x_{0},R)\), we can therefore conclude that there exists \(k'\in \{1,\ldots ,N\}\) such that
To extract information from this estimate, we employ the lower bound on N, cf. (6.22)ff., to obtain
For future purposes, let us particularly remark that
because of \(\sqrt{n}\geqq \sqrt{2}\) and hence
Step 3. Comparison estimates. Let now \(r_{k'},t_{k'},s_{k'}\) be defined as in step 2 so that \(r<r_{k'}<\tfrac{R}{2}<t_{k'}<s_{k'}<R\). We define a Lipschitz function \(\rho :{\text {B}}(x_{0},R)\rightarrow [0,1]\) by
for \(x\in {\text {B}}(x_{0},R)\). Then we have \(\psi :={\widetilde{v}}_{a}+\rho ({\widetilde{u}}_{a}-{\widetilde{v}}_{a})\in {\text {BD}}({\text {B}}(x_{0},R))\) and, in particular, \(\psi |_{\partial \!{\text {B}}(x_{0},s_{k'})}={\widetilde{u}}_{a}|_{\partial \!{\text {B}}(x_{0},s_{k'})}\). Since thus \(u|_{\partial \!{\text {B}}(x_{0},s_{k'})}=(\psi +A_{x_{0}})|_{\partial \!{\text {B}}(x_{0},s_{k'})}\)\({\mathscr {H}}^{n-1}\)-almost everywhere on \(\partial \!{\text {B}}(x_{0},s_{k'})\), generalised local minimality of u for F implies by virtue of the integration by parts formula (2.2)
Splitting \({\text {B}}(x_{0},s_{k'})\) according to the definition of \(\rho \), we consequently arrive at
Regrouping terms and employing (6.30), we consequently arrive at
Ad\({\mathbf {I}}\). By Jensen’s inequality and Lemma 2.11(b) in the first and (6.21), \(r\leqq r_{k'}\) in the second step, we find
Ad\({\mathbf {II}}\). In a similar vein as in the estimation of (6.30), we recall \(a=(\varepsilon (v))_{x_{0},r}\) to obtain
Ad\({\mathbf {III}}\). By our choice of \(r_{k'},t_{k'}\) in step 2, cf. (6.24), we use \(\Phi (u;x_{0},R)<1\) to bound \({\mathbf {III}}\) by
Ad\({\mathbf {IV}}\). This step of the proof crucially utilises the convolution inequality from Section 5, and to this end, we employ Corollary 5.2 with \(L=\tfrac{1}{s_{k'}-t_{k'}}\). In combination with (6.28), we hereafter obtain
We then arrive at the following string of inequalities:
where, in the final two steps, we have used that \( s_{k'}+2\lambda _{{\text {con}}}\sqrt{n}\varepsilon < R\) as established in step 2. We may now gather the estimates for \({\mathbf {I}},\ldots ,{\mathbf {IV}}\) to obtain with a constant \(c=c(n,\lambda _{{\text {con}}},c_{0})>0\)
Step 4. Conclusion. In order to arrive at the requisite form of the preliminary decay estimate (6.17), we estimate by succesive application of Jensen’s inequality and (6.21):
the ultimate estimate being valid due to our choice \(\xi _{0}=({\text {E}}\!u)_{x_{0},R}\) and \(r+2\varepsilon <R\). Combining this estimate with (6.34), we obtain (6.17), and the proof is complete. \(\quad \square \)
Proposition 6.5
In the situation of Proposition 6.4 we have
Proof
Adopting the terminology of step 1 of the previous proof, we leave the setting unchanged up to formula (6.23). Instead of Lemma 2.10(b) we use Lemma 2.10(a) to find, for each \(k\in \{1,\ldots ,N\}\), a number \(t_{k}\in (\kappa _{8k-1},\kappa _{8k})\) such that
providing the requisite substitute for formula (6.24). Equally, we find \(s_{k'}=t_{k'}+\frac{R}{500}{\widetilde{\Phi }}(u;x_{0},R)^{\frac{1}{2n+8\alpha }}\) such that
the annulus \({\mathcal {A}}_{k'}\) now being defined as in (6.25) with the obvious change of \(t_{k'}\) and \(s_{k'}\). Let \(\theta >0\) be arbitrary. We then put
and find \(\varphi _{1}\in {\mathcal {C}}_{1}\), \(\varphi _{2}\in {\mathcal {C}}_{2}\) such that
Let us note that, employing an integration by parts, for all \(\varphi \in {\mathcal {C}}_{1}\) there holds
By definition of \(f_{a}\), we then obtain
Since \(\varphi _{1},\varphi _{2}\) are Lipschitz and coincide on \(\partial \!{\text {B}}(x_{0},t_{k'})\), we deduce that the glued map \(\varphi _{3}:=\mathbb {1}_{\overline{{\text {B}}(x_{0},t_{k'})}}\varphi _{1}+\mathbb {1}_{{\mathcal {A}}(x_{0};t_{k'},s_{k'})}\varphi _{2}\) belongs to \({\text {W}}^{1,\infty }({\text {B}}(x_{0},s_{k'});{\mathbb {R}}^{n})\). We then obtain, using that \(t\mapsto {\text {dev}}_{\alpha }(v;x_{0},t)\) is non-decreasing in the first inequality,
Ad\({\mathbf {V}}\). Splitting \({\text {B}}(x_{0},s_{k'})={\text {B}}(x_{0},t_{k'})\cup {\mathcal {A}}(x_{0};s_{k'},t_{k'})\) and employing (6.38), we obtain
Ad\({\mathbf {VI}}\). Different from step 3 of the proof of Proposition 6.4, we now use the comparison map \({\overline{\psi }}:=\varphi _{3}+\rho ({\widetilde{u}}_{a}-{\widetilde{v}}_{a})\), \(\rho \) still being defined by (6.29) but now with the new choices of \(t_{k'}\) and \(s_{k'}\). In advance, we note that \({\overline{\psi }}={\widetilde{u}}_{a}\)\({\mathscr {H}}^{n-1}\)-almost everywhere on \(\partial \!{\text {B}}(x_{0},s_{k'})\). Since
the generalised local minimality of u for F and \(\rho |_{{\text {B}}(x_{0},t_{k'})}=0\) yields
Combining the estimates for \({\mathbf {V}}\) and \({\mathbf {VI}}\) yields
Now we employ (6.35) and send \(\theta \searrow 0\) to conclude. The proof is complete. \(\quad \square \)
Remark 6.6
As mentioned after Proposition 5.1, the crude but easier obtainable estimate (5.2) is not sufficient for applications in the proof of Propositions 6.4 and 6.5. In fact, by the above proof we are bound to set \(L=\frac{1}{s_{k'}-t_{k'}}\). With the particular choice of \(\varepsilon \) by (6.18), we then find that \(L\varepsilon ^{\beta }\) cannot be suitably bounded to still arrive at the requisite decay estimate. On the other hand, one might redefine \(\varepsilon \), but then estimates of the remaining proof cannot be obtained in the requisite form and the decay estimate cannot be inferred.
Corollary 6.7
(\(\varepsilon \)-regularity) Let \(f\in {\text {C}}^{2}({\mathbb {R}}_{{\text {sym}}}^{n\times n})\) be a convex integrand with (LG) and suppose that there exist \(z_{0}\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) and \(\varrho _{z_{0}}>0\) such that the following hold:
-
(C1)
\(m_{z_{0},\varrho _{z_{0}}}>0\),
-
(C2)
For all \(\xi ,\xi '\in {\mathbb {B}}(z_{0},\varrho _{z_{0}})\) there holds \(|f''(\xi )-f''(\xi ')|\leqq \omega (|\xi -\xi '|)\) with a bounded and non-decreasing function \(\omega :{\mathbb {R}}_{\geqq 0}\rightarrow {\mathbb {R}}_{\geqq 0}\) such that \(\lim _{t\searrow 0}\omega (t)=0\).
For any \(\alpha \in (0,1)\) there exist \(\varepsilon _{0}\in (0,1]\) and \(\sigma \in (0,1)\) such that the following holds for all \(u\in {\text {GM}}_{{\text {loc}}}(F)\): If \(x\in \Omega \) and \(R>0\) are such that \({\text {B}}(x,R)\Subset \Omega \) and
-
(a)
\(|({\text {E}}\!u)_{x,R}-z_{0}|<\varrho _{z_{0}}/3\),
-
(b)
\({\widetilde{\Phi }}(u;x,R)<\varepsilon _{0}\),
then there holds \({\widetilde{\Phi }}(u;x,\sigma ^{j}R)\leqq \sigma ^{2\alpha j}{\widetilde{\Phi }}(u;x,R)\) for all \(j\in {\mathbb {N}}_{0}\). In particular, \(\varepsilon _{0}\) and \(\sigma \) only depend on \(n,\lambda _{{\text {con}}},\varrho _{z_{0}},m_{z_{0},\varrho _{z_{0}}},\omega ,c_{1},c_{2}\) and \(\sup _{{\mathbb {B}}(z_{0},\varrho _{z_{0}})}|f''|\).
Proof
Let \(\alpha \in (0,1)\), \(x\in \Omega \) and \(R>0\) be such that \({\text {B}}(x,R)\subset \Omega \). Note that, if \(\xi _{0}\in {\mathbb {B}}(z_{0},2\varrho _{\xi _{0}})\) with \(\varrho _{\xi _{0}}:=\varrho _{z_{0}}/3\), then \(m_{\xi _{0},\varrho _{\xi _{0}}}>0\) and \(|f''(\xi )-f''(\xi _{0})|\leqq \omega _{\xi _{0},\varrho _{\xi _{0}}}(|\xi -\xi _{0}|)\) for all \(\xi \in {\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}})\) with \(\omega _{\xi _{0},\varrho _{\xi _{0}}}=\omega \). We put \(\xi _{0}:=({\text {E}}\!u)_{x,R}\).
We pick the constants \(\Theta ,c>0\) from Proposition 6.4 with \(x_{0}=x\), fix the mollification parameter \(\varepsilon \) as in (6.18) and let \(\varepsilon _{0}\in (0,\Theta )\) to be fixed later on. Thus, for all \(0<r<\frac{R}{4}\), (6.17) is in action with \(v=u_{\varepsilon ,\varepsilon }\). From Lemma 6.2 applied to the radius R, we obtain \({\mathbf {t}}_{\alpha ,\xi _{0}}(v;x,\frac{R}{2})\leqq c(n,\alpha ){\widetilde{\Phi }}(u;x,R)^{\frac{\alpha }{n+4\alpha }}\), \(c(n,\alpha )>0\) denoting the constant from Lemma 6.2. Thus, diminishing \(\varepsilon _{0}\), we may assume that \(c(n,\alpha )\varepsilon _{0}^{\frac{\alpha }{n+4\alpha }}<\min \{\frac{\varrho _{\xi _{0}}}{c_{{\text {comp}}}},\frac{1}{2}\}\), \(c_{{\text {comp}}}>0\) being the constant from Proposition 6.1. This entails \({\mathbf {t}}_{\alpha ,\xi _{0}}(v;x,\frac{R}{2})<\frac{1}{2}\) and thus \(|\varepsilon (v)(y)-(\varepsilon (v))_{x,r}|\leqq 1\) for all \(0<r<\frac{R}{4}\) and \(y\in {\text {B}}(x,r)\). Therefore, a consecutive application of Proposition 6.1, Lemma 2.11(c), Corollary 6.3 and Proposition 6.5 yields
for all \(0<r<\frac{R}{4}\). In conclusion, (6.17) yields the existence of \(\varepsilon _{0}^{(1)}\in (0,1]\) and \(c_{{\text {dec}}}>0\) such that there holds
for all \(0<r<\frac{R}{4}\), the non-negative function \(H:{\mathbb {R}}_{\geqq 0}\rightarrow {\mathbb {R}}_{\geqq 0}\) being given by
cf. Proposition 6.1 for the introduction of \(\vartheta \). Tracking dependencies, \(\varepsilon _{0}^{(1)}\), \(c_{{\text {dec}}}\) and H only depend on \(n,\lambda _{{\text {con}}},\varrho _{z_{0}},m_{z_{0},\varrho _{z_{0}}},\omega ,{\text {Lip}}(f)\) and \(\sup _{{\mathbb {B}}(z_{0},\varrho _{z_{0}})}|f''|\). We now define
and, using that \(\lim _{t\searrow 0}H(t)=0\), choose \(\varepsilon _{0}^{(2)}>0\) such that there holds
We now define \(\varepsilon _{0}:=\min \{\varepsilon _{0}^{(1)},\varepsilon _{0}^{(2)}\}\) and claim that, if \(x\in \Omega \) and \(R>0\) are such that \({\text {B}}(x,R)\Subset \Omega \) with \(|({\text {E}}\!u)_{x,R}-z_{0}|<\varrho _{z_{0}}/3\) and \({\widetilde{\Phi }}(u;x,R)<\varepsilon _{0}\), then there holds \({\widetilde{\Phi }}(u;x,\sigma ^{j}R)\leqq \sigma ^{\alpha j}{\widetilde{\Phi }}(u;x,R)\) for all \(j\in {\mathbb {N}}_{0}\). To conclude the proof by iteration, we put
and establish validity of
for all \(j\in {\mathbb {N}}_{0}\). This is trivial for \(j=0\). Now assume validity of (\(\text {Dec}_{j}\)) for some \(j\in {\mathbb {N}}_{0}\), the second part of which implies \(\xi _{0}^{(j)}\in {\mathbb {B}}(z_{0},\tfrac{2}{3}\varrho _{z_{0}})\) so that C1 and C2 continue to hold in \({\mathbb {B}}(\xi _{0}^{(j)},\frac{1}{3}\varrho _{z_{0}})\). Moreover, the first part of (\(\text {Dec}_{j}\)) yields \({\widetilde{\Phi }}(u;x,\sigma ^{j}R)<\varepsilon _{0}^{(1)}\). Therefore,
Toward the second part of (\(\text {Dec}_{j}\)), it suffices to establish \(|\xi _{0}^{(j+1)}-\xi _{0}^{(j)}|\leqq 0 \frac{1}{3}\varrho _{z_{0}}2^{-j-1}\). Observe that
which, by definition of V, entails \(|\xi _{0}^{(j+1)}-\xi _{0}^{(j)}|<\sqrt{3}\). Therefore, by Lemma 2.11(a), (c),
The proof of the corollary is thereby complete.\(\quad \square \)
6.3 Proof of Theorem 1.2
We can now proceed to the
Proof of Theorem 1.2
Let \(u\in {\text {GM}}(F;u_{0})\) and \((x_{0},z_{0})\in \Omega \times {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) be such that \(f''(z_{0})\) is positive definite and (1.8) is satisfied. Since \(f''\) is continuous, there exists \(\varrho _{z_{0}}>0\) such that C1 and C2 from Corollary 6.7 are satisfied. Let \(\varepsilon _{0}>0\) be as in Corollary 6.7. By (1.8), \(\lim _{R\searrow 0}(|{\mathscr {E}}u-z_{0}|)_{x_{0},R}+|({\text {E}}\!^{s}u)_{x_{0},R}|=0\), and since \(V(\cdot )\leqq |{\cdot }|\),
By (1.8) and \({\mathscr {E}}u\in {\text {L}}_{{\text {loc}}}^{1}(\Omega ;{\mathbb {R}}_{{\text {sym}}}^{n\times n})\), we conclude that there exists some \(R_{0}>0\) and an open neighbourhood \(U_{1}\) of \(x_{0}\) such that \({\text {B}}(x_{0},2R_{0})\Subset \Omega \) and
hold for all \(x\in U_{1}\). Diminishing \(U_{1}\) if necessary, we can assume that \(U_{1}\subset {\text {B}}(x_{0},R_{0})\). Let \(x\in U_{1}\), so that \({\text {B}}(x,R_{0})\subset {\text {B}}(x_{0},2R_{0})\). Thus,
On the other hand, since \(V(\cdot )\leqq |{\cdot }|\) and \(V^{\infty }(\cdot )=|{\cdot }|\),
As a conclusion, conditions (a) and (b) from Corollary 6.7 are satisfied for all \(x\in U_{1}\) and \(R=R_{0}\). Therefore, there exists \(C>0\) such that \({\widetilde{\Phi }}(u;x,r)\leqq C (r/R_{0})^{2\alpha }{\widetilde{\Phi }}(u;x,R_{0})\) holds for all \(x\in U_{1}\) and \(0<r<R_{0}/4\). By definition of \({\widetilde{\Phi }}\), this implies \({\text {E}}\!^{s}u\equiv 0\) in \(U_{1}\) and hence . Now, for all such x and r,
and so, Lemma 2.11(d) yields a constant \(c(R_{0},\varepsilon _{0})>0\) such that
Now, the usual Campanato–Meyers characterisation of Hölder continuity [43, Theorem 2.9] implies that \(\varepsilon (u)\) is of class \({\text {C}}^{0,\alpha }\) and hence \({\text {L}}^{2}\) in a neighbourhood \({\widetilde{U}}\) of \(x_{0}\). Thus, by Lemma 2.2 (a),
We again invoke the Campanato–Meyers characterisation of Hölder continuity to conclude that u is of class \({\text {C}}^{1,\alpha }\) in an open neighbourhood of \(x_{0}\). Finally, by the Lebesgue differentiation theorem for Radon measures, cf. (2.13), \({\mathscr {L}}^{n}\)-almost everywhere \(x_{0}\in \Omega \) satisfies (1.8), and the proof of the theorem is complete. \(\quad \square \)
7 Remarks and Extensions
We conclude the paper with some remarks on possible generalisations of Theorems 1.1 and 1.2 with focus on non-autonomous problems. First, by the very nature of the proofs, Theorem 1.1 and 1.2 straightforwardly generalise to local generalised minima. Second, in analogy with [10, Section 6], if \(f:\Omega \times {\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) and \(g:\Omega \times {\mathbb {R}}^{n}\rightarrow {\mathbb {R}}\) are such that
for some \(c_{1},\ldots ,c_{4}>0\), \(\gamma >0\) and \(0<\mu <1\), then Theorem 1.2 generalises to functionals
More precisely, let \(u\in {\text {GM}}_{{\text {loc}}}(F)\) and suppose that \((x_{0},z_{0})\in \Omega \times {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) is such that \(z_{0}\) is the Lebesgue value of \({\text {E}}\!u\) at \(x_{0}\), Moreover, assume that there exists \(\lambda >0\) such that \(\lambda |z|^{2}\leqq \langle {\text {D}}_{z}^{2}f(x,z_{0})z,z\rangle \) holds for all \(z\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) uniformly in an open neighbourhood of \(x_{0}\). Then there exists an open neighbourhood U of \(x_{0}\) such that u has \(\alpha =\alpha (\mu )\)-Hölder continuous full gradients in U. Let us, however, note that a corresponding result is far from clear if the overall variational integrand \((x,y,z)\mapsto f(x,z)+g(x,y)\) does not possess the splitting structure but is of the general form \((x,y,z)\mapsto f(x,y,z)\).
Namely, here one usually invokes Caccioppoli’s inequality in conjunction with Gehring’s lemma on higher integrability to conclude that minima of elliptic problems belong to some \({\text {W}}_{{\text {loc}}}^{1,r}\), \(r>p\), where p is the Lebesgue exponent of the natural energy space \({\text {W}}^{1,p}\). As explained in [47], there exist linear growth integrands and generalised minimisers \(u\in {\text {BV}}{\setminus }{\text {W}}^{1,1}\) which do satisfy a Caccioppoli type inequality. This easily carries over to the \({\text {BD}}\)-situation, and hereby rules out any integrability boost by virtue of Gehring. On the other hand, even for semiautonomous integrands \((x,z)\mapsto f(x,z)\), a well-known counterexample due to Bildhauer [18, Theorem 4.39] asserts that if \(f\in {\text {C}}^{2}({\overline{\Omega }}\times {\mathbb {R}}^{N\times n})\) satisfies a uniform variant of (1.4) for \(a>3\), then generalised minima might in fact belong to \({\text {BV}}{\setminus }{\text {W}}^{1,1}\). In particular, the Caccioppoli inequality itself cannot yield higher integrability results in the linear growth setting.
On the other hand, the approach of Section 6 is robust enough to apply to integrands\((x,y,z)\mapsto f(x,y,z)\) indeed if suitable superlinear growth in the last variable is imposed and thus the Gehring-type improvement is available (cf. [10, Theorem 6.1]):
Remark 7.1
(p-growth functionals: Partial regularity) Let \(1<p<\infty \), \(0<\mu <1\) and let \(f:\Omega \times {\mathbb {R}}^{n}\times {\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) be a variational integrand that satisfies
for all \(x,x_{1},x_{2}\in \Omega \), \(y,y_{1},y_{2}\in {\mathbb {R}}^{n}\) and \(z\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) and constants \(c_{1},\ldots ,c_{4}>0\), \(\gamma >0\). Let \(u\in {\text {W}}_{{\text {loc}}}^{1,p}(\Omega ;{\mathbb {R}}^{n})\) be a local minimiser of the variational integral corresponding to f. Moreover, let \((x_{0},y_{0},z_{0})\in \Omega \times {\mathbb {R}}^{n}\times {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) is such that \(x_{0}\) is a Lebesgue point for both u and \(\varepsilon (u)\), with Lebesgue values \(y_{0}\) or \(z_{0}\), respectively. If there exists \(\lambda >0\) such that \(\lambda |z|^{2}\leqq \langle {\text {D}}_{z}^{2}f(x,y,z_{0})z,z\rangle \) holds for all \(z\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) uniformly in an open neighbourhood of \((x_{0},y_{0})\), then u has Hölder continuous gradients in an open neighbourhood of \(x_{0}\).
An adaption to convex Orlicz integrands in the spirit of [28] is equally possible, and the preceding remark in the superlinear growth case goes along with the results from [25] (also see [40]). In view of partial regularity, we have omitted symmetric quasiconvex functionals throughout. In fact, at present it is not known how to modify the method exposed in Section 6 even in the full gradient case (also see the discussion in [10, 69]). The only result available in the \({\text {BV}}\)-full gradient, strongly quasiconvex case is due to Kristensen and the author [47], and the case of strongly symmetric-quasiconvex functionals on \({\text {BD}}\) is due to the author [45]. If the condition of strong symmetric quasiconvexity pro forma is introduced for convex\({\text {C}}^{2}\)-integrands, then it translates to 3-elliptic integrands in the sense of (1.4) and does not apply to the very degenerate ellipticity regime covered by Theorem 1.2. Whereas the main obstructions in [45, 47] stem from the weakened convexity notion, they moreover require higher regularity of the variational integrands, namely, \({\text {C}}_{{\text {loc}}}^{2,\mu }\) for some \(\mu >1-\frac{2}{n}\). In this sense, the results of [45] and Theorem 1.2 are independent.
As to Sobolev regularity, the case of non-autonomous integrands \((x,z)\mapsto f(x,z)\) which satisfy the obvious modification of (1.4) uniformly in x, however, is more intricate. Even if f is of class \({\text {C}}^{2}\) in the joint variable and satisfies the estimates corresponding to [19, Assumption 4.22], it is not fully clear to arrive at the decoupling estimates that eliminate the divergence as done in the proof of Theorem 4.3. Whereas for partial regularity \({\text {C}}^{0,\alpha }\)-Hölder continuous x-dependence of \({\text {D}}_{z}f\) still suffices, the corresponding Sobolev regularity theory is far from clear when aiming at an ellipticity regime beyond \(1<a<1+\frac{1}{n}\) (also see Baroni, Colombo and Mingione [12] for the related borderline case \(\frac{q}{p}=1+\frac{\alpha }{n}\) in the superlinear growth regime). Namely, in this case the Euler–Lagrange equations satisfied by (generalised) minima cannot be differentiated. In the full gradient, superlinear growth regime, this setting has been extensively studied by Mingione [61, 63, 64] and Kristensen and Mingione [54,55,56] in the (fully) non-autonomous context. Here, Nikolskiĭ estimates are employed, and an adaptation to linear growth functionals with \({\text {C}}^{0,\alpha }\)-Hölder continuous x-dependence of \({\text {D}}_{z}f\) will yield the ellipticity regime \(1<a<1+\frac{\alpha }{n}\) for \({\text {W}}_{{\text {loc}}}^{1,1}\)-regularity of generalised minima (along the lines of [46]), being unclear to be improved via the second order estimates of Section 4.
Lastly, let \({\mathbb {A}}[D]\) be a first order, constant-coefficient, homogeneous differential operator \({\mathbb {A}}[D]\) on \({\mathbb {R}}^{n}\) between the two finite dimensional vector spaces V, W. Then the canonical Dirichlet problem (1.2) has a relaxed minimiser in \({\text {BV}}^{{\mathbb {A}}}(\Omega ):=\{v\in {\text {L}}^{1}(\Omega ;V):\;{\mathbb {A}}[D]u\in {\mathscr {M}}(\Omega ;W)\}\) if \({\mathbb {A}}\) is \({\mathbb {C}}\)-elliptic (and hence \({\text {L}}^{1}\)-traces of \({\text {BV}}^{{\mathbb {A}}}\)-maps are definable), cf. [21, Theorem 5.3]. By means of Hilbert–Nullstellensatz-techniques [48, 76], the splitting strategy underlying Theorem 4.3—yet being technically more demanding—is likely to work as well. On the other hand, based on the Poincaré-type inequalities from [21], the partial regularity result from Theorem 1.2 hinges on the existence of a mollifier \(\rho \) such that\(\rho *\pi =\pi \) for all \(\pi \in \ker ({\mathbb {A}}[D])\). This is a consequence of the Bramble–Hilbert lemma, and we shall pursue this elsewhere.
Notes
From now, if the symmetric gradient of an integrable map is a measure, we write \({\text {E}}\!u\) instead of \(\varepsilon (u)\).
The only systematic \({\text {W}}^{1,1}\)-regularity theory for \(a>3\) is available for Neumann-type problems on \({\text {BV}}\), cf. Beck, Bulíček and the author [13], being conceptually different from the Dirichlet problem.
In view of Poincaré’s inequality, it would be sufficient to assume that \({\text {spt}}(\rho )\) is a connected Lipschitz domain.
Note that rigid deformations have zero divergence.
Namely, express the difference \(u-\rho _{\varepsilon }*u\) by the convolution integrals emerging from (2.4) and then use the embedding of \({\text {BD}}({\mathbb {R}}^{n})\) into \({\text {W}}_{{\text {loc}}}^{\alpha ,1}({\mathbb {R}}^{n};{\mathbb {R}}^{n})\) or \({\text {W}}^{s,(n-1+s)}({\mathbb {R}}^{n};{\mathbb {R}}^{n})\) for \(0<s<1\) (cf. [47]). Since suitable fractional potentials of \({\text {W}}^{\alpha ,1}\)-maps can be controlled conveniently, this allows to arrive at the claimed estimate for all \(0<\beta <1\).
The recession function is convex and of linear growth, thus Lipschitz, too.
References
Acerbi, E., Fusco, N.: A regularity theorem for minimizers of quasiconvex integrals. Arch. Ration. Mech. Anal. 99(3), 261–281, 1987
Acerbi, E., Fusco, N.: Regularity for minimizers of nonquadratic functionals: the case \(1<p<2\). J. Math. Anal. Appl. 140(1), 115–135, 1989
Acerbi, E., Mingione, G.: Regularity results for stationary electro-rheological fluids. Arch. Ration. Mech. Anal. 164, 213–259, 2002
Allard, W.K.: On the first variation of a varifold. Ann. Math. 2(95), 417–491, 1972
Almgren Jr., F.J.: Existence and regularity almost everywhere of solutions to elliptic variational problems among surfaces of varying topological type and singularity structure. Ann. Math. 2(87), 321–391, 1968
Ambrosio, L., Coscia, A., Dal Maso, G.: Fine properties of functions with bounded deformation. Arch. Ration. Mech. Anal. 139(3), 201–238, 1997 (English summary)
Ambrosio, L., Fusco, N., Pallara, D.: Functions of bounded variation and free discontinuity problems, p. xviii+434. Oxford Mathematical Monographs. The Clarendon Press, Oxford University Press, New York 2000
Anzellotti, G.: The Euler equation for functionals with linear growth. Trans. Am. Math. Soc. 290, 483–501, 1985
Anzellotti, G., Giaquinta, M.: Existence of the displacement field for an elastoplastic body subject to Hencky’s law and von Mises yield condition. Manuscr. Math. 32(1–2), 101–136, 1980
Anzellotti, G., Giaquinta, M.: Convex functionals and partial regularity. Arch. Ration. Mech. Anal. 102(3), 243–272, 1988
Babadjian, J.-F.: Traces of functions of bounded deformation. Indiana Univ. Math. J. 64(4), 1271–1290, 2015
Baroni, P., Colombo, M., Mingione, G.: Regularity for general functionals with double phase. J. Calc. Var. 57, 62, 2018
Beck, L., Bulíček, M., Gmeineder, F.: On a Neumann problem for variational functionals of linear growth. To appear at Annali della Scuola Normale Superiore
Beck, L., Schmidt, T.: On the Dirichlet problem for variational integrals in BV. J. Reine Angew. Math. 674, 113–194, 2013
Beck, L., Schmidt, T.: Interior gradient regularity for BV minimizers of singular variational problems. Nonlinear Anal. 120, 86–106, 2015
Bernstein, S.: Sur les equations du calcul des variations. Ann. Sci. Ecole Norm. Sup. 29, 431–485, 1912
Bildhauer, M., Fuchs, M.: On a class of variational integrals with linear growth satisfying the condition of \(\mu \)-ellipticity. Rend. Mat. Appl. (7)22(2002), 249–274, 2003
Bildhauer, M.: A priori gradient estimates for bounded generalised solutions of a class of variational problems with linear growth. J. Convex Anal. 9, 117–137, 2002
Bildhauer, M.: Convex variational problems. Linear, nearly linear and anisotropic growth conditions. Lecture Notes in Mathematics, 1818, p. x+217. Springer-Verlag, Berlin 2003
Breit, D., Diening, L.: Sharp conditions for Korn inequalities in Orlicz spaces. J. Math. Fluid Mech. 14(3), 565–573, 2012
Breit, D., Diening, L., Gmeineder, F.: On the trace operator for functions of bounded \({\mathbb{A}}\)-variation. Anal. PDE13(2), 559–594, 2020
Carozza, M., Kristensen, J., Passarelli Di Napoli, A.: Higher differentiability of minimizers of convex variational integrals. Ann. l’Instit. Henri Poincare (C) Anal. Non Lineaire Issue 328, 395–411, 2011
Cianchi, A.: Korn type inequalities in Orlicz spaces. J. Funct. Anal. 267, 2313–2352, 2014
Conti, S., Faraco, D., Maggi, F.: A new approach to counterexamples to \({{\rm L}}^{1}\)-estimates: Korn’s inequality, geometric rigidity, and regularity for gradients of separately convex functions. Arch. Ration. Mech. Anal. 175, 287–300, 2005
Conti, S., Focardi, M., Iurlano, F.: A note on the Hausdorff dimension of the singular set of solutions to elasticity type systems. Commun. Contemp. Math. 21(06), 1950026, 2019
Christiansen, E., Matthies, H., Strang, G.: The saddle point of a differential program, pp. 309–318. Energy methods in finite element analysis, Wiley, Chichester 1979
De Giorgi, E.: Frontiere orientate di misura minima. (Italian) Seminario di Matematica della Scuola Normale Superiore di Pisa, 1960-61 Editrice Tecnico Scientifica, Pisa 57 pp 1961
Diening, L., Lengeler, D., Stroffolini, B., Verde, A.: Partial regularity for minimizers of quasi-convex functionals with general growth. SIAM J. Math. Anal. 44(5), 3594–3616, 2012
Duzaar, F., Grotowski, J.F.: Optimal interior partial regularity for nonlinear elliptic systems: the method of A-harmonic approximation. Manuscr. Math. 103(3), 267–298, 2000
Duzaar, F., Grotowski, J.F., Kronz, M.: Regularity of almost minimizers of quasi-convex variational integrals with subquadratic growth. Ann. Mat. Pura Appl. (4)184(4), 421–448, 2005
Duzaar, F., Mingione, G.: Regularity for degenerate elliptic problems via \(p\)-harmonic approximation. Ann. Inst. Henri Poincaré, Anal. Non Linéaire21(5), 735–766, 2004
Duzaar, F., Mingione, G.: The \(p\)-harmonic approximation and the regularity of \(p\)-harmonic maps. Calc. Var. Partial. Differ. Equ. 20, 235–256, 2004
Ekeland, I.: On the variational principle. J. Math. Anal. Appl. 47, 324–353, 1974
Esposito, L., Leonetti, F., Mingione, G.: Higher integrability for minimizers of integral functionals with \((p, q)\)-growth. J. Differ. Equations157, 414–438, 1999
Evans, L.C.: Quasiconvexity and partial regularity in the calculus of variations. Arch. Ration. Mech. Anal. 95(3), 227–252, 1986
Evans, L.C.: Partial differential equations. Graduate Studies in Mathematics, 19., 2nd edn. American Mathematical Society, Providence, RI 2010
Finn, R.: Remarks relevant to minimal surfaces and to surfaces of prescribed mean curvature. J. Anal. Math. 14, 139–160, 1965
Fuchs, M., Mingione, G.: Full \({{\rm C}}^{1,\alpha }\)-regularity for free and constrained local minimizers of elliptic variational integrals with nearly linear growth. Manuscr. Math. 102, 227–250, 2000
Fuchs, M., Seregin, G.: A regularity theory for variational integrals with \(L ln L\)-growth. Calc. Var. Partial Differential Equations6(2), 171–187, 1998
Fuchs, M., Seregin, G.: Variational Methods for Problems from Plasticity Theory and for Generalized Newtonian Fluids. Lecture Notes in Mathematics, vol. 1749. Springer, Berlin 2000
Giaquinta, M.: Multiple integrals in the calculus of variations and nonlinear elliptic systems. Annals of Mathematics Studies, 105. Princeton University Press, Princeton, NJ, 1983
Giaquinta, M., Modica, G., Soucek, J.: Functionals with linear growth in the calculus of variations. I, II. Comment. Math. Univ. Carolin. 20(1), 143-156–172, 1979
Giusti, E.: Direct Methods in the Calculus of Variations. World Scientific Publishing Co., Inc., River Edge 2003
Gmeineder, F.: Symmetric-convex functionals of linear growth. J. Ell. Par. Equations 2(1–2), 59–71, 2016
Gmeineder, F.: Partial regularity for symmetric quasiconvex functionals. Preprint. arXiv:1903.08639
Gmeineder, F., Kristensen, J.: Sobolev regularity for convex functionals on \({{\rm BD}}\). J. Calc. Var. 58, 56, 2019
Gmeineder, F., Kristensen, J.: Partial regularity for BV minimizers. Arch. Ration. Mech. Anal. 232(3), 1429–1473, 2019
Gmeineder, F., Raita, B., Van Schaftingen, J.: On limiting trace inequalities for vectorial differential operators. To appear at Indiana Univ. Math. J. Preprint. arXiv:1903.08633
Goffman, C., Serrin, J.: Sublinear functions of measures and variational integrals. Duke Math. J. 31, 159–178, 1964
Kazaniecki, K., Stolyarov, D.M., Wojciechowski, M.: Anisotropic Ornstein non inequalities. Anal. PDE10, 351–366, 2017
Kirchheim, B., Kristensen, J.: On rank one convex functions that are homogeneous of degree one. Arch. Ration. Mech. Anal. 221(1), 527–558, 2016
Kohn, R.V.: New Estimates for Deformations in Terms of their Strains. Ph.D. Thesis, Princeton Univ., 1979
Kohn, R.V.: New integral estimates for deformations in terms of their nonlinear strains. Arch. Ration. Mech. Anal. 78(2), 131–172, 1982
Kristensen, J., Mingione, G.: The singular set of minima of integral functionals. Arch. Ration. Mech. Anal. 180(3), 331–398, 2006
Kristensen, J., Mingione, G.: The singular set of \(\omega \)-minima. Arch. Ration. Mech. Anal. 177(1), 93–114, 2005
Kristensen, J., Mingione, G.: Non-differentiable functionals and singular sets of minima. C. R. Math. Acad. Sci. Paris340(1), 93–98, 2005
Ladyzhenskaya, O.A., Ural’ceva, N.N.: Local estimates for gradients of solutions of non-uniformly elliptic and parabolic equations. Commun. Pure Appl. Math. 83, 677–703, 1970
Marcellini, P., Sbordone, C.: On the existence of minima of multiple integrals of the calculus of variations. J. Math. Pures Appl. IX. Sér. 62, 1–9, 1983
Marcellini, P.: Regularity of minimizers of integrals of the calculus of variations with non standard growth conditions. Arch. Ration. Mech. Anal. 105, 267–284, 1989
Marcellini, P.: Regularity and existence of solutions of elliptic equations with \(p, q\)-growth conditions. J. Differ. Equations90, 1–30, 1991
Mingione, G.: The singular set of solutions to non-differentiable elliptic systems. Arch. Ration. Mech. Anal. 166(4), 287–301, 2003
Mingione, G.: Regularity of minima: an invitation to the dark side of the calculus of variations. Appl. Math. 51(4), 355–426, 2006
Mingione, G.: Singularities of minima: a walk on the wild side of the calculus of variations. J. Global Optim. 40(1–3), 209–223, 2008 (English summary)
Mingione, G.: Bounds for the singular set of solutions to non linear elliptic systems. Calc. Var. Partial Differential Equations18(4), 373–400, 2003
Ornstein, D.: A non-equality for differential operators in the \({{\rm L}}_{1}\)-norm. Arch. Ration. Mech. Anal. 11, 40–49, 1962
Reshetnyak, YuG: Weak convergence of completely additive vector functions on a set. Sib. Math. J. 9(6), 1039–1045, 1968
Reshetnjak, Yu.G: Estimates for certain differential operators with finite-dimensional kernel. Sibirsk. Mat. Z. 11, 414–428, 1970 (in Russian)
Santi, E.: Sul problema al contorno per l’equazione delle superfici di area minima su domini limitati qualqunque. Ann. Univ. Ferrara, N. Ser., Sez. VII17, 13–26, 1972
Schmidt, T.: Partial regularity for degenerate variational problems and image restoration models in BV. Indiana Univ. Math. J. 63, 213–279, 2014
Schmidt, T.: BV Minimizers of Variational Integrals: Existence, Uniqueness, Regularity. Habilitationsschrift (postdoctoral thesis), Friedrich-Alexander-Universität Erlangen-Nürnberg, 86 pp. 2015
Seregin, G.: Variation-difference schemes for problems in the mechanics of ideally elastoplastic media. U.S.S.R. Comput. Math. Math. Phys.25, 153–165, 1985; translation from Zh. Vychisl. Mat. Mat. Fiz. 25, 237–253, 1985
Seregin, G.: Differential properties of solutions of variational problems for functionals of linear growth. J. Sov. Math.64, 1256–1277, 1993; translation from Probl. Mat. Anal. 11(51–79), 1990
Seregin, G.: Two-dimensional variational problems of the theory of plasticity. Izv. Math.60, 179–216, 1996; translation from Izv. Ross. Akad. Nauk Ser. Mat.60, 175–210, 1996
Seregin, G.: Differentiability properties of weak solutions of certain variational problems in the theory of perfect elastoplastic plates. Appl. Math. Optim. 28, 307–335, 1993
Serrin, J.: The problem of Dirichlet for quasilinear elliptic differential equations with many independent variables. Phil. Trans R. Soc. Lond. Ser. A264(1153), 413–496, 1969
Smith, K.T.: Formulas to represent functions by their derivatives. Math. Ann. 188, 53–77, 1970
Strang, G., Temam, R.: Functions of bounded deformation. Arch. Ration. Mech. Anal.75(1), 7–21, 1980/81
Strauss, M.J.: Variations of Korn’s and Sobolev’s equalities. In: Partial Differential Equations (Proc. Sympos. Pure Math., Vol. XXIII, Univ. California, Berkeley, Calif., 1971), pp. 207–214. Amer. Math. Soc., Providence, RI, 1973
Suquet, P.-M.: Sur un nouveau cadre fonctionnel pour les Équations de la plasticité. C. R. Acad. Sci. Paris Sér. A-B286(23), A1129–A1132, 1978 (in French)
Temam, R.: Problemes mathématiques en plasticité. Gauthier-Villars, Paris 1985
Uhlenbeck, K.: Regularity for a class of nonlinear elliptic systems. Acta Math. 138, 219–240, 1977
Ural’ceva, N.N.: Quasilinear degenerate elliptic systems. Leningrad Odtel. Mat. Inst. Steklov (LOMI)7, 184–222, 1968. (in Russian)
Acknowledgements
Open Access funding provided by Projekt DEAL. I am grateful to Jan Kristensen for a fruitful interaction on the theme of the paper. Moreover, I am indebted to Gianni Dal Maso and Gregory Seregin for commenting on previous results of mine and thereby making valuable suggestions, which motivated the Sobolev regularity improvement compared to [46]. I am also thankful to Lars Diening for discussions related to the convolution-type Poincaré inequalities of Section 5, and to the anonymous referee for the careful reading of the manuscript and valuable suggestions. Financial support through the Hausdorff Center in Mathematics, Bonn, is gratefully acknowledged.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author hereby declares that there are no conflict of interest.
Additional information
Communicated by G. Dal Maso
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A: On Uniqueness and the Structure of \(\pmb {GM(F;u_{0})}\)
In Section 4.6 we addressed some uniqueness assertions and the structure of the set of generalised minimisers. Working from the assumption that generalised minima are unique up to rigid deformations, we here complete the proof of Corollary 4.8 with
Proposition 8.1
Let \(\Omega \subset {\mathbb {R}}^{n}\) be open and bounded with Lipschitz boundary \(\partial \Omega \) and let \(u_{0}\in {\text {LD}}(\Omega )\). Moreover, suppose that \(f:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) is convex integrand with (LG) such that for each \(\nu \in {\mathbb {R}}^{n}{\setminus }\{0\}\) the map \(f_{\nu }^{\infty }:{\mathbb {R}}^{n}\ni z \mapsto f^{\infty }(z\odot \nu )\) has strictly convex sublevel sets (in the sense of Section 4.6) and every two generalised minima differ by a rigid deformation. Then the following hold:
-
(a)
If there exists one generalised minimiser \(u\in {\text {GM}}(F;u_{0})\) with \({\text {Tr}}_{\partial \Omega }(u)={\text {Tr}}_{\partial \Omega }(u_{0})\)\({\mathscr {H}}^{n-1}\)-almost everywhere on \(\partial \Omega \), then \({\text {GM}}(F;u_{0})=\{u\}\).
-
(b)
If \(\partial \Omega \) moreover satisfies for all \(a\in {\mathbb {R}}\)
$$\begin{aligned} {\mathscr {H}}^{n-1}(\{x\in \partial \Omega :\;x_{i}=a\})= 0\qquad \text {for all}\;i\in \{1,\ldots ,n\}, \end{aligned}$$(8.1)then there exists \(u\in {\text {GM}}(F;u_{0})\) and \(\pi \in {\mathcal {R}}(\Omega )\) such that
$$\begin{aligned} {\text {GM}}(F;u_{0})=\{u+\lambda \pi :\;\lambda \in [-1,1]\}. \end{aligned}$$(8.2)
Note that the hypotheses of Corollary 4.7 imply those of the preceding proposition. For the rest of this section, we tacitly assume that the hypotheses of Proposition 8.1 are in action.
We begin with some preliminary considerations. Given \(u_{0}\in {\text {LD}}(\Omega )\) and a convex integrand with (LG), we start by noting that for any \(u\in {\text {GM}}(F;u_{0})\), the set
Convexity of \({\mathcal {R}}_{u}\) is a direct consequence of convexity of \({\overline{F}}_{u_{0}}[-;\Omega ]\) on \({\text {BD}}(\Omega )\). If \((\pi _{j})\subset {\mathcal {R}}_{u}\) satisfies \(\pi _{j}\rightarrow \pi \) in \({\mathscr {R}}(\Omega )\), then Lipschitz continuity of \(f^{\infty }\) readily implies that \({\overline{F}}_{u_{0}}[u+\pi ;\Omega ]=\lim _{j\rightarrow \infty }{\overline{F}}_{u_{0}}[u+\pi _{j};\Omega ]=\min {\overline{F}}_{u_{0}}[{\text {BD}}(\Omega );\Omega ]\) and hence \(\pi \in {\mathcal {R}}_{u}\), too.Footnote 6 Lastly, if \({\mathcal {R}}_{u}\) were not bounded, we would find \((\pi _{j})\subset {\mathcal {R}}_{u}\) with \(\Vert \pi _{j}\Vert \rightarrow \infty \) for an arbitrary norm \(\Vert \cdot \Vert \) on \({\mathscr {R}}(\Omega )\). There exists a constant \(c=c_{n}>0\) such that \(c|a|\,|b|\leqq |a\odot b|\) for all \(a,b\in {\mathbb {R}}^{n}\). Since, by (LG), \(c_{1}|z|\leqq f^{\infty }(z)\) for all \(z\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\), we find
so that the triangle inequality and equivalence of all norms on \({\mathscr {R}}(\Omega )\) yields the contradictory \(\sup _{j\in {\mathbb {N}}}\Vert \pi _{j}\Vert <\infty \). In consequence, (8.3) follows.
As an adaptation of [14, Lemma 6.2], we now establish that whenever \(\pi \in {\mathscr {R}}(\Omega )\) is such that \(u+\pi \in {\text {GM}}(F;u_{0})\), then there exists a \({\mathscr {H}}^{n-1}\)-measurable function \(\beta :\partial \Omega \rightarrow {\mathbb {R}}{\setminus } (0,1)\) such that \({\text {Tr}}_{\partial \Omega }(u_{0})(x)={\text {Tr}}_{\partial \Omega }(u)(x)+\beta (x)\pi (x)\) for \({\mathscr {H}}^{n-1}\)-almost everywhere \(x\in \partial \Omega \).
In fact, if \(u+\pi \in {\text {GM}}(F;u_{0})\), then by (8.3), \(u+\beta \pi \in {\text {GM}}(F;u_{0})\) for all \(\beta \in [0,1]\). In particular, we find
and since
we deduce that we have equality in (8.4) \({\mathscr {H}}^{n-1}\)-almost everywhere on \(\partial \Omega \). Because the map \(z\mapsto f^{\infty }(z\odot \nu _{\partial \Omega }(x))\) has strictly convex level sets for \({\mathscr {H}}^{n-1}\)-almost everywhere \(x\in \partial \Omega \), by [70, Lemma 4.8], for \({\mathscr {H}}^{n-1}\)-almost everywhere \(x\in \partial \Omega \) there exists \(R(x)\geqq 0\) such that
Clearly, on \(\{x\in \partial \Omega :\;\pi (x)=0\}\) we must have \(R=1\). Conversely, on \(\{x\in \partial \Omega :\;\pi (x)\ne 0\}\), we have \(R\ne 1\), \({\text {Tr}}_{\partial \Omega }((1-R)(u_{0}-u)-\pi )=0\) and hence \({\text {Tr}}_{\partial \Omega }(u_{0}-u)=\frac{1}{1-R}{\text {Tr}}_{\partial \Omega }(\pi )\). We may thus define
so that \(\beta (x)\in {\mathbb {R}}{\setminus } (0,1)\), and it is easily seen that \(\beta \) has the required properties.
Proof of Proposition 8.1 (a)
In [46] this has been established for convex domains, and we here give the general case. Suppose that \(v\in {\text {GM}}(F;u_{0})\) is a generalised minimiser. Then \(v=u+\pi \), and generalised minimality of v together with \({\text {Tr}}_{\partial \Omega }(u)={\text {Tr}}_{\partial \Omega }(u_{0})\)\({\mathscr {H}}^{n-1}\)-almost everywhere on \(\partial \Omega \) yields
Since \(f^{\infty }(a\odot b)\geqq C|a|\,|b|\) for some \(C>0\) and all \(a,b\in {\mathbb {R}}^{n}\), \(\pi =0\)\({\mathscr {H}}^{n-1}\)-almost everywhere (and thus, by continuity, everywhere) on \(\partial \Omega \). Write \(\pi (x)=Ax+b\) with \(A\in {\mathbb {R}}_{{\text {skew}}}^{n\times n}\) and \(b\in {\mathbb {R}}^{n}\). Clearly, for \(\Omega \) is open and bounded, \(\partial \Omega \) cannot be contained in an \((n-1)\)-dimensional affine hyperplane. If \(\dim (\ker (A))\leqq n-1\), then \(\ker (A)\) is contained in an \((n-1)\)-dimensional hyperplane H. We have, for some \(x_{0}\in \partial \Omega \), \(\{y:\;Ay=-b\}=x_{0}+\ker (A)\). Since \(\partial \Omega \not \subset x_{0}+\ker (A)\), we find \(x_{1}\in \partial \Omega \cap (x_{0}+\ker (A))^{c}\). Then, however, \(\pi (x_{1})=0\) implies \(Ax_{1}=-b\) and so \(x_{1}\in x_{0}+\ker (A)\), a contradiction. In consequence, necessarily \(\dim (\ker (A))=n\), in which case \(A=0\) so that, because of \(\pi \equiv 0\) on \(\partial \Omega \), \(b=0\) and hence \(\pi \equiv 0\) on \({\mathbb {R}}^{n}\). In conclusion, \(u=v\) and hence \({\text {GM}}(F;u_{0})=\{u\}\). \(\quad \square \)
We now establish Proposition 8.1(b) for \(n=2\); the higher dimensional case can be established by an analagous reasoning (note that elements of \({\mathscr {R}}(\Omega )\) then have slightly more complicated structure).
Proof of Proposition 8.1 (b)
By assumption, \({\text {GM}}(F;u_{0})=u+{\mathcal {R}}_{u}\), \({\mathcal {R}}_{u}\) being defined as in (8.3). Suppose that \({\mathcal {R}}_{u}\) contains two linearly independent elements \(\pi _{1},\pi _{2}\). Then, by the above discussion, we may write \({\text {Tr}}_{\partial \Omega }(u_{0})={\text {Tr}}_{\partial \Omega }(u)+\beta _{1}\pi _{1}={\text {Tr}}_{\partial \Omega }(u)+\beta _{2}\pi _{2}\)\({\mathscr {H}}^{1}\)-almost everywhere on \(\partial \Omega \) for some suitable \(\beta _{1,2}:\partial \Omega \rightarrow {\mathbb {R}}{\setminus } (0,1)\). Therefore, \(\beta _{1}\pi _{1}-\beta _{2}\pi _{2}=0\)\({\mathscr {H}}^{1}\)-almost everywhere on \(\partial \Omega \). We write \(\pi _{1}(x)=A_{1}x+b_{1}\), \(\pi _{2}(x)=A_{2}x+b_{2}\), where
for some suitable \(\lambda ,\mu \in {\mathbb {R}}\), \(b_{1},b_{2}\in {\mathbb {R}}^{2}\); in two dimensions, every rigid deformation is of this form. Now suppose that \(\beta _{1}\pi _{1}-\beta _{2}\pi _{2}=0 {\mathscr {H}}^{1}\)-almost everywhere on \(\partial \Omega \), and denote \(\Theta \subset \partial \Omega \) the set where equality holds; hence, \({\mathscr {H}}^{1}(\partial \Omega {\setminus }\Theta )=0\). Then for any \(x=(x_{1},x_{2})\in \Theta \),
Denote \(\Gamma :=\{x\in \Theta :\;\beta _{1}(x)\ne 0\}\). Our aim is to establish \({\mathscr {H}}^{1}(\Gamma )=0\). We split
For \(\Gamma _{1}\), note that if \(\mu \ne 0\), then \(\Gamma _{1}\) consists at most of one single point and hence \({\mathscr {H}}^{1}(\Gamma _{1})=0\). If \(\mu =0\) and \({\mathscr {H}}^{1}(\Gamma _{1})>0\), then \(\Gamma _{1}\ne \emptyset \) implies \({\mathbf {b}}_{21}={\mathbf {b}}_{22}=0\) and hence, in total, by \(\mu =0\), \(\pi _{2}\equiv 0\), which is ruled out by linear independence of \(\pi _{1},\pi _{2}\). Hence, \({\mathscr {H}}^{1}(\Gamma _{1})=0\).
Now, for \(x\in \Gamma \), we may put \(\gamma (x):=\frac{\beta _{2}(x)}{\beta _{1}(x)}\) and obtain from (8.5) that
-
The treatment of \(\Gamma _{2}\) and \(\Gamma _{3}\) is symmetric (interchange the roles of \(x_{1}\) and \(x_{2}\)). So suppose that \({\mathscr {H}}^{1}(\Gamma _{2})>0\). If \(\mu \ne 0\), then \(\Gamma _{2}\subset \{x\in \Gamma :\;x_{2}=\frac{{\mathbf {b}}_{21}}{\mu }\}\) and hence \({\mathscr {H}}^{1}(\Gamma _{2})=0\) by (8.1), a contradiction. Thus \(\mu =0\). From (8.6) we deduce that \(\lambda x_{2}+{\mathbf {b}}_{11}=0\) for all \(x\in \Gamma _{2}\). Again, if \(\lambda \ne 0\), then \(\Gamma _{2}\subset \{x\in \Gamma :\; x_{2}=-\frac{{\mathbf {b}}_{11}}{\lambda }\}\) and hence \({\mathscr {H}}^{1}(\Gamma _{2})=0\) by (8.1). Hence \(\lambda =0\), and so \(\pi _{1}=b_{1}\), \(\pi _{2}=b_{2}\). In this situation, linear independence of \(\pi _{1},\pi _{2}\) and hereafter of \(b_{1},b_{2}\) implies that \(\beta _{1}=\beta _{2}=0\)\({\mathscr {H}}^{1}\)-almost everywhere on \(\Gamma \), a contradiction to \(\beta _{1}\ne 0\)\({\mathscr {H}}^{1}\)-almost everywhere on \(\Gamma \). As a conclusion, \({\mathscr {H}}^{1}(\Gamma _{2})=0\), and similarly, now invoking the first part of (8.1), \({\mathscr {H}}^{1}(\Gamma _{3})=0\).
-
Suppose that \({\mathscr {H}}^{1}(\Gamma _{4})>0\). For \(x\in \Gamma _{4}\), we have \(\mu x_{2}+{\mathbf {b}}_{21}\ne 0\) and \(-\mu x_{1}+{\mathbf {b}}_{22}\ne 0\). From here we deduce
$$\begin{aligned} \frac{\lambda x_{2}+{\mathbf {b}}_{11}}{\mu x_{2}+{\mathbf {b}}_{21}}= \gamma (x) = \frac{\lambda x_{1}-{\mathbf {b}}_{12}}{\mu x_{1}-{\mathbf {b}}_{22}}\qquad \text {for all}\;x\in \Gamma _{4}. \end{aligned}$$(8.7)Therefore, \(\gamma (x)\) must be independent of \(x_{1},x_{2}\) and thus is constant. Hence, there exists \(a\in {\mathbb {R}}\) such that \(\pi _{1}=a\pi _{2}\) on \(\Gamma _{4}\). The affine-linear map \(\pi _{1}-a\pi _{2}\) thus vanishes on a set of positive \({\mathscr {H}}^{1}\)-measure. Therefore, it necessarily vanishes on a line \(\ell \subset {\mathbb {R}}^{2}\). In other words,
$$\begin{aligned} (A_{1}-aA_{2})x=a(b_{2}-b_{1})\qquad \text {for}\;x\in \ell . \end{aligned}$$(8.8)If \(A_{1}-aA_{2}\) is invertible, then \((A_{1}-aA_{2})x=a(b_{2}-b_{1})\) has a unique solution and thus contradicts (8.8) for all \(x\in \ell \). Thus, \(A_{1}-a A_{2}\) is not invertible, and by the structure of \(A_{1},A_{2}\), this implies \(\lambda = a\mu \). Either \(a=0\), in which case (8.7) yields \({\mathbf {b}}_{11}={\mathbf {b}}_{12}=0\). Then \(\pi _{1}\equiv 0\), contradicting the linear independence of \(\pi _{1},\pi _{2}\). If \(a\ne 0\), then (8.6) yields \({\mathbf {b}}_{11}=a{\mathbf {b}}_{21}\) and \({\mathbf {b}}_{12}=a{\mathbf {b}}_{22}\). In conclusion, \(\pi _{1}=a\pi _{2}\), again contradicting the linear independence of \(\pi _{1}\) and \(\pi _{2}\). Therefore, \({\mathscr {H}}^{1}(\Gamma _{4})=0\).
In conclusion, \({\mathscr {H}}^{1}(\Gamma )=0\) so that \(\beta _{1}=0\)\({\mathscr {H}}^{1}\)-almost everywhere on \(\partial \Omega \). Then we obtain from \({\text {Tr}}_{\partial \Omega }(u_{0})={\text {Tr}}_{\partial \Omega }(u)+\beta _{1}\pi _{1}={\text {Tr}}_{\partial \Omega }(u)\)\({\mathscr {H}}^{1}\)-almost everywhere on \(\partial \Omega \) that \(u\in {\text {GM}}(F;u_{0})\) is a generalised minimiser which attains the correct boundary data \(u_{0}\)\({\mathscr {H}}^{1}\)-almost everywhere. In this situation, Proposition 8.1(a) yields \({\text {GM}}(F;u_{0})=\{u\}\). In total, \({\text {GM}}(F;u_{0})\subset u + {\mathbb {R}}\pi \) for some suitable \(\pi \in {\mathscr {R}}(\Omega )\). Since in this situation \({\text {GM}}(F;u_{0})\) is a closed and bounded interval by (8.3), the statement of Proposition 8.1 for \(n=2\) follows.\(\quad \square \)
Proposition 8.1 rises the question under which minimal geometric assumptions on \(\partial \Omega \) the representation (8.2) continues to hold, an issue that we intend to pursue elsewhere.
Appendix B: Proofs of Auxiliary Results
We now collect here the proofs of some minor auxiliary results used in the main part of the paper.
1.1 On the \({\text {L}}^{q}\)-Stability (2.7)
We start by justifying (2.7). Let \(x_{0}\in {\mathbb {R}}^{n}\) and \(r>0\). Pick an \({\text {L}}^{2}\)-orthonormal basis \(\{\pi _{1},\ldots ,\pi _{N}\}\) of \({\mathscr {R}}({\text {B}}(0,1))\) and consider the orthonormal projection \(\Pi _{{\text {B}}(0,1)}:{\text {L}}^{2}({\text {B}}(0,1);{\mathbb {R}}^{n})\rightarrow {\mathscr {R}}({\text {B}}(0,1))\) given by \(\Pi _{{\text {B}}(0,1)}v:=\sum _{k=1}^{N}\langle v,\pi _{k}\rangle _{{\text {L}}^{2}}\pi _{k}\). For \({\mathscr {R}}({\text {B}}(0,1))\) consists of polynomials, it is clear that we may also admit \(v\in {\text {L}}^{1}({\text {B}}(0,1);{\mathbb {R}}^{n})\) in the last formula. By (2.6), this yields the estimate
for \(v\in {\text {L}}^{1}({\text {B}}(0,1);{\mathbb {R}}^{n})\) so that \(\Pi _{{\text {B}}(0,1)}\) extends to a bounded linear operator from \({\text {L}}^{1}({\text {B}}(0,1);{\mathbb {R}}^{n})\) to \({\mathscr {R}}({\text {B}}(0,1))\). Now (2.7) follows by rescaling.
1.2 Proof of Lemma 2.12
Let \(a\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) be fixed and let \(\xi \in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) be arbitrary. Assertion (a) follows by differentiation, and \(f_{a}\geqq 0\) is a consequence of convexity of f. As to (b), since f is Lipschitz by Lemma 2.8 and because of \({\mathbb {B}}(a,\tfrac{\varrho _{\xi _{0}}}{2})\subset {\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}})\),
Therefore, if \(|\xi |>\tfrac{\varrho _{\xi _{0}}}{2}\), we may successively apply Lemma 2.11(c) and (a) to find
Thus, by Lemma 2.11(d) with \(\ell =\tfrac{\varrho _{\xi _{0}}}{2}\) and the corresponding constant \(c=c(\ell )=c(\tfrac{\varrho _{\xi _{0}}}{2})\)
For the lower bound, we observe that by (2.17) and \({\mathbb {B}}(a,\tfrac{\varrho _{\xi _{0}}}{2})\subset {\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}})\),
Similarly, if \(\tfrac{\varrho _{\xi _{0}}}{2}\leqq |\xi |\), then positive definiteness of \(f''\) on \({\mathbb {R}}_{{\text {sym}}}^{n\times n}\) yields
Hence, we obtain for all \(\xi \in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) by Lemma 2.11(c) and monotonicity of \({\mathbb {R}}\ni t\mapsto V(t)\),
The proof is complete.
1.3 Linear Comparison Estimates and the Proof of Proposition 6.1
Let \(\Omega \subset {\mathbb {R}}^{n}\) be an open and bounded domain with smooth boundary. For \(w\in {\text {W}}^{1,2}(\Omega ;{\mathbb {R}}^{n})\), consider the variational principle
where \(g(z):={\mathscr {A}}[z,z] + \langle b,z\rangle + c\) is a polynomial of degree two on \({\mathbb {R}}_{{\text {sym}}}^{n\times n}\) with a symmetric bilinear form \({\mathscr {A}}:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\times {\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\), \(b\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\) and \(c\in {\mathbb {R}}\). We moreover assume that \({\mathscr {A}}\) is elliptic in the sense that there exists \(\ell _{1},\ell _{2}>0\) such that \(\ell _{1}|z|^{2}\leqq {\mathscr {A}}[z,z] \leqq \ell _{2} |z|^{2}\) holds for all \(z\in {\mathbb {R}}_{{\text {sym}}}^{n\times n}\).
Lemma 9.1
There exists a unique solution \(u\in w+{\text {W}}_{0}^{1,2}(\Omega ;{\mathbb {R}}^{n})\) of (9.1). Moreover, this solution satisfies the following:
-
(a)
There exists a constant \(c=c(n,\ell _{1},\ell _{2})>0\) such that if \({\text {B}}(x_{0},R)\Subset \Omega \), then for all \(0<r<R/2\) it holds that
$$\begin{aligned} \int \nolimits _{{\text {B}}(x_{0},r)}|\varepsilon (u)-(\varepsilon (u))_{x_{0},r}|^{2}\mathrm {d}x \leqq c\left( \frac{r}{R}\right) ^{n+2}\int \nolimits _{{\text {B}}(x_{0},R/2)}|\varepsilon (u)-(\varepsilon (u))_{x_{0},R/2}|^{2}\mathrm {d}x. \end{aligned}$$ -
(b)
If \(\Omega ={\text {B}}(x_{0},R)\) for some \(x_{0}\in {\mathbb {R}}^{n}\) and \(R>0\), then for any \(\alpha \in (0,1)\) there exists a constant \(c=c(n,\alpha ,\ell _{1},\ell _{2})>0\) such that if \(w\in {\text {C}}^{1,\alpha }(\overline{{\text {B}}(x_{0},R)};{\mathbb {R}}^{n})\), then
$$\begin{aligned}{}[\varepsilon (u)]_{{\text {C}}^{0,\alpha }(\overline{{\text {B}}(x_{0},R)};{\mathbb {R}}_{{\text {sym}}}^{n\times n})}\leqq c [\varepsilon (w)]_{{\text {C}}^{0,\alpha }(\overline{{\text {B}}(x_{0},R)};{\mathbb {R}}_{{\text {sym}}}^{n\times n})}. \end{aligned}$$
Proof
Korn’s inequality \(\Vert \nabla \varphi \Vert _{{\text {L}}^{2}(\Omega ;{\mathbb {R}}^{n\times n})}\leqq c\Vert \varepsilon (\varphi )\Vert _{{\text {L}}^{2}(\Omega ;{\mathbb {R}}^{n\times n})}\) for all \(\varphi \in {\text {W}}_{0}^{1,2}(\Omega ;{\mathbb {R}}^{n})\) implies that minimising sequences are bounded in \({\text {W}}^{1,2}(\Omega ;{\mathbb {R}}^{n})\) (as the Dirichlet datum w is fixed). From here, the existence of minima is standard by convexity of g, and uniqueness follows from strict convexity of g. The proof of (a) follows along the lines of [40, Lemma 3.0.5]. For (b), consider the symmetric bilinear form \({\mathscr {B}}:{\mathbb {R}}^{n\times n}\times {\mathbb {R}}^{n\times n}\rightarrow {\mathbb {R}}\) defined by \({\mathscr {B}}[z,\xi ]:={\mathscr {A}}[z^{{\text {sym}}},\xi ^{{\text {sym}}}]\) for \(z,\xi \in {\mathbb {R}}^{n\times n}\). Then \({\mathscr {B}}\) is strongly elliptic in the sense of Legendre–Hadamard: for all \(a,b\in {\mathbb {R}}^{n}\) it holds that
and since trivially \(|{\mathscr {B}}[z,\xi ]|\leqq c(n,\ell _{1},\ell _{2})|z||\xi |\) for all \(z,\xi \in {\mathbb {R}}^{n\times n}\), \({\mathscr {B}}\) is a strongly elliptic bilinear form on \({\mathbb {R}}^{n\times n}\). By minimality of u for G, u satisfies the Euler–Lagrange equation
Therefore, by the classical Schauder estimates for strongly elliptic systems and scaling, there exists a constant \(c=c(n,\ell _{1},\ell _{2})>0\) such that
Trivially, \([\varepsilon (u)]_{{\text {C}}^{0,\alpha }(\overline{{\text {B}}(x_{0},R)};{\mathbb {R}}_{{\text {sym}}}^{n\times n})}\leqq [\nabla u]_{{\text {C}}^{0,\alpha }(\overline{{\text {B}}(x_{0},R)};{\mathbb {R}}^{n\times n})}\). By the simple geometry of \({\text {B}}(x_{0},R)\), \({\mathcal {L}}^{2,n+2\alpha }({\text {B}}(x_{0},R);{\mathbb {R}}_{{\text {sym}}}^{n\times n})\simeq {\text {C}}^{0,\alpha }(\overline{{\text {B}}(x_{0},R)};{\mathbb {R}}_{{\text {sym}}}^{n\times n})\) with the Campanato spaces \({\mathcal {L}}^{p,\lambda }\). We then estimate, using Lemma 2.2(a) in the third step and scaling,
where still \(c=c(n,\ell _{1},\ell _{2})\). This yields (b), and the proof is complete. \(\quad \square \)
The key in the above proof is that an easy reduction to the strongly elliptic bilinear forms applied to the full gradients is possible. This is not the case for elliptic bilinear forms. Clearly, in (b) we could have allowed for more general domains, but this is not needed for the
Proof of Proposition 6.1
We split the proof into two steps, linearisation and comparison estimates.
Step 1. Linearisation. We begin by defining the auxiliary integrand \(g:{\mathbb {R}}_{{\text {sym}}}^{n\times n}\rightarrow {\mathbb {R}}\) by
Using a Taylor expansion of f up to order two around \(\xi _{0}\), we deduce by (6.3) that
By Lemma 9.1, the unique solution h of the auxiliary variational principle
belongs to \({\text {C}}^{1,\alpha }(\overline{{\text {B}}(x_{0},R/2)};{\mathbb {R}}^{n})\). By Lemma 9.1 (a), there exists \(c=c(n,\lambda ,\Lambda )>0\) such that
for all \(0<r<R/2\). Moreover, enlarging \(c>0\) if necessary, Lemma 9.1 (b) gives
Since h is a solution of the variational principle (9.4), the bounds of (6.2) yield that
where still \(c=c(n,\lambda ,\Lambda )>0\). Therefore we deduce for every \(x\in {\text {B}}(x_{0},\tfrac{R}{2})\) that
where \(c_{{\text {comp}}}=c_{{\text {comp}}}(\lambda ,\Lambda ,n)>1\) shall be the constant claimed in the proposition, and so
Step 2. Comparison estimates. We will now compare v with h. To this end, we first notice that by Jensen’s inequality, (9.5) and \(0<r<\tfrac{R}{2}\),
where \(c=c(n,\lambda ,\Lambda )>0\). In view of (6.4), we thus need to control the first term on the very right hand side of the previous inequality. Since h solves (9.4) and \(v-h\in {\text {W}}_{0}^{1,2}({\text {B}}(x_{0},\tfrac{R}{2});{\mathbb {R}}^{n})\), an elementary integration by parts establishes that
Using this equality in the second step, we then deduce
the single terms \({\mathbf {I}}_{1},{\mathbf {I}}_{2},{\mathbf {I}}_{3}\) being defined in the obvious manner.
Ad \({\mathbf {I}}_{1}\). Since \(c_{{\text {comp}}}>1\) and by virtue of our assumption \({\mathbf {t}}_{\alpha ,\xi _{0}}(v;x_{0},R/2)<\varrho _{\xi _{0}}/c_{{\text {comp}}}\), we obtain \(\varepsilon (v)(x)\in {\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}})\) for all \(x\in {\text {B}}(x_{0},R/2)\). In consequence, by (9.3), the definition of \({\mathbf {t}}_{\alpha ,\xi _{0}}\) and because \(\omega _{\xi _{0},\varrho _{\xi _{0}}}\) is non-decreasing,
Ad \({\mathbf {I}}_{2}\). Here we invoke the definition of \({\text {dev}}_{\alpha }\) and minimality of h for (9.4), yielding \({\mathbf {I}}_{2}\leqq {\text {dev}}_{\alpha }(v;x_{0},R/2)\).
Ad \({\mathbf {I}}_{3}\). By our choice (9.8) of \(c_{{\text {comp}}}>1\) and \({\mathbf {t}}_{\alpha ,\xi _{0}}(v;x_{0},R/2)<\varrho _{\xi _{0}}/c_{{\text {comp}}}(<1)\), (9.8) implies that \(\varepsilon (h)(x)\in {\mathbb {B}}(\xi _{0},\varrho _{\xi _{0}})\) for all \(x\in {\text {B}}(x_{0},R/2)\). Hence, by (9.3),
for all \(x\in {\text {B}}(x_{0},R/2)\). Now, because \(\omega _{\xi _{0},\varrho _{\xi _{0}}}\) is non-decreasing, (9.8) and (9.7) imply
where \(c=c(n,\lambda ,\Lambda )>0\). In conclusion, we find with some constant \(c=c(n,\lambda ,\Lambda )>0\)
where \(\vartheta (t):=\omega _{\xi _{0},\varrho _{\xi _{0}}}(t)+\omega _{\xi _{0},\varrho _{\xi _{0}}}(c_{{\text {comp}}}t)\) meets the required properties. This is (6.4), and the proof is complete. \(\quad \square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gmeineder, F. The Regularity of Minima for the Dirichlet Problem on BD. Arch Rational Mech Anal 237, 1099–1171 (2020). https://doi.org/10.1007/s00205-020-01507-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00205-020-01507-5