Abstract
We demonstrate that deep neural networks with the ReLU activation function can efficiently approximate the solutions of various types of parametric linear transport equations. For non-smooth initial conditions, the solutions of these PDEs are high-dimensional and non-smooth functions, so their approximation typically suffers from the curse of dimensionality. We demonstrate that, through their inherent compositionality, deep neural networks can resolve the characteristic flow underlying the transport equations and thereby allow approximation rates that are independent of the parameter dimension.
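To make the compositional mechanism concrete, here is a brief sketch in the notation of the appendices (V the parametrized vector field, X the characteristic flow); it is a standard formulation of the method of characteristics, not a verbatim excerpt from the paper. For the parametric linear transport equation
\[
\partial_{t} u(t,x,\eta) + V(t,x,\eta)\cdot\nabla_{x} u(t,x,\eta) = 0, \qquad u(0,x,\eta) = u_{0}(x,\eta),
\]
the method of characteristics yields
\[
u(t,x,\eta) = u_{0}\bigl(X(t,\cdot,\eta)^{-1}(x),\,\eta\bigr),
\]
so the solution is a composition of the regular inverse characteristic flow with the possibly non-smooth initial datum. A deep ReLU network can approximate the two factors separately, at rates governed by the regularity of each, and then compose the approximations.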
Acknowledgments
The authors would like to thank Avi Mayorcas for inspiring discussions in the early stage of this work. The authors thank François Golse for advice and various helpful suggestions on the theory of transport equations. P.P. is grateful for the hospitality and support of the Institute of Mathematics of the University of Oxford during his visit in January 2020.
Funding
Open Access funding provided by University of Vienna.
Additional information
Communicated by: Jan Hesthaven
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1: Bounds for \(\|X\|_{C^{k}}\), k = 0,1
Proposition A.1
Let X be defined as in Theorem 3.3. Then, for every compact set \(K\subset \mathbb {R}^{n}\) there holds
with \(\|V\|_{C^{1}}:= \|V\|_{C^{1}([0,T]\times B_{G_{0}}(0)\times [0,1]^{D})}\).
Proof
We start with the definition of X given by
The fundamental theorem of calculus implies
With the help of the sublinear growth condition (H2), we conclude
Moreover, by Gronwall’s inequality
Hence,
We have by (A.3a)
Furthermore, applying the Leibniz integral rule to (A.4) yields
and therefore
Gronwall’s inequality then implies
The same procedure yields for \(\nabla_{x} X\) and \(\nabla_{\eta} X\)
Thus, assuming without loss of generality that \(T\geq 1\) and \(\|V\|_{C^{1}}\geq 1\), we get
□
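The displayed formulas in the proof above were lost in extraction. As a hedged reconstruction under the stated hypotheses — the flow in integral form and a sublinear growth bound \(|V(s,y,\eta)| \leq C(1+|y|)\), which is the usual content of (H2) — the chain of estimates reads:
\[
X(t,x,\eta) = x + \int_{0}^{t} V\bigl(s, X(s,x,\eta), \eta\bigr)\,\mathrm{d}s,
\]
hence
\[
|X(t,x,\eta)| \leq |x| + \int_{0}^{t} C\bigl(1+|X(s,x,\eta)|\bigr)\,\mathrm{d}s,
\]
and Gronwall's inequality gives \(|X(t,x,\eta)| \leq (|x| + CT)\,e^{CT}\). Differentiating the integral equation in x (or η) and applying Gronwall again bounds \(\|\nabla_{x}X\|\) and \(\|\nabla_{\eta}X\|\) by \(e^{\|V\|_{C^{1}}T}\) up to constants, which is the source of the final \(C^{1}\) bound.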
Appendix 2: Construction of a NN emulating the left Riemann sum
Proposition B.1
Let \(d \in \mathbb {N}_{\geq 2}\), T > 0, \({\Omega } \subset \mathbb {R}^{d-1}\), and let Φ be a NN with d-dimensional input. Then there exists a NN \(\widetilde {I}_{N}({\Phi })\) such that
where \(c_{1}, c_{2} > 0\) are independent of Φ and \(c_{3} := 3 \|\mathrm{R}({\Phi})\|_{L^{\infty}([0,T]\times {\Omega})}\).
Proof
Let, for \(i \in \{ 0, 1, \dots, N\}\), \(t_{i} := iT/N\). We define, for \(i\in\{0, \dots, N-1\}\),
Then \(\mathrm{R}({\Phi}_{i}^{\text{(shift)}})(t,x) = \mathrm{R}({\Phi})(t_{i}, x)\) for all \(t \in [0,T]\), \(x \in {\Omega}\). Moreover, \(W({\Phi}_{i}^{\text{(shift)}}) \leq 2W({\Phi}) + 2d\) and \(L({\Phi}_{i}^{\text{(shift)}}) = L({\Phi}) + 2\) by Proposition 2.2. Next, we define the following indicator networks for \(i \in \{0, \dots, N-1\}\):
We have that \(W({\Phi}^{\text{(ind)}}_{i}) = 7\), \(L({\Phi}^{\text{(ind)}}_{i}) = 3\), and, for \(t \in [0,T]\) and \(x \in {\Omega}\),
Let \(\bar {a} := \|\mathrm {R}({\Phi })\|_{L^{\infty }([0,T]\times {\Omega })}\). Now we set, for \(i \in \{0, \dots , N-1\}\),
We have that
It follows from (B.2) and (B.3) that, for \(t \in [0,T]\) and \(x \in {\Omega}\),
In addition, by Propositions 2.2 and 2.3,
Finally, we set
Now we have that \(t \leq t_{i}\) if \(\lceil tN/T \rceil \leq i\), and \(t \geq t_{i+1}\) if \(i \leq \lceil tN/T \rceil - 1\). Hence, for \(t \in [0,T]\) and \(x \in {\Omega}\),
Since, by (B.6),
we conclude the proof by observing with (B.7) and (B.8) that
□
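To illustrate the mechanism (not the proof's exact networks), the following NumPy sketch builds a one-hidden-layer ReLU "indicator" of an interval, analogous to \({\Phi}^{\text{(ind)}}_{i}\), and uses it to gate the summands of a left Riemann sum. The function names and the width parameter w are ours, chosen for the illustration.

import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def indicator(t, a, b, w):
    """Piecewise-linear bump: 0 for t <= a - w, 1 on [a, b], 0 for t >= b + w.
    Exactly representable by one hidden ReLU layer with four neurons."""
    return (relu(t - (a - w)) - relu(t - a)
            - relu(t - b) + relu(t - (b + w))) / w

def gated_left_riemann(f, t, x, T=1.0, N=100):
    """Left Riemann sum of s -> f(s, x) over [0, t]: the i-th summand is
    switched on (gate ~ 1) only once t has passed t_{i+1}, mirroring how the
    indicator networks select the active summands in the construction above."""
    h = T / N
    total = 0.0
    for i in range(N):
        gate = indicator(t, (i + 1) * h, T, h)  # ~1 iff t >= t_{i+1}
        total += h * f(i * h, x) * gate
    return total

# Sanity check: the integral of s -> x*cos(s) from 0 to t equals x*sin(t).
print(gated_left_riemann(lambda s, x: x * np.cos(s), t=0.8, x=2.0, N=400))
print(2.0 * np.sin(0.8))

In the actual construction the product of gate and summand is itself realized by a multiplication network (Proposition 2.3), which is why the constant \(c_{3}\) involves \(\|\mathrm{R}({\Phi})\|_{L^{\infty}}\).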
Proposition B.2 (Left Riemann sum)
Let \(M > 0\), \(n \in \mathbb{N}\), \(f \in C^{1}([0,T]\times [-M,M]^{n}; \mathbb{R})\), and \(N \in \mathbb{N}\). The approximation by the left Riemann sum of the integral of f with respect to its first argument from 0 to \(t \leq T\), where \(T \geq 1\), is given by
Then
Proof
Let \(N(t) := \max \limits \{i \in \mathbb {N} | t_{i}<t\}\). Then
□
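The displayed sum and error bound in Proposition B.2 were lost in extraction; a standard formulation consistent with the statement (a hedged reconstruction, not a verbatim excerpt) is
\[
I_{N}(f)(t,x) := \frac{T}{N}\sum_{i=0}^{N(t)-1} f(t_{i},x), \qquad t_{i} := \frac{iT}{N},
\]
with error
\[
\Bigl|\int_{0}^{t} f(s,x)\,\mathrm{d}s - I_{N}(f)(t,x)\Bigr| \;\leq\; \frac{T}{N}\Bigl(T\,\|\partial_{t} f\|_{L^{\infty}} + \|f\|_{L^{\infty}}\Bigr),
\]
since each full cell contributes at most \((T/N)^{2}\|\partial_{t} f\|_{L^{\infty}}\) by the mean value theorem, and the partial cell containing t contributes at most \((T/N)\|f\|_{L^{\infty}}\).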
Remark B.3
Equation (B.1) implies that for a NN Φ with \((n+1)\)-dimensional input there holds
with \(c = 3\|\mathrm {R}({\Phi })\|_{L^{\infty }([0,T]\times {\Omega })} > 0\).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Laakmann, F., Petersen, P. Efficient approximation of solutions of parametric linear transport equations by ReLU DNNs. Adv Comput Math 47, 11 (2021). https://doi.org/10.1007/s10444-020-09834-7