Fundamental lemma of the calculus of variations

Initial result in using test functions to find an extremum

In mathematics, specifically in the calculus of variations, a variation δf of a function f can be concentrated on an arbitrarily small interval, but not on a single point. Accordingly, the necessary condition for an extremum (the functional derivative equals zero) appears in a weak formulation (variational form), integrated against an arbitrary function δf. The fundamental lemma of the calculus of variations is typically used to transform this weak formulation into the strong formulation (a differential equation), free of the integration against an arbitrary function. The proof usually exploits the possibility of choosing δf concentrated on an interval on which f keeps its sign (positive or negative). Several versions of the lemma are in use. Basic versions are easy to formulate and prove. More powerful versions are used when needed.

Basic version

If a continuous function $f$ on an open interval $(a,b)$ satisfies the equality

\int _{a}^{b}f(x)h(x)\,\mathrm {d} x=0

for all compactly supported smooth functions $h$ on $(a,b)$, then $f$ is identically zero.

Here "smooth" may be interpreted as "infinitely differentiable", but often is interpreted as "twice continuously differentiable" or "continuously differentiable" or even just "continuous", since these weaker statements may be strong enough for a given task. "Compactly supported" means "vanishes outside $[c, d]$ for some $c$, $d$ such that $a<c<d<b$"; but often a weaker statement suffices, assuming only that $h$ (or $h$ and a number of its derivatives) vanishes at the endpoints $a$, $b$; in this case the closed interval $[a, b]$ is used.

Proof

Suppose $f({\bar {x}})\neq 0$ for some ${\bar {x}}\in (a,b)$. Since $f$ is continuous, it is nonzero with the same sign on some interval $(c,d)$ such that $a<c<{\bar {x}}<d<b$. Without loss of generality, assume $f({\bar {x}})>0$. Then take an $h$ that is positive on $(c,d)$ and zero elsewhere, for example

h(x)={\begin{cases}\exp \left(-{\frac {1}{(x-c)(d-x)}}\right),&c<x<d\\0,&\mathrm {otherwise} \end{cases}}

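Note that this bump function satisfies the properties in the statement, including being $C^{\infty }$ and compactly supported. Since $fh$ is positive on $(c,d)$ and zero elsewhere,

\int _{a}^{b}f(x)h(x)\,\mathrm {d} x>0,

which contradicts the assumed equality. Hence $f$ is identically zero.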
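The construction in the proof can be checked numerically. The following is a minimal sketch, assuming the illustrative choices $f(x)=\cos x$ on $(a,b)=(-1,1)$ with $(c,d)=(-0.5,0.5)$ and a plain midpoint rule; the names `bump` and `integrate` are introduced only for this example and do not come from the cited sources.

```python
import math

def bump(x, c, d):
    """Bump function from the proof: positive on (c, d), zero elsewhere."""
    if c < x < d:
        return math.exp(-1.0 / ((x - c) * (d - x)))
    return 0.0

def integrate(func, a, b, n=20000):
    """Midpoint-rule approximation of the integral of func over [a, b]."""
    width = (b - a) / n
    return width * sum(func(a + (i + 0.5) * width) for i in range(n))

# Illustrative assumptions: f(x) = cos(x) is positive on (c, d) = (-0.5, 0.5).
a, b, c, d = -1.0, 1.0, -0.5, 0.5
f = math.cos

# The product f * h is positive on (c, d) and zero elsewhere,
# so the integral over (a, b) must be strictly positive.
value = integrate(lambda x: f(x) * bump(x, c, d), a, b)
print(value > 0.0)
```

A strictly positive integral for this single $h$ is already incompatible with the hypothesis of the lemma, which is the contradiction used in the proof.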

Version for two given functions

If a pair of continuous functions f, g on an interval (a,b) satisfies the equality

\int _{a}^{b}(f(x)\,h(x)+g(x)\,h'(x))\,\mathrm {d} x=0

for all compactly supported smooth functions h on (a,b), then g is differentiable, and g' = f everywhere.

The special case for g = 0 is just the basic version.

Here is the special case for f = 0 (often sufficient).

If a continuous function g on an interval (a,b) satisfies the equality

\int _{a}^{b}g(x)\,h'(x)\,\mathrm {d} x=0

for all smooth functions h on (a,b) such that $h(a)=h(b)=0$, then g is constant.
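A standard way to see why this special case holds is to test against one particular function: with $\bar g$ the mean value of g, the function $h(x)=\int _{a}^{x}(g(t)-{\bar {g}})\,\mathrm {d} t$ satisfies $h(a)=h(b)=0$, and $\int _{a}^{b}g\,h'\,\mathrm {d} x=\int _{a}^{b}(g-{\bar {g}})^{2}\,\mathrm {d} x$, which vanishes only if g is constant. The sketch below checks this numerically, assuming that g extends continuously to the closed interval and that continuously differentiable test functions are admitted; the example $g(x)=x^{2}$ on $[0,1]$ and the helper names are hypothetical.

```python
def midpoint_integral(func, a, b, n=2000):
    """Midpoint-rule approximation of the integral of func over [a, b]."""
    width = (b - a) / n
    return width * sum(func(a + (i + 0.5) * width) for i in range(n))

def pairing_with_special_test_function(g, a, b, n=2000):
    """Approximate the integral of g(x) * h'(x), where h'(x) = g(x) - mean(g).

    Here h(x) is the integral of (g(t) - mean) from a to x, so h(a) = h(b) = 0.
    """
    mean = midpoint_integral(g, a, b, n) / (b - a)
    return midpoint_integral(lambda x: g(x) * (g(x) - mean), a, b, n)

# Non-constant g: the pairing equals the integral of (g - mean)^2, which is positive.
nonconstant = pairing_with_special_test_function(lambda x: x * x, 0.0, 1.0)
# Constant g: the pairing vanishes up to rounding error.
constant = pairing_with_special_test_function(lambda x: 3.0, 0.0, 1.0)
print(nonconstant, constant)
```

For $g(x)=x^{2}$ the exact value of the pairing is $1/5-1/9=4/45$, so this g violates the hypothesis, consistent with the lemma.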

If, in addition, continuous differentiability of g is assumed, then integration by parts reduces both statements to the basic version; this case is attributed to Joseph-Louis Lagrange, while the proof of differentiability of g is due to Paul du Bois-Reymond.
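As a worked step, under the extra assumption that g is continuously differentiable, integration by parts makes the reduction explicit; the boundary term vanishes because h is compactly supported in (a,b):

```latex
\int_{a}^{b} \bigl( f(x)\,h(x) + g(x)\,h'(x) \bigr)\,\mathrm{d}x
  = \Bigl[ g(x)\,h(x) \Bigr]_{a}^{b}
    + \int_{a}^{b} \bigl( f(x) - g'(x) \bigr)\,h(x)\,\mathrm{d}x
  = \int_{a}^{b} \bigl( f(x) - g'(x) \bigr)\,h(x)\,\mathrm{d}x .
```

Applying the basic version to the continuous function f − g' then gives g' = f.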

Versions for discontinuous functions

The given functions (f, g) may be discontinuous, provided that they are locally integrable (on the given interval). In this case, Lebesgue integration is meant, the conclusions hold almost everywhere (thus, in all continuity points), and differentiability of g is interpreted as local absolute continuity (rather than continuous differentiability). Sometimes the given functions are assumed to be piecewise continuous, in which case Riemann integration suffices, and the conclusions are stated everywhere except the finite set of discontinuity points.

Higher derivatives

If a tuple of continuous functions $f_{0},f_{1},\dots ,f_{n}$ on an interval (a,b) satisfies the equality

\int _{a}^{b}(f_{0}(x)\,h(x)+f_{1}(x)\,h'(x)+\dots +f_{n}(x)\,h^{(n)}(x))\,\mathrm {d} x=0

for all compactly supported smooth functions h on (a,b), then there exist continuously differentiable functions $u_{0},u_{1},\dots ,u_{n-1}$ on (a,b) such that

{\begin{aligned}f_{0}&=u'_{0},\\f_{1}&=u_{0}+u'_{1},\\f_{2}&=u_{1}+u'_{2},\\&\;\vdots \\f_{n-1}&=u_{n-2}+u'_{n-1},\\f_{n}&=u_{n-1}\end{aligned}}

everywhere.

This necessary condition is also sufficient, since the integrand becomes $(u_{0}h)'+(u_{1}h')'+\dots +(u_{n-1}h^{(n-1)})'.$
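To spell out the sufficiency claim, the product rule can be applied to each term and the coefficients of $h,h',\dots ,h^{(n)}$ collected; the total derivative then integrates to zero because h and its derivatives vanish near the endpoints:

```latex
\bigl( u_{k}\,h^{(k)} \bigr)' = u'_{k}\,h^{(k)} + u_{k}\,h^{(k+1)},
  \qquad k = 0, \dots, n-1,
\\[4pt]
\sum_{k=0}^{n-1} \bigl( u_{k}\,h^{(k)} \bigr)'
  = u'_{0}\,h + \sum_{k=1}^{n-1} \bigl( u_{k-1} + u'_{k} \bigr)\,h^{(k)} + u_{n-1}\,h^{(n)}
  = f_{0}\,h + f_{1}\,h' + \dots + f_{n}\,h^{(n)},
\\[4pt]
\int_{a}^{b} \sum_{k=0}^{n-1} \bigl( u_{k}\,h^{(k)} \bigr)'\,\mathrm{d}x
  = \Bigl[ \sum_{k=0}^{n-1} u_{k}\,h^{(k)} \Bigr]_{a}^{b} = 0 .
```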

The case n = 1 is just the version for two given functions, since $f=f_{0}=u'_{0}$ and $g=f_{1}=u_{0},$ thus $f_{0}-f'_{1}=0.$

In contrast, the case n=2 does not lead to the relation $f_{0}-f'_{1}+f''_{2}=0,$ since the function $f_{2}=u_{1}$ need not be differentiable twice. The sufficient condition $f_{0}-f'_{1}+f''_{2}=0$ is not necessary. Rather, the necessary and sufficient condition may be written as $f_{0}-(f_{1}-f'_{2})'=0$ for n=2, $f_{0}-(f_{1}-(f_{2}-f'_{3})')'=0$ for n=3, and so on; in general, the brackets cannot be opened because of non-differentiability.
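For n=2, the nested form follows by eliminating $u_{0}$ and $u_{1}$ from the system, starting from the last equation. Only $f_{2}$ and the combination $f_{1}-f'_{2}$ are known to be continuously differentiable, which is why the outer derivative cannot be distributed over the two terms:

```latex
f_{2} = u_{1}
  \;\Longrightarrow\; u_{0} = f_{1} - u'_{1} = f_{1} - f'_{2}
  \;\Longrightarrow\; f_{0} = u'_{0} = \bigl( f_{1} - f'_{2} \bigr)' .
```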

Vector-valued functions

Generalization to vector-valued functions $(a,b)\to \mathbb {R} ^{d}$ is straightforward; one applies the results for scalar functions to each coordinate separately, or treats the vector-valued case from the beginning.

Multivariable functions

If a continuous multivariable function f on an open set $\Omega \subset \mathbb {R} ^{d}$ satisfies the equality

\int _{\Omega }f(x)\,h(x)\,\mathrm {d} x=0

for all compactly supported smooth functions h on Ω, then f is identically zero.

Similarly to the basic version, one may consider a continuous function f on the closure of Ω, assuming that h vanishes on the boundary of Ω (rather than being compactly supported).
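The proof of the basic version adapts to this setting with a radial bump function. As a sketch, assume $f({\bar {x}})>0$ at some ${\bar {x}}\in \Omega$ and pick a radius $r>0$ (both symbols are introduced here for illustration) small enough that the closed ball of radius r around ${\bar {x}}$ lies in Ω and f is positive on it. One may then take

```latex
h(x) =
\begin{cases}
  \exp\!\left( -\dfrac{1}{r^{2} - |x - \bar{x}|^{2}} \right), & |x - \bar{x}| < r, \\[6pt]
  0, & \text{otherwise},
\end{cases}
\qquad \text{so that} \qquad
\int_{\Omega} f(x)\,h(x)\,\mathrm{d}x > 0 ,
```

which contradicts the assumed equality.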

Here is a version for discontinuous multivariable functions.

Let $\Omega \subset \mathbb {R} ^{d}$ be an open set, and let $f\in L^{2}(\Omega )$ satisfy the equality

\int _{\Omega }f(x)\,h(x)\,\mathrm {d} x=0

for all compactly supported smooth functions h on Ω. Then f=0 (in $L^{2}$, that is, almost everywhere).

Applications

This lemma is used to prove that extrema of the functional

J=\int _{x_{0}}^{x_{1}}L(t,y(t),{\dot {y}}(t))\,\mathrm {d} t

are weak solutions $y:[x_{0},x_{1}]\to V$ (for an appropriate vector space $V$) of the Euler–Lagrange equation

{\partial L(t,y(t),{\dot {y}}(t)) \over \partial y}={\mathrm {d}  \over \mathrm {d} t}{\partial L(t,y(t),{\dot {y}}(t)) \over \partial {\dot {y}}}.
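A sketch of how the lemma enters, assuming that L is continuously differentiable, that y is a continuously differentiable extremum, and that h is a test function vanishing at $x_{0}$ and $x_{1}$ (the parameter ε and the shorthand with arguments written out are introduced here): the first variation of J must vanish for every such h,

```latex
0 = \left. \frac{\mathrm{d}}{\mathrm{d}\varepsilon} J[y + \varepsilon h] \right|_{\varepsilon = 0}
  = \int_{x_{0}}^{x_{1}} \left(
      \frac{\partial L(t, y(t), \dot{y}(t))}{\partial y}\, h(t)
      + \frac{\partial L(t, y(t), \dot{y}(t))}{\partial \dot{y}}\, \dot{h}(t)
    \right) \mathrm{d}t .
```

This is the hypothesis of the version for two given functions with $f=\partial L/\partial y$ and $g=\partial L/\partial {\dot {y}}$ evaluated along y, and its conclusion g' = f is the Euler–Lagrange equation.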

The Euler–Lagrange equation plays a prominent role in classical mechanics and differential geometry.
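As a small numerical illustration of the weak form, consider the hypothetical Lagrangian $L={\tfrac {1}{2}}{\dot {y}}^{2}$ on $[0,1]$, for which the Euler–Lagrange equation reduces to ${\ddot {y}}=0$. The sketch below evaluates the first variation $\int _{0}^{1}{\dot {y}}\,{\dot {h}}\,\mathrm {d} t$ for the single test function $h(t)=\sin(\pi t)$, comparing the straight line $y(t)=t$ with the curve $y(t)=t^{2}$. All names and choices are assumptions made for this example, and one test function only illustrates the condition rather than verifying it for all h.

```python
import math

def midpoint_integral(func, a, b, n=4000):
    """Midpoint-rule approximation of the integral of func over [a, b]."""
    width = (b - a) / n
    return width * sum(func(a + (i + 0.5) * width) for i in range(n))

def first_variation(y_dot, h_dot, t0, t1):
    """Weak form for L = 0.5 * y_dot**2: integral of y_dot(t) * h_dot(t)."""
    return midpoint_integral(lambda t: y_dot(t) * h_dot(t), t0, t1)

def h_dot(t):
    """Derivative of the test function h(t) = sin(pi * t), with h(0) = h(1) = 0."""
    return math.pi * math.cos(math.pi * t)

# y(t) = t solves y'' = 0, so its first variation vanishes (up to rounding).
straight = first_variation(lambda t: 1.0, h_dot, 0.0, 1.0)
# y(t) = t**2 does not solve y'' = 0; its first variation is nonzero.
parabola = first_variation(lambda t: 2.0 * t, h_dot, 0.0, 1.0)
print(straight, parabola)
```

For $y(t)=t^{2}$ the exact value is $-4/\pi$, so this curve is not an extremum of the functional.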

Notes

Jost & Li-Jost 1998, Lemma 1.1.1 on p.6
Gelfand & Fomin 1963, Lemma 1 on p.9 (and Remark)
Liberzon 2012, Lemma 2.1 on p.30 Web version: "Lemma 2.1 The Lemma of DuBois-Reymond".
Gelfand & Fomin 1963, Lemma 4 on p.11
Hestenes 1966, Lemma 15.1 on p.50
Gelfand & Fomin 1963, Lemma 2 on p.10
Liberzon 2012, Lemma 2.2 on p.33 Web version: "Lemma 2.2 (modification of Lemma 2.1)".
Jost & Li-Jost 1998, Lemma 1.2.1 on p.13
Giaquinta & Hildebrandt 1996, section 2.3: Mollifiers
Hestenes 1966, Lemma 13.1 on p.105
Gelfand & Fomin 1963, p.35
Jost & Li-Jost 1998
Gelfand & Fomin 1963, Lemma on p.22; the proof applies in both situations.
Jost & Li-Jost 1998, Lemma 3.2.3 on p.170

References

Jost, Jürgen; Li-Jost, Xianqing (1998), Calculus of variations, Cambridge University Press
Gelfand, I.M.; Fomin, S.V. (1963), Calculus of variations, Prentice-Hall (transl. from Russian).
Hestenes, Magnus R. (1966), Calculus of variations and optimal control theory, John Wiley
Giaquinta, Mariano; Hildebrandt, Stefan (1996), Calculus of Variations I, Springer
Liberzon, Daniel (2012), Calculus of Variations and Optimal Control Theory, Princeton University Press
