Professional Documents
Culture Documents
Functional Derivative
The goal of this section is to discover a suitable definition of a ”functional
derivative”, such that we can take the derivative of a functional and still have
the same rules of differentiation as normal calculus. For example, we wish to
find a definition for δJ
δy
, where J[y(x)] is a functional of y(x) such that things
δ 2 δJ
like δy J = 2J δy are still true.
Definitions
Functional
Variation
A variation of the functional is the amount the functional changes when the
input function is changed by a little bit. Suppose we let y(x) → y(x) + δy(x),
d
then since dx is linear
y 0 (x) → y 0 (x) + dx
d
δy(x) = y 0 (x) + δy 0 (x)
y 00 (x) → y 00 (x) + d2 δy(x) = y 00 (x) + δy 00 (x)
dx2
. (2)
..
dn
(n)
y (x) → y (n) (x) + dx n δy(x) = y
(n)
(x) + δy (n) (x)
thus the integrant of the new output J[y + δy] can be expanded to first order
using Taylor expansion of a multivariable function around the old input y
Z x2
J[y + δy] = f (x, y + δy, y 0 + δ 0 y, · · · , y (n) + δ (n) y)dx
x1
Z x2
∂ (n) f (n)
∂f ∂f ∂f 0
= f+ dx + δy + 0 δy + · · · + (n) δy dx (3)
x1 ∂x ∂y ∂y ∂y
2
The variation of the functional is thus, by definition, the new output minus
the old output taken to first order.
δJ =J[y + δy] − J[y]
Z x2
∂ (n) f (n)
∂f ∂f ∂f 0
= dx + δy + 0 δy + · · · + (n) δy dx (4)
x1 ∂x ∂y ∂y ∂y
d
we can moved all the on δy to f using integration by parts
dx
x2 x2 2
x2
d ∂f d ∂f 0
d ∂f
δJ = ( 0 )δy(x) + ( 00 )δy (x) − 2 ( 00 )δy(x)
dx ∂y x1 dx ∂y x1 dx ∂y x1
n
x2 Z x2
d ∂f ∂f
+ · · · + (−1)n−1 n ( (n) )δy (n) (x) + dx+
dx ∂y x1 x1 ∂x
Z x2
d2 ∂f n
∂f d ∂f n−1 d ∂f
− + − · · · + (−1) δy(x)dx
x1 ∂y dx ∂y 0 dx2 ∂y 00 dxn ∂y (n)
(5)
and Voila! (5) is the variation of the local functional defined in (1) with no
additional assumption.
Functional Derivative
For a normal multi-variable function f (x1 , x2 , · · · , xn ) we have a nice form
for its variation
n
X ∂f
df = dxi (6)
i=1
∂x i
∂f
and we know how to calculate the derivatives ∂x i
. Here we wish to rewrite
(5) such that we have a similar form for the variation of a functional
Z x2
δJ
δJ = dx (x)δy(x) (7)
x1 δy
Unfortunately, this is only possible under special circumstances. That is, we
need the variation to have ”fixed-ends” (δy (n) (x1 ) = δy (n) (x2 ) = 0) and
that we require implicit f ( ∂f ∂x
= 0)1 . Basically, we want everything before
the last line of (5) to vanish. This way, comparing (5) and (7) we finally have
δJ ∂f d ∂f d2 ∂f n−1 d
n
∂f
= − + − · · · + (−1) (8)
δy ∂y dx ∂y 0 dx2 ∂y 00 dxn ∂y (n)
1
R x2∂f
This is slightly over kill, since we just want x1 ∂x
dx =0
3
As long as δJ can be written as (7), we will have our nice rules of differenti-
ation. For example
Lagrangian Mechanics
When the integrant of the functional only has dependence on y and y 0
(f (y, y 0 )), (8) reduces to the popular Fréchet derivative
δJ ∂f d ∂f
= − (10)
δy ∂y dx ∂y 0
this form should look familiar to all physicists, since it laid the foundation
for basic Lagrangian mechanics. In a typical classical mechanics problem, we
wish to minimize the action S, which is often a functional of a configuration
function q, whose basic independent variable is time t. That is
Z t2
S[q] = L(q(t), q̇(t))dt (11)
t1