Part Iii: Functions From R Tor

Part III: Functions from Rn to Rm
Week 7 86
Introduction to Part III
In Part I of the module we considered functions
r : U ⊆ R → Rn
and in Part II we considered functions
f : U ⊆ Rn → R
In the remainder of the module we consider the

general situation
f : U ⊆ Rn → Rm
We shall focus particularly the following import-

ant cases.
• When n = m the function f may be viewed

as assigning an n-vector to each point in Rn .
This is called a vector field on Rn .
• Another view of the case n = m is that the

function is a coordinate transformation, or
change of coordinates, on Rn .
• In the case n = 2 and m = 3 the function

provides a parametrisation of a two-dimensional
surface in R3 .
After introducing vector fields, we will focus on

the second case this week. In Week 9 we will con-
sider surfaces. Finally in Week 10 we return to
parametrised curves and vector fields.
Week 8: Functions from Rn to Rn
8.1 Vector Fields
There are many situations in which a vector is as-

sociated to each point in some region of space. Fa-
miliar examples come from fluid motion such as the
wind or the motion of water in a river. Wind has
both a magnitude and a direction (blowing North-
East at 18 miles/hour) and hence is a vector quant-
ity whose value generally varies with location.
The assignment of a vector to each point in a
region of space is a function
F : U ⊆ R3 → R3
We typically use boldface to denote vector fields,

(usually F, V, or v), to emphasise that to each
point in U the function F assigns a vector.
While the majority of physical examples are vec-
tor fields on R3 , the general case is for any dimen-
sion n
F : U ⊆ Rn → Rn
We will primarily be interested in vector fields on

the plane (n = 2) or in space (n = 3).
A few examples
• The planar vector field,
F(x, y) = −yi + xj
corresponds to vectors pointing counterclockwise

around the origin.
The vector field,
F(x, y, z) = xi + yj + zk
corresponds to vectors in space pointing away

from the origin.
• An example we have already seen is the gradi-
ent of a function of several variables f (x). The
gradient is a vector field since ∇f (x) is a vec-
tor whose value depends on the point x. On the
plane for example
∂f ∂f
F(x) = ∇f (x) = (x, y)i + (x, y)j
∂x ∂y
Week 8 88
• Systems of ordinary differential equations
ẋ1 = f1 (x1 , · · · , xn )
..
.
ẋn = fn (x1 , · · · , xn )
can be viewed as vector fields on the correspond-

ing phase space. To each point in phase space
x = (x1 , · · · , xn ) there is an associated derivat-
ive vector,
ẋ = F(x) = (f1 (x1 , · · · , xn ), · · · , fn (x1 , · · · , xn ))
• Vector fields arise frequently in applications. Fluid

flows, gravitational fields, electric fields, and mag-
netic fields are all vector fields.
8.2 Coordinate Transformations
A different way in which mappings Rn → Rn arise

is coordinate transformations. We motivate the
subject by noting the similarity between what is
called variable substitution in one-variable integ-
ration and changing to polar coordinates in two-
variable integration. Both involve the same three
steps.
Variable substitution
Consider integrating
ˆ 1 ˆ 1
1
f (x) dx = √ dx
0 0 1 − x2
by variable substitution
1. Substitute x = sin u into the integrand f (x).

1 1
f (x(u)) = p =√
2
1 − sin u cos2 u
2. Change the limits of integration to correspond

to those of u
ˆ x=1 ˆ u= π
2
→
x=0 u=0
3. Calculate the differential dx in terms of du

dx
dx = du = cos udu
du
Thus
ˆ 1 ˆ π
1 2 cos u π
√ dx = √ du =
0 1 − x2 0 cos2 u 2
89 MA134 Geometry and Motion
Polar coordinates
Now consider the double integral of f (x, y) = cos(x2 +

y 2 ) over the region
p
Ω = {(x, y)| − 3 ≤ x ≤ 3, 0 ≤ y ≤ 9 − x2 }
¨ ˆ 3 ˆ √
9−x2
f dA = cos(x2 + y 2 ) dy dx
Ω −3 0
We can evaluate this double integral be changing

to polar coordinates
1. Substitute x = r cos θ, y = r sin θ into the integ-

rand f (x, y).
f (x, y) = cos(r2 cos2 θ + r2 sin2 θ) = cos(r2 )
2. Change the limits of integration to those corres-

ponding to Ω the polar coordinates
ˆ 3 ˆ √
9−x2 ˆ θ=π ˆ r=3
→
−3 0 θ=0 r=0
3. Use the correct area element for polar coordin-

ates
dA = r dr dθ
Thus
ˆ 3 ˆ √
9−x2
cos(x2 + y 2 ) dy dx
−3 0
ˆ θ=π ˆ r=3
= cos(r2 )r dr dθ
θ=0 r=0
In both cases we re-wrote the integrals in a more

useful way by making a change of variables, or a
change of coordinates. We say we transformed
the coordinates.
8.3 Linear Transformations
You are familiar with linear transformations from

Linear Algebra
T : Rn → Rm : T (αu + βv) = αT (u) + βT (v)
This week we will concentrate on the case of the

same dimension: m = n, where n = 2 or n = 3.
Initially we focus on n = 2 later extend to n = 3.
Week 8 90
T can be expressed as a matrix,

a b
.
c d
We also think of T in terms of component func-

tions
x = x(u, v) y = y(u, v)
where here
x(u, v) = au + bv y(u, v) = cu + dv
We previously used this shorthand notation of nam-

ing functions by the corresponding variable name.
We assume T is invertible and thus we can obtain
the component functions of T −1
u = u(x, y) v = v(x, y)
although we often do not need such explicit expres-

sions.
The transformation is viewed either as a map-
ping T : R2 → R2 or as a change of coordinates in
R2 .
Suppose we want to write a double integral
¨
f dA
Ω
in a form better suited for integration using a linear

transformation T . Assume we figured out which
transformation T will help us. The first two steps
of the process are:
1. Substitute for x and y in terms of u and v in the

integrand
f (x, y) = f (x(u, v), y(u, v)) = g(u, v)
where we have written g(u, v) to emphasise that

after the substitution we have a different func-
tion of (u, v) than we had of (x, y).
2. Find the region Γ in (u, v) coordinates corres-
ponding to Ω in (x, y) coordinates:
T :Γ→Ω
This will determine the limits of integration of

u and v.
What remains is to express the area element

dA in terms of differentials du and dv. Consider
a rectangle in the uv-plane with lower left corner
at u0 = (u0 , v0 ), and with sides of length 4u and

4v:
{(u, v)| u0 ≤ u ≤ u0 + 4u, v0 ≤ v ≤ v0 + 4v}
This gets mapped by T to a parallelogram in the

xy-plane with vertices
x0 = T (u0 , v0 ) x1 = T (u0 + 4u, v0 )

x2 = T (u0 , v0 + 4v) x3 = T (u0 + 4u, v0 + 4v)
The area 4A of this parallelogram is given by

the cross product of two vectors corresponding to
two adjacent sides.
4A = kr1 × r2 k
where,
r 1 = x1 − x 0 , r2 = x2 − x0
(For the cross product these are viewed as vectors

in R3 with zero third component.) The reader can
show that

r1 = ai + cj 4u, r2 = bi + dj 4v
Thus

i j k
a c
r1 × r2 = a c 0 4u4v = 4u4v k
b d 0 b d
or transposing to bring into standard form

a b
r1 × r2 =
4u4v k
c d
The determinant in the above expression is called
the Jacobian of the transformation T . In the
present case where T is linear, it is just the de-
terminant of the matrix representing the transform-
ation. Denote this determinant by J. Then finally
4A = kr1 × r2 k = |J| 4u4v
For infinitesimally small du and dv we have the

differential of area
dA = |J| du dv
Combining everything, we have the following

formula for transforming the double integral
¨ ¨
f dA = f (x(u, v), y(u, v)) |J| du dv
Ω Γ
Week 8 92
8.4 General transformations in R2
We now generalise this approach from linear map-

pings to arbitrary mappings. The main change will
be that the formerly constant entries in the Jac-
obian will now depend on position and be given by
partial derivatives of the mapping.
We continue to denote the mapping by T and
we assume it maps some region Γ in uv-coordinates
to some region of interest Ω in xy-coordinates. We
focus on the component functions
x = x(u, v) y = y(u, v)
which now are not assumed to be linear. We re-

quire that the partial derivatives of the component
functions exist and are continuous. We also require
that T is invertible except possibly on the bound-
ary of Γ.
We proceed as in the linear case. We substitute
the change of variables into f (x, y) and then set
limits of integration to correspond to the region
Γ in (u, v). Finally, the thing we actually need to
work out is the element of area dA in terms of dudv
for a general transformation.
Consider a rectangle in the uv-plane, which we
now imagine to be small
{(u, v)| u0 ≤ u ≤ u0 + 4u, v0 ≤ v ≤ v0 + 4v}
This gets mapped by T to a slightly curved paral-

lelogram in the xy-plane with vertices
x0 = T (u0 , v0 ) x1 = T (u0 + 4u, v0 )

x2 = T (u0 , v0 + 4v) x3 = T (u0 + 4u, v0 + 4v)
The x and y components of x1 can be approximated

by the linear approximation
∂x
x(u0 + 4u, v0 ) ≈ x(u0 , v0 ) + (u0 , v0 )4u
∂u
∂y
y(u0 + 4u, v0 ) ≈ y(u0 , v0 ) + (u0 , v0 )4u
∂u
or
∂x ∂y
x1 ≈ x 0 + 4ui + 4uj
∂u ∂u
where it is understood that the partial derivatives
are evaluated at (u0 , v0 ). Similarly
∂x ∂y
x2 ≈ x0 + 4vi + 4vj
∂v ∂v
We use the linear approximations to replace the

sides of the sides of the slightly curved region by
vectors r1 and r2 ,

∂x ∂y ∂x ∂y
r1 = i+ j 4u, r2 = i+ j 4v
∂u ∂u ∂v ∂v
where,
r1 ≈ x1 − x0 , r2 ≈ x2 − x0
Recall that in the linear case, we had exactly the

sides r1 and r2 of the of region in xy-coordinates,
and these were given in terms of the constants a,
b, c, and d of the linear transformation. For the
general case, we instead use a linear approxima-
tion, i.e. first partial derivatives, to approximate
the sides of the region. The partial derivatives then
appear where before we had constants. Otherwise,
the computation is very similar to the linear case.
The approximate area of the slightly curved re-
gion is
4A ≈ kr1 × r2 k
where

∂x ∂y

∂u ∂u
r1 × r2 = 4u4v k

∂x ∂y

∂v ∂v

The transpose of the above determinant is denoted

∂x ∂x

∂(x, y) ∂u ∂v
=
∂(u, v) ∂y ∂y

∂u ∂v

This determinant is called the Jacobian of the

transformation T . It gives the approximate area
4A in terms of 4u4v

∂(x, y)
4A ≈ 4u4v
∂(u, v)
Unlike the linear case it is no longer constant but

depends on the point (u, v) at which the partial
derivative are evaluated. For infinitesimally small
changes in u and v we have

∂(x, y)
dA = du dv
∂(u, v)
Week 8 94
Combining everything, we have the following

formula for transforming a double integral with a
general mapping
¨ ¨
∂(x, y)
f dA = f (x(u, v), y(u, v))
du dv
Ω Γ ∂(u, v)
8.5 Coordinate transformations in R3
The generalisation to three dimensions is straight-

forward. Our transformation T will now have three
component functions depending on three variables
(u, v, w)
x = x(u, v, w) y = y(u, v, w) z = z(u, v, w)
T will map some region Γ ⊆ R3 to some region

we care about Ω ⊆ R3 . The first 2 steps in our
transformation of the integral do not depend in any
essential way on the dimensionality.
To derive the volume element consider a small
rectangular box in uvw
{(u, v, w)| u0 ≤ u ≤ u0 + 4u,v0 ≤ v ≤ v0 + 4v,

w0 ≤ w ≤ w0 + 4w}
This get mapped by T to a slightly curved paral-

lelepiped. The volume 4V of this region can be ap-
proximated from three vectors approximating edges
of the parallelepiped

∂x ∂y ∂z
r1 = i+ j+ k 4u
∂u ∂u ∂u

∂x ∂y ∂z
r2 = i+ j+ k 4v
∂v ∂v ∂v

∂x ∂y ∂z
r3 = i+ j+ k 4w
∂w ∂w ∂w
where
r 1 ≈ x1 − x 0 , r 2 ≈ x2 − x0 , r3 ≈ x3 − x0
with
x0 = T (u0 , v0 , w0 ) x1 = T (u0 + 4u, v0 , w0 )

x2 = T (u0 , v0 + 4v, w0 ) x3 = T (u0 , v0 , w0 + 4w)
The volume of the parallelepiped formed from

these three vectors is given by their triple product
(see Additional Material from Week 3),
4V ≈ (r1 × r2 ) · r3
The triple product gives us a determinant of a mat-

rix built out of partial derivatives. The transpose
of this is the Jacobian of T for this case and is
denoted
∂x ∂x ∂x

∂u ∂v ∂w

∂(x, y, z) ∂y ∂y ∂y
=
∂(u, v, w) ∂u ∂v ∂w

∂z ∂z ∂z

∂u ∂v ∂w
This gives the approximate volume 4V in terms of
4u4v4w

∂(x, y, z)
4V ≈ 4u4v4w
∂(u, v, w)
For infinitesimally small du, dv, dw we have the

differential of volume
dV = |J| du dv dw
Then finally we have
˚ ˚
∂(x, y, z)
f dV = f du dv dw
Ω Γ ∂(u, v, w)
where it is understood that one uses T to substitute

for (x, y, z) in terms of (u, v, w) in f .
Additional Material
Why the absolute the value of the Jacobian ?
You will notice that in our expressions in generalised coordinates we took the absolute value of the
Jacobian

∂(x, y) ∂(x, y, z)
|J| du dv, ∂(u, v) du dv,

∂(u, v, w) du dv dw

This seems to be the convention used in text books at this level. It is not strictly necessary, but one
must properly take into account limits for integration if one does not include the absolute value. We
will use the absolute value together with the following rule about setting up integrals:
Always use limits of integration such that the lower limit is smaller than the upper limit..
For example, in
¨ ¨
∂(x, y)
f dA = f (x(u, v), y(u, v)) du dv
Ω Γ ∂(u, v)
¨ ˆ 1ˆ 1
if Γ is the rectangular region 0 ≤ u ≤ 1, −1 ≤ v ≤ 1, then express as repeated integrals
ˆ 1ˆ 1 ˆ −1 ˆ 1 ˆ 1ˆ 0 Γ 0 −1
and not or .
−1 0 1 0 −1 1
Derivative matrix and the general chain rule
Consider now a function f : Rn → Rm , where n and m are arbitrary. Express f in terms of m

component functions each depending on n real values,
f1 (x1 , x2 , · · · , xn ),
..
.
fj (x1 , x2 , · · · , xn ),
..
.
fm (x1 , x2 , · · · , xn )
Now differentiate each component function with respect to each independent variable and arrange
these in a matrix. The matrix is known as the derivative matrix of the function f . It is commonly
denoted Df ,
∂f1 ∂f1
 
···
 ∂x1 ∂xn 
 . .. 
Df =  .
 . . .

 ∂fm ∂fm 
···
∂x1 ∂xn
Comments
• You will recognise special cases we have already encountered. A vector function r(t) corresponds
to f : R → Rm . The derivative matrix has one column, the components of r0 (t). (When a function
depends on only one variable, the partial derivatives become ordinary derivatives.)
A function of several variables corresponds to the case f : Rn → R. The derivative matrix has one
row, the elements of the gradient vector ∇f .
The general case includes also the derivative of a function of one variable, f : R → R. The derivative
matrix is just a 1 × 1 matrix.
In the context of transformations, we encountered functions f : Rn → Rn and we computed the
elements of the derivative matrix. The Jacobian of the transformation we now see as the determinant
of the derivative matrix. In the context of transformations the derivative matrix is often called the
Jacobian matrix of the transformation.
• Df is itself a function of the independent variables and this may be indicated by Df (x1 , · · · , xn ) or
by Df (x).
• A function is differentiable if it can be approximated by a linear map (in a certain precise sense which
you will learn in future modules). You know from Linear Algebra that there is a correspondence
between linear maps and matrices. The derivative matrix is the matrix representation of the linear
map corresponding to the derivative.
Finally, we return once again to the Chain Rule. Suppose now that we want to compose two functions
h and f to form a function g = f ◦ h and then take its derivative. We need agreement between the
domain of f and the range of h and we need differentiability, but we will assume this. The question is,
what is the derivative of g in terms of the derivatives of f and h? The answer is given by the general
Chain Rule.
Formally, let h be a differentiable function h : Rn → Rp and let f be a differentiable function
f : Rp → Rm . Compose these to obtain g : Rn → Rm , where g = f ◦ h. The derivative of g is given
by
The General Chain Rule.
Dg(x) = Df (h(x)) Dh(x)
where Dg(x) is an n × m derivative matrix,

Df (h(x)) is an n × p derivative matrix, and Dh(x)
is an p × m derivative matrix.
The Chain Rule is expressed very simply as the product of derivative matrices. It is frequently
expressed in component form using summation
p
∂gi X ∂fi ∂hk
=
∂xj ∂hk ∂xj
k=1
∂fi
where is understood to mean the derivative of fi with respect to its k th argument evaluated at
∂hk
the appropriate point h(x). Recall, I warned you about possible confusion with the Chain Rule because
of compact notation.
In the case n = m = 1 the general Chain Rule reduces to the special case considered in Week 4.
Using the notation from Week 4 where g = f ◦ r, this special case is the following matrix product
 
dx1
 dt 
dg ∂f1 ∂f1   .. 

= ··· .
dt ∂x1 ∂xp   
| {z } | {z } dxp 
1×1 1×p
dt }
| {z
p×1
(Previously n was used where here p appears. We have suppressed where the various derivatives are
evaluated.) Which could be written as before as a sum or using the dot product
p
dg X ∂f dxi dg dr
= , = ∇f ·
dt ∂xi dt dt dt
i=1
Week 8 98

Part Iii: Functions From R Tor

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Part Iii: Functions From R Tor

Uploaded by

Copyright:

Available Formats

Part III: Functions from Rn to Rm

Introduction to Part III

In Part I of the module we considered functions

and in Part II we considered functions

In the remainder of the module we consider the

We shall focus particularly the following import-

• When n = m the function f may be viewed

• Another view of the case n = m is that the

• In the case n = 2 and m = 3 the function

After introducing vector fields, we will focus on

8.1 Vector Fields

There are many situations in which a vector is as-

We typically use boldface to denote vector fields,

We will primarily be interested in vector fields on

• The planar vector field,

corresponds to vectors pointing counterclockwise

corresponds to vectors in space pointing away

• Systems of ordinary differential equations

can be viewed as vector fields on the correspond-

ẋ = F(x) = (f1 (x1 , · · · , xn ), · · · , fn (x1 , · · · , xn ))

• Vector fields arise frequently in applications. Fluid

8.2 Coordinate Transformations

A different way in which mappings Rn → Rn arise

1. Substitute x = sin u into the integrand f (x).

2. Change the limits of integration to correspond

3. Calculate the differential dx in terms of du

Now consider the double integral of f (x, y) = cos(x2 +

We can evaluate this double integral be changing

1. Substitute x = r cos θ, y = r sin θ into the integ-

f (x, y) = cos(r2 cos2 θ + r2 sin2 θ) = cos(r2 )

2. Change the limits of integration to those corres-

3. Use the correct area element for polar coordin-

In both cases we re-wrote the integrals in a more

8.3 Linear Transformations

You are familiar with linear transformations from

T : Rn → Rm : T (αu + βv) = αT (u) + βT (v)

This week we will concentrate on the case of the

T can be expressed as a matrix,

We also think of T in terms of component func-

We previously used this shorthand notation of nam-

although we often do not need such explicit expres-

in a form better suited for integration using a linear

1. Substitute for x and y in terms of u and v in the

f (x, y) = f (x(u, v), y(u, v)) = g(u, v)

where we have written g(u, v) to emphasise that

This will determine the limits of integration of

What remains is to express the area element

at u0 = (u0 , v0 ), and with sides of length 4u and

{(u, v)| u0 ≤ u ≤ u0 + 4u, v0 ≤ v ≤ v0 + 4v}

This gets mapped by T to a parallelogram in the

x0 = T (u0 , v0 ) x1 = T (u0 + 4u, v0 )

The area 4A of this parallelogram is given by

(For the cross product these are viewed as vectors

or transposing to bring into standard form

4A = kr1 × r2 k = |J| 4u4v

For infinitesimally small du and dv we have the

Combining everything, we have the following

8.4 General transformations in R2

We now generalise this approach from linear map-

which now are not assumed to be linear. We re-

{(u, v)| u0 ≤ u ≤ u0 + 4u, v0 ≤ v ≤ v0 + 4v}

This gets mapped by T to a slightly curved paral-

x0 = T (u0 , v0 ) x1 = T (u0 + 4u, v0 )