Professional Documents
Culture Documents
(6) If W and w are skewsymmetric tensor and its axial vector, respectively, then show
that
(i) cof W = w ⊗ w,
(ii) W (u × v) = (W u) × v + u × (W v), ∀u, v ∈ V.
(iii) cof (I − W ) = I − W + cof W .
(iv) det (I − W ) = 1 + w · w.
(7) If W is skewsymmetric tensor the show that Q = (I − W )−1 (I + W ) is an orthogonal
tensor.
(9) If Q is improper orthogonal tensor, i.e., det (Q) = -1, then show that I + Q is not
an invertible tensor.
(10) Let u, v and v be linearly independent vectors, i.e., [u, v, v] 6= 0. Then prove that
(i) u × v, v × w and w × u are also linearly independent,
(ii) (u × v) ⊗ w + (v × w) ⊗ u + (w × u) ⊗ v = [u, v, v]I.
(11) Let R and S be second-order tensors. Then show that
(1) Let T be a second-order tensor and U be a second-order tensor such that ||U || = 1.
Then show that the directional derivative of G(T ) = cof T along the direction of U ,
DG(T )[U ] = ((tr T )(tr U ) − tr(T U )) I + T T − (tr T )I U T + U T − (tr U )I T T .
(6) Let Γ be a surface bounded by a closed curve C. Let n be unit normal to Γ. Let u
and v be vector fields. Then show that
I Z
(u ⊗ v)dx = [(∇u)(v × n) + (n · (∇ × v)) u] ∂Γ.
C Γ
Module-3: Kinematics
Exercises
(1) Let θ0 be an angle between two line segments, dX1 and dX2 , at a point in the
reference configuration. Let θ be an angle between the corresponding line segments in
deformed configuration. Then show that the angles are related by
2dX1 · EdX2 + cos θ0 |dX1 ||dX2 |
cos θ = √ √
dX1 · CdX1 dX2 · CdX2
where E is the Green strain tensor and C is the right Cauchy-Green strain.
(2) The Lagrangian description of motion of continuum medium is given by
x1 = X1 e−t + X3 (e−t − 1)
x2 = X2 et + X3 (1 − e−t )
x3 = X 3 e t
find out the Eulerian description of the motion; velocity and acceleration in both La-
grangian and Eulerian description.
(3) The motion of continuum is defined by the velocity components v1 = −αx2 , v2 = αx1 ,
and v3 = β where α and β are non-zero constants. Then
(i) show that the motion is isochoric,
(ii) Find out the components of acceleration.
(6) Let v, D and W be velocity field, rate of deformation tensor and spin tensor, respec-
tively. Then show that
Dv D
∇· = (∇ · v) + D : D − W : W .
Dt Dt
(7) Let W be a spin tensor. Then show that
Z
v·v
Z
(2W v + (∇x · v)v) ∂Ω = (v · n)v − n ∂Γ.
Ω Γ 2
(1) Two symmetric tensors are said to be coaxial if their eigenvectors are same (i.e., if they
have a set of orthonormal eigenvectors common). Show that the second Piola-Kirchhoff
stress S and the right Cauchy-Green strain C are coaxial if and only if the Cauchy stress
τ and left Cauchy-Green strain tensor B.
(2) Let φ and H be scalar and second-order tensor fields. Then prove the following
transport theorems:
!
d I I
Dφ
φ dx = + φL dx,
dt C C Dt
!
d Z Z
DH
Hn ∂Γ = + (∇ · v)H − H(∇v)T n ∂Γ.
dt Γ Γ Dt
Continuum mechanics is a branch of mathematical physics and it deals with the deforma-
tion of matter under the action of forces and thermal effects. The treatment is given for
all the forms of matter (solids, liquids, and gasses) in a unified framework. The frame-
work of continuum mechanics is developed by assuming fundamental laws of mechanics
and thermodynamics as axioms.
Structure of matter
Well known fact from elementary physics that the matter is composed of molecules, which
in turn consists of atoms. Further, the atoms are composition of elementary particles (elec-
trons, protons, neutrons, etc.). Thus, the matter is composition of elementary particles
irrespective of the phase of the matter (i.e., solid, liquid or gas state). Consequently, the
material properties of matter are related directly to the molecular structure and also to
the intermolecular forces. These forces not only depends on structure of molecules but
also on intermolecular distances. Although the molecules in the matter always undergo
random vibration/motion, the bulk material exhibits a stable behavior at macroscopic
level. Hence, the study of deformation behavior of matter can be approached fundamen-
tally in two ways: (i) considering the molecules as discrete particles which is known as
statistical mechanics (ii) considering the bulk material as continuous medium which is
known as continuum mechanics.
Statistical mechanics
This approach seems useful in unifying the mechanics as every material in any phase
consists of atoms. In this approach, we need to know the intermolecular potentials to
predict the behavior of matter. It can be noted that the separation distance between
two adjacent atoms (i.e., center to center distance) is approximately in the order of 10−8
cm. Furthermore, there are about 1022 atoms present in a gram of copper. Therefore,
this approach becomes too complicated to predict the desired results in any large scale
problem due to the computational difficulties that arise in evaluating particle interactions.
Apart from the computational complexity, it is also very difficult to know the exact
intermolecular potentials. Also, small error in the intermolecular potentials can lead to
a large error in predicted results. Hence, this approach has limited usage for large scale
problems such as failure of turbine blade, building response under seismic waves and
drag forces on aircraft. Thus, the continuum mechanics approach is preferred over the
statistical mechanics in order to solve large scale problems in physics and engineering.
Continuum assumption
In nature, despite having random vibration/motion of molecules, we observe the definite
behavior for the bulk material (i.e., the matter at macroscopic scale). For example there is
definite shape for a wooden block and water in rotating bucket assumes paraboloid shape.
Therefore, the matter can be treated as continuum (i.e., the matter without tiny holes)
at macroscopic scale. In other words, the molecular structure can be disregarded. In fact,
historically, due to stable behavior of matter at macro-scale, the study of mechanics of
solids and fluids came into existence much before the discovery of molecular and atomic
structure of matter.
• The more general definition of tensor accounts scalars and vectors as special cases.
In other words the tensor unifies the definition of physical quantities.
• The physical quantities defined using concept of tensor can be independent coor-
dinate frame. Hence, tensors are ideally suited for describing basic (fundamental)
laws of continuum physics.
of quantification, there is a difficulty arises in the field of rational numbers while taking
limiting process. Hence, it is essential to fill the gaps with irrational numbers. The ratio-
nal numbers and irrational number put together forms a gap-less number system called
real numbers. Therefore, the real number system can be thought of as one dimensional
continuum since there are no gaps. We take the advantage of the limiting process of real
numbers in continuum mechanics.
Three-dimensional Euclidean space and continuum body:
Cartesian product of three sets of real numbers, i.e. the set of all triplets (x, y, z), where
x, y and z are real numbers, is sufficient to represent every point in our real world. Fur-
thermore, the Cartesian product of three sets of real numbers is called three-dimensional
(3-D) Euclidean space. The 3-D Euclidean space is a continuum as the real number set
is continuum. Consequently, it is convenient to represent continuum bodies as bounded
domain in the 3-D space.
The concept of continuum or continuous media allow us to define piecewise continuous
functions on material domain (continuum body) and also permit us to take limits and
differentiation at a point in order to define the stress. Therefore, this approach enables us
to use powerful calculus of several variables and thereby facilitate the study of nonuniform
distribution of stresses in the material body. The quantitative predictions by continuum
theories closely agree with the wide range of experimental observations. Therefore, this
powerful theory can be utilized in complicated engineering problems.
References
We develop the tensor algebra that is required to study the continuum mechanics. In this
lecture, we introduce the concept of vector, vector space, basis, and inner product.
Elementary concept of vector
The vector in elementary physics is defined as the quantity which has magnitude and
direction. The vector is graphically represented by an arrow shown in Fig. (1a) while
the algebraic representation (in 3-D space) is done with three numbers. For example the
vector u is represented as {u1 , u2 , u3 } or u1 e1 + u2 e2 + u3 e3 , q
where e1 , e2 , e3 are unit
vectors along coordinate axes. The length of vector, |u| = u21 + u22 + u23 , represents
the magnitude while direction cosines (cos θ1 , cos θ2 , cos θ3 ) represent the direction of
vector as shown in Fig. (1a). Two vectors can be added and subtracted as shown in
Figs. (1b) and (1c). Two vectors are said to be equal if they have same magnitude and
direction. Examples for the vector quantities are position, velocity, acceleration etc. It
can be observed that the coordinate transformation can alter the components but cannot
affect the vector. For example somebody throws a stone then the velocity vector of stone
does not depend on the choice of coordinate system, i.e., the existence of the vector
is independent of the coordinate frame. However, the components of velocity vector do
depend on the coordinate frame, i.e., the column matrix that represents the velocity vector
does change with coordinate frame. The foregoing discussion on vectors is intuitive from
the elementary physics.
1_
2
|u| = (u12 + u22 + u32 )
cos θ1 = |_
u1
u|
u+ v y u = y-x
u cos θ2 = |u_2
u|
v
θ2
cos θ3 = |_
u3
u|
θ1 u x
θ3
From the previous discussion, it is clear that the definition of elementary concept of
vector requires unit vectors along coordinate axes. But the definition of unit vectors along
coordinates cannot have an independent definition as they are also vectors. Therefore,
the elementary definition of vector is running into a cyclic argument that is similar to the
egg and chicken paradox. We now state the following axiomatic framework for the vector
space to avoid this ambiguity.
Vector space
We consider only the vector spaces over the field of real numbers. Let us denote the set
of real numbers by < and vector space by V. The set, V, equipped with an addition
operation (V × V → V) and a scalar multiplication operation (< × V → V), is called
vector space if it follows:
(i) Associativity: (u + v) + w = u + (v + w), for all u, v, w ∈ V.
(ii) Commutativity: u + v = v + u, for all u, v ∈ V.
(iii) Existence of a zero element: There exist 0 ∈ V such that u + 0 = u, for all u ∈ V.
(iv) Existence of negative elements: For each u ∈ V there exist a unique v ∈ V such that
u + v = 0 and v is denoted as −u.
(v) Associativity in scalar multiplication (or the compatibility of multiplication defined
between the field elements (real numbers) and scalar multiplication operation): α(βu) =
(αβ)u, for all u, ∈ V, and α, β ∈ <.
(vi) Identity in scalar multiplication: There exist a unique element 1 ∈ < such that
1u = u, for all u ∈ V.
(vii) Distributivity with respect to addition operation on vectors: α(u + v) = αu + αv,
for all u, v ∈ V, and α ∈ <.
(viii) Distributivity with respect to scalar addition: (α + β)u = αu + βu, for all u ∈ V,
and α, β ∈ <.
The elements of set V are called vectors. This axiomatic framework generalizes the
notion of vector. We now present few examples for the vector space.
Addition operation:
u + v = (u1 + v1 , u2 + v2 , · · · , un + vn )
It is easy to verify that the set, <n , obey all the axioms of vector space under the addition
and scalar multiplication. Therefore, the set, <n , is a vector space. It can be observed
that the special case n = 3 is our usual three-dimensional (3-D) Euclidean space.
Example-2: Let us consider the set of all m × n matrices over the real numbers,
· · · a1n
a11 a12
· · · a2n
a21 a22
<m×n =
.. .. .. .. : aij ∈ <
. . . .
am1 am2 · · · amn
Addition operation:
···
a11 + b11 a12 + b12 a1n + b1n
a21 + b21 a22 + b22 ··· a2n + b2n
A+B = .. .. .. ..
. . . .
am1 + bm1 am2 + bm2 · · · amn + bmn
Scalar multiplication:
· · · αa1n
αa11 αa12
αa21 αa22 · · · αa2n
αA = .. .. .. ..
. . . .
αam1 αam2 · · · αamn
It is easy to verify that the given set, <m×n , with defined operations is a vector space.
Example-3: Let us consider set of all Lebesgue measurable functions over the domain of
[0,1], as stated in the following expression,
Z 1
p p
L [0, 1] = f : |f | dx < ∞ ,
0
Linear independence:
A subset {u1 , u2 , · · · , un } of V is said to be linearly dependent if and only if there exist
set of scalars α1 , α2 , · · · , αn , not all zero, such that
α1 u 1 + α2 u 2 + · · · + αn u n = 0
If such nonzero scalars do not exist, i.e, α1 = α2 = · · · = αn = 0, then the set vectors,
{u1 , u2 , · · · , un }, are said to be linearly independent.
Basis vectors:
A subset {u1 , u2 , · · · , un } of V is said to be the basis if the subset is linearly indepen-
dent and linear combination of this subset spans the total set V, i.e. there exist some
Figure 2: Vector representation in 3-D: (a) components of vector (b) two vectors separated
by an angle θ.
Now, an alternative definition is stated for dot product which is equivalent to the one
defined in Eq. (1).
u · v = u1 v1 + u2 v2 + u3 v3 (3)
We note that the definition of dot product given in Eq. (3) accounts for both length of
vector shown in Eq. (2) and also angle between vectors shown in Eq. (1). We will prove
the equivalence of definitions stated in Eq. (1) and Eq. (3).
Problem 1. In 2-D Euclidean space with canonical basis ({e1 = (1, 0), e2 = (0, 1)}) the
definition of dot product defined in Eq. (1) and Eq. (3) are equivalent, i.e. |u||v| cos(θ) =
u1 v1 + u2 v2
Proof-1. Let us consider two vectors in two dimensional space as shown in the Fig. (3).
We note that e1 is unit vector along horizontal axis and e2 is unit vector along vertical
axis.
u2 u
v
v2 θ
θv θu
u1
v1
Let θu and θv be angles made by vectors u and v with respect to horizontal axis as shown
in Fig. (3). Therefore, we have
u1 u2 v1 v2
cosθu = , sinθu = , cosθv = , sinθv = . (5)
|u| |u| |v| |v|
u1 v1 + u2 v2 = |u||v|cosθ (7)
The result can be generalized to 3-D as any two vectors from two-dimensional subspace.
We can have the following alternative proof for the same result based on cosine rule.
Proof-2. We have two vectors u and v with an angle θ between them. Therefore, the
vectors u, v and u − v forms
q a triangle
q as shown q
in Fig. (4). Three sides of triangle are
2 2 2 2
|u|, |v| and |u − v|, i.e., u1 + u2 , v1 + v2 and (u1 − v1 )2 + (u2 − v2 )2 .
u u-v
θ
v
We now apply the cosine rule to the triangle shown in Fig. (4), i.e.,
|u − v|2 = |u|2 + |v|2 − 2|u||v|cosθ (8)
We can rewrite Eq. (8) as
1
|u||v|cosθ = (|u|2 + |v|2 − |u − v|2 )
2
1 2 1 1
= (u1 + u22 ) + (v12 + v22 ) − ((u1 − v1 )2 + (u2 − v2 )2 )
2 2 2
= u1 v1 + u2 v2
Thus, the result is proved. This can be easily generalized to the vectors in 3-D space.
Inner product:
We saw the general definition of vector space in the previous discussion. Now, we gener-
alize the definition of dot product over the vector space, which is known as inner product
or scalar product.
Let V be a vector space. The inner product is a function from V × V → <, denoted by
(u, v), satisfying the following properties:
(i) Linearity: (αu + βv, w) = α(u, w) + β(v, w) ∀α, β ∈ < and ∀u, v, w ∈ V
(ii) Symmetry: (u, v) = (v, u) ∀u, v ∈ V
(iii) Positive-definiteness: (u, u) ≥ 0, ∀u ∈ V and (u, u) = 0 if and only if u = 0.
The definition of inner product brings out the geometric quantities such as length and
orthogonality. The vector space equipped with inner product is called inner product space
or Hilbert space.
Example-4: There is an inner product on the vector space <n . It is defined as,
n
X
u · v = u1 v1 + u2 v2 + · · · + un vn = ui vi (9)
i=1
This definition satisfies all the properties of inner product. It is known as standard inner
product on <n .
Example-5: The choice of inner product is not unique. For any given positive definite
matrix S, another choice of inner product on <n is
n X
X n
u·v = ui Sij vj (10)
i=1 j=1
u = e1 + 2e2
e2 e'2 v = 2e1 + e2
e 1 e'1
Solution: Given vectors, u = e1 + 2e2 and v = 2e1 + e2 are in canonical basis. Length of
√ √
vectors, |u| = 5 and |v| = 5.
Let θ be the angle between two vectors. Then
u·v u1 v1 + u2 v2 4
cos θ = = =
|u||v| |u||v| 5
Both vectors u and v have the following representation in the basis B1 and B2 .
Length of vectors,
1 1
q 2 X
2 2
√ q 2 X
2 2
√
u0i Sij u0j = 0 0
X X
|u| = (u, u) = 5 and |v| = (v, v) = v Sij v
i j = 5
i=1 j=1 i=1 j=1
Orthogonality:
If the inner product between two non-zero vectors is zero then two vectors are said to be
orthogonal. Let u and v be orthogonal vectors in an inner product space V. Then
(u, v) = 0.
and the equality holds if and only if u and v are linearly dependent.
Proof. If (u, v) = 0 then the inequality is trivial. Let us assume both vectors u and v
are non-zero. The positive definite property of inner product implies
The quadratic function f (α) is minimum at α = (u, v)/(v, v). Thus, we obtain the
Cauchy-Schwartz inequality by substituting α = (u, v)/(v, v) in previous equation. If
equality holds in Eq. (11) then we get (u − αv, u − αv) = 0 where α = (u, v)/(v, v). The
relation (u − αv, u − αv) = 0 implies u = αv, i.e., u and v are linearly dependent.
The application of the Cauchy-Schwartz inequality to <n and L2 [0, 1] spaces yields
n
!2 n
! n
!
u2i vi2 ,
X X X
ui vi ≤
i=1 i=1 i=1
Z 1 2 Z 1 Z 1
f (x)g(x)dx ≤ f (x)2 dx g(x)2 dx .
0 0 0
The triangle inequality follows from the Cauchy-Schwartz inequality. The proof is pre-
sented next.
Triangle inequality:
Let V be an inner product space. Then
Proof:
|u + v|2 = (u + v, u + v)
= |u|2 + 2(u, v) + |v|2
≤ |u|2 + 2|(u, v)| + |v|2 (Since (u, v) ≤ |(u, v)|)
≤ |u|2 + 2|u||v| + |v|2 (By Cauchy-Schwartz inequality)
2
≤ (|u| + |v|)
We can get the triangle inequality by taking square-root on both sides. The triangle
inequality is essential in defining the normed vector space where the definition of length
or norm of vector is the main task.
1
kuk := |u| = (u, u) 2 .
Converse is not true, i.e., normed vector space is not necessarily an inner product space.
Consequently, the normed space is subset of general vector space and the inner product
space is a subset of normed vector space. This fact is depicted in Fig. (6). Furthermore,
the concept of length is defined for the normed space whereas the length and also angle
between vectors is defined for the inner product spaces. In continuum mechanics, we
require the definition of both length of vector and angle between vectors. Thus, all the
analysis are being done in the inner product spaces.
Figure 6: Venn diagram for vector spaces, normed spaces, and inner product spaces
Reference
The concept of the length of vector was discussed in previous lecture. We now introduce
the definitions of area and volume. To introduce these new quantities, we require the
definition of cross product between two vectors. We also introduce the index notation
along with some conventions to handle the vector algebra. These conventions are useful
in handling cumbersome algebraic operations between the vectors. Hereon we consider all
the vectors belonging to Euclidean three-dimensional space with orthonormal basis. The
orthonormal basis to the Euclidean three-dimensional space is known as Cartesian frame.
The summation convention:
Let us consider the sum,
s = u1 v1 + u2 v2 + u3 v3
We, usually, write the preceding equation using summation sign
3
X
s= ui vi . (1)
i=1
If the index i replaced with j or k in above expression the meaning would not alter, i.e.,
3
X 3
X 3
X
s= ui vi = uj vj = uk vk (2)
i=1 j=1 k=1
Therefore, the index i in Eq. (1) is called dummy index as it can be changed to any other
letter as shown in Eq. (2). Of course the dummy index appears twice in a term. We now
simplify the way of expressing Eq. (1) by dropping the explicit summation symbol, i.e.,
s = ui vi .
Although the summation symbol is not written explicitly, the summation is taken over
the index when it appears twice in a term. Of course the summation can also be expressed
s = uj vj as the index is dummy. This convention, i.e., taking the summation over dummy
index, is called Einstein’s summation convention. We follow this convention throughout
the tensor algebra and calculus.
We further make a note that no index should appear more than twice in a term. In
other words, the summation is not implied in a term ui vi wi over index i. If we want to
obtain summation then summation symbol should be written explicitly, i.e.,
3
X
ui vi wi .
i=1
Example-1: We can have Aii = A11 + A22 + A33 . In this sum i is the dummy index. Since
the dimension of space is three, we have taken the summation over index i from 1 to 3.
If dimension of space is n, the summation should be taken over i from 1 to n.
with α = Aij ui vj and β = Bijk ui vj wk . The indices i and j are dummy indices in first
expression while i, j and k are dummy indices in second expression.
Free index:
If the index appear once in a term is called free index. Let us consider the system of linear
equations,
y1 = A11 x1 + A12 x2 + A13 x3
y2 = A21 x1 + A22 x2 + A23 x3
y3 = A31 x1 + A32 x2 + A33 x3
Using the summation convention, we get
y1 = A1j xj , y2 = A2j xj , y3 = A3j xj
These equations can also be written as,
yi = Aij xj , i = 1, 2, 3 (3)
The index j is dummy as it appears twice in a term on the right hand side of equation. On
the other hand, the index i is called free index since it appears once in a term. Of course
the index i appears once on left hand side and again once on the right hand side but once
in a term. In this example, clearly, index j is the dummy index and index i is the free
index. Usually the free index takes values 1, 2 or 3, unless stated explicitly. In conclusion,
the short form of three system of linear equations can be written as yi = Aij xj .
Example-1 For a given vector u and orthonormal basis {e1 , e2 , e3 }, the component along
basis vector
ui = u · ei , (4)
where the index i is free index. Furthermore, for the given components and also basis,
the vector can be represented by
u = uj ej , (5)
where j is dummy index. Upon substitution of Eq. (4) in Eq. (5), we get
u = (u · ei )ei (6)
Example-2: Consider the following set of equations,
A11 = B111 x1 + B112 x2 + B113 x3 , A12 = B121 x1 + B122 x2 + B123 x3 ,
A13 = B131 x1 + B132 x2 + B133 x3 , A21 = B211 x1 + B212 x2 + B213 x3 ,
A22 = B221 x1 + B222 x2 + B223 x3 , A23 = B231 x1 + B232 x2 + B233 x3 ,
A31 = B311 x1 + B312 x2 + B313 x3 , A32 = B321 x1 + B322 x2 + B323 x3 ,
A33 = B331 x1 + B332 x2 + B333 x3 .
The set of equations is represented by Aij = Bijk xk in the index notation. Clearly, i and
j are free indices while k is a dummy index.
x1 = B111 A11 + B112 A12 + B113 A13 + B121 A21 + B122 A22 + B123 A23 + B131 A31 +
B132 A32 + B133 A33
x2 = B211 A11 + B212 A12 + B213 A13 + B221 A21 + B222 A22 + B223 A23 + B231 A31 +
B232 A32 + B233 A33
x3 = B311 A11 + B312 A12 + B313 A13 + B321 A21 + B322 A22 + B323 A23 + B331 A31 +
B332 A32 + B333 A33 .
The set of equations in index notation is represented by xi = Bijk Ajk . In this expression,
i is free index while j and k are dummy indices.
Example-4: Consider the following set of equations,
• The index that appear twice in a term is called dummy index and summation is
taken over all such dummy indices.
• The index that appear once but in every term is called free index and no summation
is taken over such free indices.
It is clear from the definition of Kronecker delta that they are components of the identity
matrix. Following are the properties of Kronecker delta:
(i) δij = δji
(ii) δii = δ11 + δ22 + δ33 = 3
(iii) xi = δij xj , τij = τik δkj , and so on
Because of property (iii) in the preceding list, the Kronecker delta is known as substitution
operator.
ej × ek = ijk ei (9)
where ijk is permutation symbol or ijk th component the alternate tensor ε, and it is
given by
Let us take dot product on both sides of Eq. (9) with em then we get
We note that the dot product is taken with em instead of ei to avoid repetition of index
more than twice. Replacing the index i in place of m in the preceding equation, we get
The distributivity of cross product is needed along with the definition stated in Eq. (9)
to perform the cross product between any two vectors.
Example-5: Let w be the resultant vector of cross product between two vectors u and v.
Then express w in terms of components of u, v.
w = u×v
= (uj ej ) × (vk ek ) (from Eq. (5))
= uj vk (ej × ek ) (using distributive property)
= ijk uj vk ei
Let V be a vector space. Let u, v and w be three vectors in V. Then the scalar triple
product, denoted by [u, v, w], is defined by
[u, v, w] := u · (v × w) (13)
Clearly, the scalar triple product is a function from V × V × V to <. The absolute value
of scalar triple product gives the volume of parallelepiped formed by three vectors. The
scalar triple product in indicial notation,
[u, v, w] := ijk ui vj wk . (14)
Properties of scalar triple product:
(i) [u, v, w] = [w, u, v] = [v, w, u] = −[v, u, w] = −[w, v, u] = −[u, w, v] ∀u, v, w ∈ V
(ii) Multi-linear function or in other words linear in every variable, i.e.,
[αu + βx, v, w] = α[u, v, w] + β[x, v, w] ∀x, u, v, w ∈ V and ∀α, β ∈ <
[u, αv + βy, w] = α[u, v, w] + β[u, y, w] ∀y, u, v, w ∈ V and ∀α, β ∈ <
[u, v, αw + βz] = α[u, v, w] + β[u, v, z] ∀z, u, v, w ∈ V and ∀α, β ∈ <
(iii) The scalar triple product can be written in determinant form as,
u1 u2 u3
[u, v, w] = det v1 v2 v3 . (15)
w1 w2 w3
We know from matrix algebra that the determinant1 for above 3×3 matrix can be written
as u1 (v2 w3 − w2 v3 ) − u2 (w3 v1 − v3 w1 ) + u3 (v1 w2 − w1 v2 ). Therefore, this form can be easily
verified using the definition shown in Eq. (14).
(iv) The scalar triple product [u, v, w] is zero if and only if the vectors u, v and w are
linearly dependent, i.e., all three vectors lie on a plane.
The epsilon-delta identity:
The permutation symbol (ijk th component of alternate tensor ε) can be represented using
scalar triple product shown in Eq. (11). Using property (iii) of scalar triple product shown
in Eq. (15) and Eq. (4), we get
ei · e1 ei · e2 ei · e3 δi1 δi2 δi3
ijk = [ei , ej , ek ] = det ej · e1 ej · e2 ej · e3 = det δj1 δj2 δj3 (16)
Using Eq. (17) and a property of Kronecker delta, δii = 3, we get the following famous
-δ identity
ijk iqr = δjq δkr − δjr δkq (18)
As a consequence of -δ identity, we also have
Example-6: Show that the vector triple product, in general, is not associative i.e., u ×
(v × w) is different from (u × v) × w.
Solution:
u × (v × w) = ijk uj (v × w)k ei
= ijk uj kmn vm wn ei
= kij kmn uj vm wn ei
= (δim δjn − δin δjm )uj vm wn ei
= (δim δjn uj vm wn − δin δjm uj vm wn )ei
= (uj vi wj − uj vj wi )ei
= (uj wj )vi ei − (uj vj )wi ei
= (u · w)v − (u · v)w
(u × v) × w = ijk (u × v)j wk ei
= ijk jmn um vn wk ei
= jki jmn um vn wk ei
= (δkm δin − δkn δim )um vn wk ei
= (δkm δin um vn wk − δkn δim um vn wk )ei
= (uk vi wk − ui vk wk )ei
= (uk wk )vi ei − (vk wk )ui ei
= (u · w)v − (v · w)u
It is clear that the vector triple product of non-zero vectors u, v and w is associative if
either u and w are linearly dependent or all three vectors are mutually orthogonal. In
general, the vector triple product does not follow associative property.
References
We know from elementary physics that the quantities such as density and temperature
are represented by scalar functions while velocity and acceleration are represented by
vectors. But the stress and strain cannot be represented by a scalar or a vector. Thus, we
introduce the concept of tensor to represent these new physical quantities. In this lecture,
we specialize our discussion to second-order tensors after defining the tensor.
Tensor:
Let V and W be two vector spaces. Then the linear function L that is defined from
V → W is called a tensor1 , i.e., the function L : V → W with the following property is
called tensor.
We note that L(αu + βv), Lu and Lv are vectors in the vector space W. Furthermore,
the linear function L is also known as linear transformation. It can be observed from the
following examples that both the scalar and vector are special cases of tensor.
Example-1: We know from Example-1 in Lecture-2 that the set of real numbers < forms
a vector space. Let us choose V = W = < in above definition. Let f be a linear function
defined from V → W, i.e., f (αx+βy) = αf (x)+βf (y) for all α, β, x, y ∈ <. Then there is
a scalar c such that f (x) = cx for every x ∈ <. In other words f assumes a representation
of a scalar. It is easy to verify that the function follows the linearity property defined
above. Thus, all scalars are tensors.
Example-2: Let us choose V = <3 and W = < in above definition of tensor. Let f : V →
W be a linear function, i.e., f (αx + βy) = αf (x) + βf (y) for all x, y ∈ <3 and for all
α, β ∈ <. Then there exist a vector v ∈ <3 such that f (u) = v · u. It is easy to see that
the function f follows the above stated linearity. Thus, every vector in vector space <3 is
also a tensor.
Clearly, the scalars and vectors are included in the definition of tensor. It is possible to
define various tensors by choosing the vector spaces V and W. We know that the scalars
have one component whereas the vectors have three components in a given coordinate
frame. We will see that the second-order tensor possess nine components. In general nth
order tensors have 3n components. Therefore, the scalars are zeroth-order tensors and the
vectors are first-order tensors.
We now discuss the algebra of second-order tensors.
1
Linear transformation is a synonym for tensor
We know from Example-1 in Lecture-1 that set of scalars forms one dimensional vector
space and set of vectors forms an n-dimensional vector space. We now show that the
collection tensors also forms a new vector space (space of second-order tensors). We
denoted the new vector space formed by second order tensors with Lin.
Algebra of tensors:
Let us collect the set of all tensors defined from V to W and denote by Lin. We now
define sum, R + S, of two tensors R and S and the scalar product, αT , of tensor T by
We note that Ru, Su and T u are vectors in the vector space W. As a consequence of
definitions of sum and scalar product, the set of all tensors forms a new vector space over
the field of real numbers. This fact can be verified using vector space axioms stated in
Lecture-2. Analogous to zero vector, there is a zero tensor, O in the vector space Lin.
The zero tensor is defined by
Ou = 0 ∈ W, ∀u ∈ V.
T + O = T ∀T ∈ Lin.
All other axioms of vector space are easy to verify. Thus, with the definition of addition
and scalar multiplication given in Eq. (2) and (3), the set Lin is a vector space over <.
We will see in later lectures that stress, strain and rate of deformation are second-order
tensors. Therefore, second-order tensors play an important role in continuum mechanics.
Thus, we specialize our discussion more towards the second-order tensors than other
higher-order tensors. Here on tensor without specifying order would be treated as second-
order tensor.
Ru = Su, ∀u ∈ V. (4)
It is easy to see that the condition shown in Eq. (4) implies the condition shown in Eq.
(5). Using linearity of inner product, we can write Eq. (5) as,
Iu = u, ∀u ∈ V.
This product or composition is similar to the usual product of two matrices. We note
that any tensor, T , on pre-multiplication or post multiplication with identity tensor I
does not alter the action of T on vectors, i.e.,
T I = IT = T ∀T ∈ V 2 .
Transpose of tensor:
Let T T be a transpose of tensor T . Then the transpose of tensor T , using inner product,
is defined by
(T T u, v) = (u, T v), ∀u, v ∈ V (7)
The transpose of tensor has the following properties:
(T T )T = T
(αT )T = αT T
(R + S)T = RT + S T
Symmetric tensor:
The tensor S is said to be symmetric if S T = S.
Skewsymmetric tensor:
The tensor W is said to be skewsymmetric (antisymmetric) if WT = −W.
Additive decomposition of tensor:
Every second-order tensor T can be uniquely decomposed into a symmetric and a skew-
symmetric part as
T = Ts + Tss (8)
where
1 1
Ts = (T + T T ), Tss = (T − T T )
2 2
Components of second-order tensor in a coordinate frame:
Let e1 , e2 and e3 be an orthonormal basis to the Euclidean three-dimensional vector space
V. Let T be a second-order tensor from V to V. Then, we have
Tu = v (9)
where u and v are in V. Note that u and v are linear combination of basis vectors as
they are in V. Choosing e1 , e2 , and e3 for u in Eq. (9), we get
T e1 = α1 e1 + α2 e2 + α3 e3
T e2 = α4 e1 + α5 e2 + α6 e3
T e3 = α7 e1 + α8 e2 + α9 e3 ,
T ej = Tij ei . (10)
Clearly, i is dummy index and j is the free index. The scalars Tij are called components
of the tensor T with respect to the base vectors ei , i = 1, 2, 3. Taking the dot product
both sides of Eq. (10) with ek , we get
vi ei = T (uj ej ) = uj T ej = uj Tij ei .
vi = Tij uj (13)
Let us consider a vector v. Then the transformation law for the component of the vector
is given by,
vi0 = v · e0i = v · (lij ej ) = lij v · ej = lij vj (17)
Similar to transformation of vector components, using Eq. (11) and Eq. (15), we get the
following transformation law for the second-order tensor.
It is easy to see from above equation that a ⊗ b is a linear function from V to V. Thus,
a ⊗ b is a second-order tensor.
Representation of second-order tensor:
Every second-order tensor T assumes the representation
T = Tij ei ⊗ ej . (20)
Considering a vector u and the representation of tensor shown in Eq. (20), we get
This holds for all u ∈ V. Therefore, any second-order tensor admits the representation
shown in Eq. (20). We already showed that the set of second-order tensors V 2 is a real
vector space. We now state that the set of tensors {ei ⊗ ej }, i = 1, 2, 3 and j = 1, 2, 3
forms a basis to the space of second-order tensors. In fact the linear combination can
span every second-order tensor which is shown as representation in Eq. (20).
I = ei ⊗ ei . (21)
Conversely, let Tij = Rij then we want to show that (u, T v) = (u, Sv) ∀u, v ∈ V.
Consider the dyadic product representation of two tensors in a given basis, i.e,
We can have the following scalar product of two vectors u and v with respect to tensors,
(u, T v) − (u, Sv) = (ui ei , T (vj ej )) − (ui ei , S(vj ej )) (Since u = ui ei and v = vj ej )
= ui vj (ei , T ej ) − ui vj (ei , Sej )
= ui vj Tij − ui vj Sij
= ui vj (Tij − Sij )
= 0 (Since Tij = Sij ).
Thus, two second-order tensors are equal if and only if respective components are equal
in a given basis.
The vector and tensor components follow the transformation law then the physical vector
and tensor are independent of coordinate frame. This is true for the higher order tensors
too. Therefore, some authors consider the transformation law as definition for the tensor.
Reference
As noted in Lecture-4, the components of tensor are altered with the change of basis
although the tensor is invariant. We now ask a question that are there any relations
among the components such that they are invariant under change of basis for a given
tensor. The answer is affirmative and in fact there are three such kind of invariant
relations exist for a second-order tensor. These three relations are known as principal
invariants. Of course, any other invariant relation can be expressed as function of these
three principal invariants. The relations can be obtained from the following equations.
Let {u, v, w} and {a, b, c} be two set of linearly independent vectors and let T be a
second-order tensor. Then
[T u, v, w] + [u, T v, w] + [u, v, T w] [T a, b, c] + [a, T b, c] + [a, b, T c]
= (1)
[u, v, w] [a, b, c]
[T u, T v, w] + [u, T v, T w] + [T u, v, T w] [T a, T b, c] + [a, T b, T c] + [T a, b, T c]
= (2)
[u, v, w] [a, b, c]
[T u, T v, T w] [T a, T b, T c]
= (3)
[u, v, w] [a, b, c]
Problem 1. Let {u, v, w} and {a, b, c} are two sets of linearly independent vectors.
Then prove Eq. (1).
Proof. Since the set of vectors {u, v, w} are linearly independent they form a basis to the
vector space V. Let T be a second-order tensor. Then
T u = α1 u + β1 v + γ1 w
T v = α2 u + β2 v + γ2 w
T w = α3 u + β3 v + γ3 w (4)
Using properties of scalar triple product (see Lecture-3 for properties of scalar triple
product) and above relations, we get
[T u, v, w] + [u, T v, w] + [u, v, T w]
= [α1 u + β1 v + γ1 w, v, w] + [u, α2 u + β2 v + γ2 w, w] + [u, v, α3 u + β3 v + γ3 w]
= α1 [u, v, w] + β2 [u, v, w] + γ3 [u, v, w]
= (α1 + β2 + γ3 )[u, v, w] (5)
Let {e1 , e2 , e3 } be orthonormal basis. Then the vectors u, v and w have the representa-
tion u = ui ei , v = vj ej and w = wk ek . Therefore, the relation shown in Eq. (5) can also
be expressed as
[T u, v, w] + [u, T v, w] + [u, v, T w]
= [T ui ei , vj ej , wk ek ] + [ui ei , T vj ej , wk ek ] + [ui ei , vj ej , T wk ek ]
= ui vj wk {[T ei , ej , ek ] + [ei , T ej , ek ] + [ei , ej , T ek ]} (6)
x = αy, x = βz.
[T x, y, z] + [x, T y, z] + [x, y, T z] = 0
Case-ii: There are two linearly independent vectors among three, i.e., one vector can be
written as linear combination of other two vectors. Without loss of generality, we can
write
z = αx + βy
Using properties of scalar triple product, we get
[T u, v, w] + [u, T v, w] + [u, v, T w]
tr(T ) =
[u, v, w]
Problem 2. Let {u, v, w} and {a, b, c} are two set of linearly independent vectors. Then
prove Eq. (2).
Proof. This can be proved similar to the previous problem. Let u, v, w be independent
vectors and let T be a tensor. Then we have Eq. (4). Using properties of scalar triple
product (see Lecture-3 for properties of scalar triple product) and Eq. (4), we get
[T u, T v, w] + [u, T v, T w] + [T u, v, T w] = [α1 u + β1 v + γ1 w, α2 u + β2 v + γ2 w, w]
+[u, α2 u + β2 v + γ2 w, α3 u + β3 v + γ3 w]
+[α1 u + β1 v + γ1 w, v, α3 u + β3 v + γ3 w]
= α1 β2 [u, v, w] + β1 α2 [v, u, w]
+β2 γ3 [u, v, w] + γ2 β3 [u, w, v]
+α1 γ3 [u, v, w] + γ1 α3 [w, v, u]
= {(α1 β2 − β1 α2 ) + (β2 γ3 − γ2 β3 )
+(α1 γ3 − γ1 α3 )}[u, v, w]. (11)
Let {e1 , e2 , e3 } be orthonormal basis to the vector space. Then we have u = ui ei , v = vj ej
and w = wk ek . An alternative form of Eq. (11) can be written as
[T u, T v, w] + [u, T v, T w] + [T u, v, T w]
= [T ui ei , T vj ej , wk ek ] + [ui ei , T vj ej , T wk ek ] + [T ui ei , vj ej , T wk ek ]
= ui vj wk {[T ei , T ej , ek ] + [ei , T ej , T ek ] + [T ei , ej , T ek ]}. (12)
Choosing u = ei , v = ej and w = ek and substituting in Eq. (11), we get
[T ei , T ej , ek ] + [ei , T ej , T ek ] + [T ei , ej , T ek ]
= {(T11 T22 − T12 T21 ) + (T22 T33 − T23 T32 ) + (T33 T11 − T31 T13 )}[ei , ej , ek ]
= ijk {(T11 T22 − T12 T21 ) + (T22 T33 − T23 T32 ) + (T33 T11 − T31 T13 )} (13)
Let IIT be second invariant of tensor T . Then we have
IIT = (T11 T22 − T12 T21 ) + (T22 T33 − T23 T32 ) + (T33 T11 − T31 T13 ). (14)
Substituting Eq. (13) in Eq. (12), we get
[T u, T v, w] + [u, T v, T w] + [T u, v, T w] = IIT ijk ui vj wk
= IIT [u, v, w] (15)
The arbitrariness of u, v and w implies Eq. (2).
If the vectors x, y and z are linearly dependent then we get [T x, T y, z]+[x, T y, T z]+
[T x, y, T z] = 0. This can be proved by following similar steps that were adopted in
previous cases of first invariant.
Problem 3. If {u, v, w} and {a, b, c} are two set of linearly independent vectors then
prove Eq. (3).
[T u, T v, T w] = [α1 u + β1 v + γ1 w, α2 u + β2 v + γ2 w, α3 u + β3 v + γ3 w]
= α1 β2 γ3 [u, v, w] + α1 β3 γ2 [u, w, v] + β1 α2 γ3 [v, u, w]
+β1 γ2 α3 [v, w, u] + γ1 α2 β3 [w, u, v] + γ1 α3 β2 [w, v, w]
= {α1 (β2 γ3 − β3 γ2 ) − β1 (α2 γ3 − γ2 α3 ) + γ1 (α2 β3 − α3 β2 )}[u, v, w] (16)
[T u, T v, T w] = [T ui ei , T vj ej , T wk ek ] = ui vj wk [T ei , T ej , T ek ] (17)
[T ei , T ej , T ek ] = ijk {T11 (T22 T33 − T23 T32 ) − T12 (T21 T33 − T23 T31 ) + T13 (T21 T32 − T31 T22 )}
(18)
Let IIIT be third invariant which is defined by
IIIT = T11 (T22 T33 − T23 T32 ) − T12 (T21 T33 − T23 T31 ) + T13 (T21 T32 − T31 T22 ). (19)
The third invariant of tensor associated with Eq.(3) is known as determinant of tensor.
If u, v and w are linearly independent vectors then the determinant
[T u, T v, T w]
det(T ) =
[u, v, w]
Properties of determinant:
(i) det(T ) =det(T T ) ∀T ∈ V 2
(ii) det(αT ) = α3 det(T ) ∀T ∈ V 2 and ∀α ∈ <
(iii) det(RS) = det(S)det(R) = det(SR) ∀R, S ∈ V 2
(iv) det(I) = 1, where I is identity tensor.
Cofactor of tensor:
We know how vectors are being transformed under the action of tensor. We now want
to know how the area is being transformed under the action of tensor. Recall from
Lecture-3 that area of parallelogram formed with two vectors can be obtained by the
cross-product. We now define a new tensor called cofactor of tensor that takes care of
areal transformation.
Definition of cofactor:
The cofactor of tensor T is defined by
We note that the cofactor of tensor cof T is also a second-order tensor as input and
output are vectors.
Problem 4. Let R and S are two second-order tensors. Then show that
The arbitrariness of vectors u, v and w imply the trace of cofactor tensor, tr(cof T), is
equal to the second invariant of the tensor T .
Problem 5. Let T be a second-order tensor. Then prove that the components of cofactor
1
(cof T )ij = imn jpq Tmp Tnq . (27)
2
Thus, det(T ) represents the scale factor for the volume transformation under the action
of tensor T . Using the definition of scalar triple product in Eq. (28), we get
det(T )[u, v, w] = T u · (T v × T w)
=⇒ [det(T )u, v, w] = T u · ((cof T )(v × w)) (By definition of cofactor)
T
=⇒ det(T )Iu · (v × w) = (cof T ) T u · (v × w) (Using definition of transpose)
where I is the identity tensor. Arbitrariness of u, v and w implies
(cof T )T T = det(T )I (29)
Using Eq. (27) and Eq. (29), we can obtain the following expression for determinant in
indicial notation.
tr((cof T )T T ) = tr(det(T )I)
=⇒ (cof T )ij Tij = det(T )tr(I)
1
=⇒ det(T ) = imn jpq Tij Tmp Tnq (Since tr(I) = 3)
6
Summary of principal invariants:
In summary, we have the following principal invariants.
T −1 T = T T −1 = I (30)
where I is the identity tensor. If the determinant of tensor T is non-zero, i.e., det(T ) 6= 0
then there exists a unique inverse. Using Eq. (29), the inverse can be written as
1
T −1 = (cof T )T (31)
det(T )
Properties of inverse:
(i) If T is invertible and T u = v then u = T −1 v
(ii) T −1 (αu + βv) = αT −1 u + βT −1 v
The above properties show that the inverse is a second-order tensor.
Since ((u ⊗ v)x) × ((u ⊗ v)y) = ((v · x)u) × ((v · y)u) = 0, we can write
T −1 (u ⊗ v)T −1
(T + u ⊗ v)−1 = T −1 − (33)
1 + u · T −1 v
(T −1 u) ⊗ v
!
−1
(T + u ⊗ v) = I− T −1
1 + v · T −1 u
T −1 (u ⊗ v)
!
= I− T −1
1 + v · T −1 u
T −1 (u ⊗ v)T −1
= T −1 −
1 + v · T −1 u
This update-formula for inverse is known as Sherman-Morrison formula. Since the rank
of u ⊗ v is one, some authors refer this formula as rank-one update. This formula is also
applicable to higher dimensional vector space.
Reference
We shall see in forthcoming lectures that the eigenvalues and eigenvectors play an impor-
tant role in obtaining the principal stresses and principal directions. Furthermore, it helps
in defining generalized strain measures. We note that the eigenvalues and eigenvectors
are also known as characteristic values and characteristic vectors or proper values and
proper vectors. We now define the eigenvalue and eigenvector.
Let T be a second-order tensor. If there exists a non-zero vector n and a scalar λ such
that
T n = λn, (1)
then the vector n is said to be an eigenvector and the scalar λ is said to be an eigenvalue.
Equation (1) is known as eigenvalue problem. We can also express Eq. (1) as (T −λI)n =
0. We now prove that there exists a non-zero vector n such that (T − λI)n = 0 if and
only if det(T − λI) = 0.
Problem 1. Let R be a second-order tensor. Then there is a non-zero vector u such that
Ru = 0 if an only if det(R) = 0.
Proof. If det(R) = 0 then from the definition of third invariant (see Lecture-5) we can
have [Rx, Ry, Rz] = 0 for arbitrary vectors x, y, z. Therefore, the vectors Rx, Ry, Rz
are linearly dependent i.e., there exist scalars α, β, γ, not all zero such that
Equation (3) implies det(R) = 0 as the vectors u, v and w are linearly independent.
Thus, Ru = 0 for non-zero u ∈ V ⇐⇒ det(R) = 0.
Let u, v and w be linearly independent vectors. Then, using the definition of third
invariant, we can express Eq. (4) as
where I1 , I2 and I3 are principal invariants of tensor T . Since u, v, and w are linearly
independent, we get
λ3 − I1 λ2 + I2 λ − I3 = 0. (5)
The polynomial of λ shown in preceding equation is known as characteristic polynomial.
We note that all principal invariants are real as tensor components are real in any co-
ordinate frame. In other words, the coefficients of characteristic polynomial are real.
Thus, the roots of characteristic equation are either real or complex conjugates. Conse-
quently, the cubic equation shown in Eq. (5) has at least one real root. In other words,
every second-order tensor defined over three dimensional space possess at least one real
eigenvalue.
I1 = tr(T ) = λ1 + λ2 + λ3
I2 = tr(cof T ) = λ1 λ2 + λ2 λ3 + λ3 λ1 (6)
I3 = det(T ) = λ1 λ2 λ3
Numerically, these principal invariants do agree with our earlier definition of principal
invariants.
P (T ) = O,
where P (λ) = λ3 −I1 λ2 +I2 λ−I3 is the characteristic polynomial and I1 , I2 , I3 are principal
invariants of tensor T .
T u1 = λ1 u1 , T u2 = λ2 u2 , T u3 = λ3 u3 . (7)
v = α1 u1 + α2 u2 + α3 u3 , α1 , α2 , α3 ∈ <.
Since the eigenvectors u1 , u2 and u3 are linearly independent and form a basis to vector
space. Consequently, (T − λ1 I)(T − λ2 I)(T − λ3 I)v = 0 is true for every vector v ∈ V.
Therefore, we have
(T − λ1 I)(T − λ2 I)(T − λ3 I) = O
3 2
T − (λ1 + λ2 + λ3 )T + (λ1 λ2 + λ2 λ3 + λ3 λ1 )T − (λ1 λ2 λ3 )I = O
T 3 − I1 T 2 + I2 T − I3 I = O (using Eq. (6)).
Hence, the theorem is proved if eigenvectors form a basis to the vector space irrespective
of whether eigenvalues are repetitive or not. This proof can be extended easily to any
second-order tensor defined over n-dimensional vector space where eigenvectors form a
basis to the vector space.
The tensor is said to be “defective” or “non-diagonalizable” if the eigenvectors of tensor
do not form basis to the vector space. The Cayley-Hamilton theorem still holds for the
defective tensors but the proof presented above does not take care of defective tensors.
Reader can refer any standard book on linear algebra for the general proof.
rotation but scaling. We note that scaling includes multiplication with negative number
too. In general, any resultant vector that is collinear with original vector is considered as
scaling. For example u and −u are scaled vectors and not considered as rotated vectors.
Thus, geometrically, the eigenvectors or eigen-directions are the directions that do not
experience rotation under the action of tensor. Furthermore, the scale factors of unro-
tated directions represents corresponding eigenvalues. We present two simple examples
to understand this geometric interpretation.
Example 1. Let us consider the following matrix form of a rotation tensor R in canonical
basis. Show that the eigenvalues and eigen vectors are complex as every direction is
undergoing rotation. " √ √ #
1/√2 −1/√ 2
R=
1/ 2 1/ 2
Solution: Given tensor rotates every vector in the plane by π/4 angle as shown in Fig.
(1). Therefore, geometrically no vector exists without undergoing rotation. Let λ be an
eigenvalue. Then we get
det(R − λI) = 0
√ " √ #
1/ 2√− λ −1/ √ 2
=⇒ det = 0
1/ 2 1/ 2 − λ
√
=⇒ λ2 − 2λ + 1 = 0
1 1 1 1
=⇒ λ = √ + i √ or √ − i √
2 2 2 2
√
where i is the complex number −1. We now have following complex conjugate pair of
eigenvalues (λ1 and λ2 ) and corresponding eigenvectors (v1 and v2 ):
√i − √i2
( ) ( )
1 1 1 1
λ1 = √ + i √ , v1 = √1
2 , λ2 = √ − i √ , v2 = √1
2 2 2 2 2 2
Since we are getting complex eigenvectors it means that no real eigenvector exists. In
other words, there is no direction which is not undergoing rotation. It is consistent with
the geometric depiction of tensor action in Fig. (1).
e1 = (1,0)
e2 = (0,1) Ru1
u2 u1
R Re 2 e2 Re 1
e2
-e 1 e1
Ru2 e1
-e 2
-Re 1 -Re 2
Example 2. Consider the following shear transformation tensor in canonical basis. Show
that the eigenvectors make an angle of π/4 with basis.
" √ #
3/2 √1/2
S=
1/2 3/2
Solution: Geometrically given shear transformation tensor and canonical basis e1 = (1, 0)
and e2 = (0, 1) are shown in Fig. (2). We get the following characteristic equation by
considering the eigenvalue problem of given tensor.
e1 = (1,0)
u2 e2 = (0,1)
u1 e2 Se2 Su1
e2 S Su2
-e 1 e1 Se1
-e 2 e1
-Se1
-Se2
√
3/2 − λ √ 1/2
det(S − λI) = =0
1/2 3/2 − λ
q
2
=⇒ λ − (3)λ + 1/2 = 0
√ √
3+1 3−1
=⇒ λ = or
2 2
Eigenvalues (λ1 and λ2 ) and corresponding eigenvectors (u1 and u2 ) are
√ ( 1 ) √
√1
( )
3+1 √ 3 − 1 −
λ1 = , u1 = √12 , λ2 = , u2 = √1
2
2 2 2 2
The basis vectors and eigenvectors are shown in Fig. (2). The angles between eigenvectors
and basis are as follows:
√
cos(θu1 e1 ) = u1 · e1 = 1/ 2
=⇒ θu1 e1 = π/4
√
cos(θu2 e2 ) = u2 · e2 = 1/ 2
=⇒ θu2 e2 = π/4
It is clear from geometric picture shown in Fig. (2) and also algebraic procedure that the
angle between eigenvectors and canonical basis is π/4.
Newton identities:
Newton identity is a recursive formula for obtaining principal invariants of any second-
order tensor T defined over n-dimensional vector space. Let principal invariants are
Ik , k = 1, 2, · · · , n then recursive formula is given by
(−1)k+1
Ik = [trT k − I1 trT k−1 + · · · + (−1)k−1 Ik−1 trT ]
k
k−1
(−1)k+1
" #
tr T k + (−1)i Ii T k−i
X
= (8)
k i=1
I1 = tr(T )
1 1
I2 = − (tr(T 2 ) − I1 tr(T )) = ((trT )2 − tr(T 2 )) (9)
2 2
1 1
I3 = (tr(T ) − I1 tr(T ) + I2 tr(T )) = ((trT )3 − 3tr(T )tr(T 2 ) + tr(T 3 ))
3 2
3 6
Exponential tensor:
Exponential tensor can be defined in terms of series expansion or in terms of solution
to an ordinary differential equation. Here we define exponential of tensor through the
series expansion. In forthcoming discussion on tensor calculus, we show an alternative
definition. The exponential of tensor A is defined by
1 22 1 33
eAt := I + At + A t + A t + ··· (10)
2! 3!
We can easily verify by considering an example that taking exponential to every com-
ponent of tensor does not give the exponential of tensor. Furthermore, just computing
sufficiently large powers of tensor and taking summation as shown in Eq. (10) can provide
an approximation but not the exact components of tensor. We now show using eigenval-
ues and eigenvectors that closed form can be obtained for the exponential tensor. If the
eigenvectors of tensor form a basis to the vector space then we have,
A = P ΛP −1
where P is an invertible matrix with linearly independent eigenvectors as its columns and
Λ is a diagonal matrix with corresponding eigenvalues on the diagonal. From preceding
equation we get
A2 = P Λ2 P −1 , A3 = P Λ3 P −1 , · · ·
Substituting preceding terms in Eq. (10), we get
1 1
eAt = P P −1 + P ΛP −1 t +P Λ2 P −1 t2 + P Λ3 P −1 t3 + · · ·
2! 3!
1 22 1 33
= P (I + Λ + Λ t + Λ t + · · · )P −1
2! 3!
Λt −1
= Pe P
Since the product of diagonal matrix is diagonal, we can get following form for the eΛt ,
e λ1 t 0 ···
0
Λt
0 e λ2 t ··· 0
e = .. .. .. ..
. . . .
0 0 · · · eλn t
Therefore, if we know eigenvalues and eigenvectors then the exponential of tensor can
be computed exactly using above relations. In forthcoming discussion, we show that
exponential of a skewsymmetric is rotation.
One can define exponential transformation for defective tensors where eigenvectors are
not sufficient to form basis to the vector space. Here we do not present the exponential
of defective tensors as they require the definition of Jordan form. We note that the eigen-
vectors and eigenvalues not only play an important role in exponential of diagonalizable
tensor but also in exponential of defective tensors.
References
Choosing u = v, we have
(u, W u) = −(u, W u)
=⇒ (u, W u) = 0, ∀u ∈ V (2)
From Eq. (2), we conclude that the vector W u is either orthogonal to the vector u or
W u = 0. Choosing u = ei and v = ej in Eq. (1), we get
Wij = −Wji .
Thus, any skewsymmetric tensor defined over three dimensional space has three indepen-
dent components.
W u = w × u, ∀u ∈ V (4)
Since the determinant of tensor W is zero there must exist at least one zero eigenvalue.
Let p be a unit eigenvector corresponding to zero eigenvalue i.e., W p = 0. Consider
q and r are two mutually orthogonal unit vectors such that they are orthogonal to the
vector p. Thus, the set {p, q, r} forms an orthonormal basis to the vector space V. Since
W is a second-order tensor we can have the following representation.
Since the set {p, q, r} is an orthonormal basis, the components of W can be written as
Using Eq. (2), we get (p, W p) = (q, W q) = (r, W r) = 0. The relation W p = 0 give
rise to (q, W p) = (r, W p) = 0. In addition, using skewsymmetric nature of tensor W ,
we have
In summary, the components of tensor in orthonormal basis {p, q, r}: W11 = (p, W p) =
0, W12 = −W21 = (p, W q) = 0, W13 = −W31 = (p, W r) = 0, W22 = (q, W p) = 0,
W33 = (r, W r) = 0, W23 = (q, W r) = −(r, W q) = −W32 . If W is non-zero then W23
and W32 are only non-zero components. Furthermore, W23 = −W32 . Substituting these
components in Eq. (6), we obtain
W = W23 (q ⊗ r − r ⊗ q)
W u = W23 (q ⊗ r − r ⊗ q)u
= W23 ((r · u)q − (q · u)r)
= W23 ((r × q) × u)
Let w = γp then
W u = w × u, ∀u ∈ V.
Thus, there is an axial vector w corresponding to every skewsymmetric tensor W .
Conversely, if the vector w is given then we need to show that there exists a corre-
sponding skewsymmetric tensor W . For every vector u ∈ V there is a vector v such
that
w × u = v.
There is an output vector for every input vector and hence, the operation with given
vector is a transformation from vector space to vector space. The linearity of the preceding
operation follows from the distributive property of cross product.
w × (αu + βv) = α(w × u) + β(w × v), ∀u, v ∈ V and ∀α, β ∈ <
Therefore, the cross product with a given vector w is a linear transformation. Conse-
quently, there should exist a second-order tensor corresponding to the operation. Let W
be a second-order tensor such that
W u = w × u, ∀u ∈ V.
Writing the preceding equation in indicial notation, we get
(W u)i = (w × u)i
Wij uj = ikj wk uj
Wij uj = −ijk wk uj
Arbitrariness of vector u implies Wij = −ijk wk . It is easy to see that Wij = −Wji . There-
fore, the second-order tensor W is a skewsymmetric tensor. Thus, there is a skewsym-
metric tensor W associated with a given vector w.
Using principal invariants, we get the following characteristic equation for the tensor W .
λ3 + |w|2 λ = 0 (11)
where λ is the eigenvalue. This cubic equation has one zero root and two purely imaginary
√
roots. If i is imaginary number −1 then the eigenvalues can be expressed as λ1 = 0,
λ2 = i|w| and λ3 = −i|w|. It can be verified that eigenvalues satisfy the characteristic
equation. Geometrically, there are no real eigenvectors for the skewsymmetric tensor
except the axial vector. Furthermore, the axial vector also maps to the zero vector as it
is corresponding to zero eigenvalue. In conclusion, the skewsymmetric tensor W assumes
a diagonal form if the field is extended to the complex numbers.
Skewsymmetric tensor in higher dimensional vector space:
Let us consider a skewsymmetric tensor in n-dimensional vector space. The following
property is true for any dimension,
(u, W u) = 0, ∀u ∈ V
From the geometrical interpretation, all eigenvalues are either complex numbers or zeros.
In other words either the directions undergo orthogonal rotation or the directions have
zero scale factors.
It is easy to see that any skewsymmetric tensor defined over n-dimensional vector space
has n(n − 1)/2 independent components. Consequently, the skewsymmetric tensor can
not be associated with a vector in the vector space other than three dimensional space.
Thus, three dimensional space has special property that every skewsymmetric tensor can
be associated with a unique vector and vice versa.
Reference
The orthogonal tensors appear in rotation transformation and also in transformation rule
of tensors. Hence, orthogonal tensors play an important role in continuum mechanics.
We now discuss the representation and properties of orthogonal tensors.
Definition of orthogonal tensor:
A second-order tensor Q is said to be orthogonal if QT = Q−1 , i.e.,
QT Q = QQT = I, (1)
where I is the identity tensor. We show in the following problem that length of vector
and angle between any two vectors are preserved under the action of orthogonal tensor,
i.e., inner product is preserved.
Problem 1. A tensor Q is orthogonal if and only if it preserves the inner product, i.e.,
(Qu, Qv) = (u, v), ∀u, v ∈ V. (2)
It is easy to see that the orthogonal tensor preserves the length of vector by choosing
u = v in Eq. (2), i.e.,
|Qu| = |u|. (3)
Using Eq. (1), we get
det(QT Q) = det(I)
=⇒ det(QT )det(Q) = 1 (Since det(RS) = det(R)det(S))
=⇒ (det(Q))2 = 1 (Since det(T T ) = det(T ))
det(Q) = ±1 (4)
The rotation tensor preserves the length of vector and angle between vectors as shown in
Eqs. (2) and (3). We now show that the rotation tensor preserves the area of parallelogram
formed by two vectors.
Proof. det(Q) = 1 for the given rotation tensor. Using the relation between cofactor and
determinant, we get
cof Q = (det(Q))Q−T = Q.
Since the inverse of transpose is equal to the transpose of inverse for any arbitrary in-
vertible second-order tensor, i.e., (QT )−1 = (Q−1 )T , we use the notation to indicate Q−T
either cases. Note that this notation used in the above equation. From the definition of
cofactor, we have
Problem 3. Show that any rotation tensor Q defined over three dimensional space always
possesses a unit eigenvalue.
Proof. Since Q is rotation tensor, we have QT Q = I and also det(Q) = 1. Using the
relation between orthogonal and identity tensors, we get
Comparing with eigenvalue problem, we conclude that there is a unit eigenvalue to the
rotation tensor. The eigenvector corresponding to the unit eigenvalue is indeed the axis
of rotation as the direction and length of vector are preserved.
Geometrically, we know that rotation of any (rigid) body takes place about a fixed
axis known as axis of rotation (see Fig. 1). As previously mentioned, the axis of rotation
indeed the eigenvector corresponding to a unit eigenvalue. In fact, the rotation tensor can
be represented using axis of rotation which is discussed in the following problem.
Rigid body
Axis of rotation
Problem 4. Show that any rotation tensor Q can be represented by the following form,
where p is the eigenvector corresponding to the unit eigenvalue, q and r are two orthonor-
mal vectors such that the set {p, q, r} is an orthonormal basis to the vector space V, and
α, β are two scalars such that α2 + β 2 = 1.
Q11 = p · Qp, Q12 = p · Qq, Q13 = p · Qr, Q21 = q · Qp, Q22 = q · Qq,
Q23 = q · Qr, Q31 = r · Qp, Q32 = r · Qq, Q33 = r · Qr. (8)
Using inner product preserving nature of orthogonal tensor (i.e., using Eq. (2)), we get
p·q =0 =⇒ Qp · Qq = 0 =⇒ p · Qq = 0, (10)
p·r =0 =⇒ Qp · Qr = 0 =⇒ p · Qr = 0. (11)
The above relations (i.e., p·Qq = 0 and p·Qr = 0) show that Qq and Qr are orthogonal
to the vector p. Therefore, the vectors Qq and Qr are linear combination of q and r,
i.e.,
Qq = α1 q + β1 r and Qr = α2 q + β2 r, (12)
q · r = 0 =⇒ Qq · Qr = 0,
α1 α2 + β1 β2 = 0. (13)
α2 (α1 α2 + β1 β2 ) + β2 (α1 β2 − β1 α2 ) = β2
α1 α22 + α2 β1 β2 + α1 β22 − α2 β1 β2 = β2
α1 (α22 + β22 ) = β2
α 1 = β2 (Using Eq. (16)) (17)
β2 (α1 α2 + β1 β2 ) − α2 (α1 β2 − β1 α2 ) = −α2
α1 α2 β2 + β1 β22 − α1 α2 β2 + β1 α22 = −α2
β1 (α22 + β22 ) = −α2
β1 = −α2 (Using Eq. (16)). (18)
Let us choose α1 ≡ α and β1 ≡ β. Then Eqs. (12), (17) and (18) imply
q · Qq = α, r · Qr = α, q · Qr = −β and r · Qq = β. (19)
Substitution of Eqs. (9), (10), (11) and (19) in Eq. (8), yields
Q11 = 1, Q12 = 0, Q13 = 0, Q21 = 0, Q22 = α, Q23 = −β, Q31 = 0, Q32 = β, Q33 = α.
(20)
With substitution of these components (i.e., Eq. (20)) in Eq. (7) give rise the final form
of rotation tensor shown in Eq. (6).
References
Proof. We have the following representation for rotation tensor Q from Problem-4 of
Lecture-8.
Q = p ⊗ p + α(q ⊗ q + r ⊗ r) + β(r ⊗ q − q ⊗ r), (2)
where α2 + β 2 = 1, p is the eigenvector corresponding to unit eigenvalue (i.e., the axis
of rotation) and the set {p, q, r} is the orthonormal basis. Since p is the axial vector of
skewsymmetric tensor W , we have the following form (see Eq. (10) in Lecture-7).
W = r ⊗ q − q ⊗ r. (3)
W 2 = (r ⊗ q − q ⊗ r)(r ⊗ q − q ⊗ r)
= (r ⊗ q)(r ⊗ q) − (q ⊗ r)(r ⊗ q) − (r ⊗ q)(q ⊗ r) + (q ⊗ r)(q ⊗ r)
= (q · r)r ⊗ q − (r · r)q ⊗ q − (q · q)r ⊗ r + (r · q)q ⊗ r
To obtain the last step we use the relation (a ⊗ b)(c ⊗ d) = (b · c)(a ⊗ d)) (see Module-1:
Tensor Algebra exercise 2(ii)). Since q and r are orthonormal vectors, we can write
W 2 = −(q ⊗ q + r ⊗ r) (4)
Proof. Since W is a second-order tensor, using definition of exponential tensor (see Eq.
(10) in Lecture-6), we get
1 2 2 1 3 3 1 4 4 1 5 5 1 6 6
eW θ = I + W θ + W θ + W θ + W θ + W θ + W θ + ··· (6)
2! 3! 4! 5! 6!
We know from Lecture-7 that the characteristic equation for the skewsymmetric tensor is
λ3 + |w|λ = 0. Using Cayley-Hamilton theorem, we get
W 3 = −W
as the magnitude of axial vector is unit, i.e., |w| = 1. Therefore, we can have the higher
powers of W
W 4 = −W 2 , W 5 = W , W 6 = W 2 , · · ·
Substituting higher powers in exponential tensor, we obtain
1 3 1 5 1 7 1 1 1 1
eW θ = I + (θ − θ + θ − θ + · · · )W + ( θ2 − θ4 + θ6 − θ8 + · · · )W 2
3! 5! 7! 2! 4! 6! 8!
1 2 1 4 1 6 2
= I + sin θ W + (1 − (1 − θ + θ − θ + · · · ))W
2! 4! 6!
2
= I + sin θ W + (1 − cos θ)W
T
To show orthogonality of exponential tensor, it is sufficient to prove eW θ eW θ = I.
T
eW θ eW θ = (I + sin θ W + (1 − cos θ)W 2 )(I + sin θ W + (1 − cos θ)W 2 )T
= (I + sin θ W + (1 − cos θ)W 2 )(I − sin θ W + (1 − cos θ)W 2 )
= I + sin θ W + (1 − cos θ)W 2 − sin θ W − sin2 θ W 2 − (1 − cos θ)sin θ W 3
+(1 − cos θ)W 2 + sin θ (1 − cos θ)W 3 + (1 − cos θ)2 W 4
= I + 2(1 − cos θ)W 2 − sin2 θ W 2 − (1 − cos θ)2 W 2 (since W 4 = −W 2 )
= I + 2(1 − cos θ)W 2 − sin2 θ W 2 − (1 + cos2 θ − 2cos θ)W 2
= I + (1 − sin2 θ)W 2 − cos2 θ W 2
= I
Problem 3. Show that first invariant (trace) and second invariant (trace of cofactor) of
rotation tensor Q are equal to 1 + 2cos θ if the angle of rotation is θ.
Proof. Let p be the axis of rotation tensor Q. Let the set of vectors {p, q, r} be an
orthonormal basis. Then, using Eq. (2), we can write the rotation tensor
where θ is an angle of rotation. Using the definition of trace (see Lecture-5), we can write
Using the representation of rotation tensor shown in Eq. (7), the following relations can
be obtained.
Thus, the lower and upper bounds on first and second invariants are -1 and 3, i.e., −1 ≤
I1 = I2 ≤ 3.
The orthogonal tensor not only useful in representing the rotation but also useful in
tensor transformation law. In fact the direction cosines that appear in tensor transforma-
tion are components of an orthogonal tensor. Following problem addresses the orthogonal
tensor that can transform one orthogonal basis to another orthogonal basis.
Problem 4. Let {e01 , e02 , e03 } and {e∗1 , e∗2 , e∗3 } be two sets of orthonormal basis to the vector
space V. In addition, both sets represents right hand coordinate system, i.e., e01 × e02 = e03
and e∗1 × e∗2 = e∗3 . Then the following tensor Q is a proper orthogonal tensor.
Proof. {e01 , e02 , e03 } and {e∗1 , e∗2 , e∗3 } are given sets of two orthonormal basis to the vector
space V. Then we want to show that the following second-order is an orthogonal tensor.
QT Q = (e01 ⊗ e∗1 + e02 ⊗ e∗2 + e03 ⊗ e∗3 )(e∗1 ⊗ e01 + e∗2 ⊗ e02 + e∗3 ⊗ e03 )
= (e01 ⊗ e∗1 )(e∗1 ⊗ e01 ) + (e01 ⊗ e∗1 )(e∗2 ⊗ e02 ) + (e01 ⊗ e∗1 )(e∗3 ⊗ e03 )
+(e02 ⊗ e∗2 )(e∗1 ⊗ e01 ) + (e02 ⊗ e∗2 )(e∗2 ⊗ e02 ) + (e02 ⊗ e∗2 )(e∗3 ⊗ e03 )
+(e03 ⊗ e∗3 )(e∗1 ⊗ e01 ) + (e03 ⊗ e∗3 )(e∗2 ⊗ e02 ) + (e03 ⊗ e∗3 )(e∗3 ⊗ e03 )
= e01 ⊗ e01 + e02 ⊗ e02 + e03 ⊗ e03
= I.
We can also show {e01 , e02 , e03 } = {Qe∗1 , Qe∗2 , Qe∗3 }. Since,
Q = QI
= Q(e∗1 ⊗ e∗1 + e∗2 ⊗ e∗2 + e∗3 ⊗ e∗3 )
= (Qe∗1 ) ⊗ e∗1 + (Qe∗2 ) ⊗ e∗2 + (Qe∗3 ) ⊗ e∗3 )
= e01 ⊗ e∗1 + e02 ⊗ e∗2 + e03 ⊗ e∗3 .
The given basis {e01 , e02 , e03 } and {e∗1 , e∗2 , e∗3 } are right hand coordinate system, i.e., [e01 , e02 , e03 ] =
1 and [e∗1 , e∗2 , e∗3 ] = 1. Since [e01 , e02 , e03 ] = [Qe∗1 , Qe∗2 , Qe∗3 ], by definition of determinant,
we get det(Q) = 1. Thus, Q is a rotation tensor or proper orthogonal tensor.
Transformation law:
Let {e1 , e2 , e3 } and {e01 , e02 , e03 } be two orthonormal basis. Then the following proper
orthogonal tensor can be constructed with help of both basis.
On the other hand, we can also represent ei in the basis {e01 , e02 , e03 } as
Let vi0 be ith component of vector v in the basis {e01 , e02 , e03 } and vj be j th component of
vector v in the basis {e1 , e2 , e3 }. Then we have the following transformation law for the
components of vector.
Similarly, let Tij0 be ij th component of second-order tensor T in the basis {e01 , e02 , e03 } and
Tmn be mnth component of tensor T in the basis {e1 , e2 , e3 }. Then the transformation
law for the components of tensor
Tij0 = e0i · T e0j = (Qim em ) · T (Qjn en ) = Qim Qjn em · T en = Qim Qjn Tmn
Thus, the proper orthogonal tensor play an important role in transformation of tensors.
We will be discussing another important class of second-order tensors called symmetric
tensors in the next lecture.
Reference
Problem 1. Show that all eigenvalues of symmetric tensor S are real numbers.
Proof. Extending the inner product over complex number field, we can define (u, v) =
ui v̄i , where v̄i is the complex conjugate of vi (ith component of vector v), and ui is the
ith component of vector u. It is easy to see that the positive definite property of inner
product is followed by the new definition, i.e., if u = v then (u, u) ≥ 0 for all vectors
in vector space. The new definition of inner product follows the conjugate symmetry
property, i.e., (u, v) = (v, u). It can also be easily verified that the inner product follows
linear property. Furthermore, if there is no imaginary part in the vectors, i.e., vectors
are defined over real field, then it reduces to the standard dot product. Using the new
definition of inner product, we get
Let λ and n be an eigenvalue and its corresponding eigenvector of given symmetric tensor
S, respectively. Then
λ(n, n) = (λn, n) = (Sn, n) = (n, S T n) = (n, Sn) = (n, λn) = λ̄(n, n).
λ = λ̄, since n is a non-zero vector, i.e., (n, n) > 0. If complex number and its conjugate
are equal then the imaginary part is zero. Thus, eigenvalues of symmetric tensor are real
numbers.
Proof. Let λ1 and λ2 be two distinct eigenvalues of second-order symmetric tensor S. Let
n1 and n2 be eigenvectors corresponding to λ1 and λ2 . Then we have
Sn1 = λ1 n1 ,
Sn2 = λ2 n2 .
The symmetric tensor can be divided into three classes based on the eigenvalues: (i) all
three eigenvalues are distinct (ii) two eigenvalues are equal and other one is distinct (iii)
all three eigenvalues are equal. There is convenient representation for all the cases of
symmetric tensor known as spectral decomposition. We now discuss the spectral decom-
position for all the cases.
Problem 3. If all three eigenvalues are distinct then the symmetric tensor S can be
represented by the following form,
S = λ1 n1 ⊗ n1 + λ2 n2 ⊗ n2 + λ3 n3 ⊗ n3 , (1)
Proof. We have shown in previous problem that the eigenvectors corresponding to dis-
tinct eigenvalues are orthogonal. Let n1 , n2 and n3 be unit eigenvectors then the set
{n1 , n2 , n3 } forms an orthonormal basis to the vector space. Since S is a second-order
tensor, it has the following representation,
S = λ1 p ⊗ p + λ2 (I − p ⊗ p). (3)
The symmetric tensor S can have the general representation in the basis {n1 , n2 , n3 } as
shown in Eq. (2). Since n2 is an eigenvector corresponding λ2 , we can write Sn2 = λ2 n2 .
It is easy to see the relations S11 = n1 · Sn1 = λ1 , S22 = n2 · Sn2 = λ2 , S12 = n1 · Sn2 =
n1 · λ2 n2 = 0 and S21 = n2 · Sn1 = S T n2 · n1 = Sn2 · n1 = S12 . Similarly, we also have
S13 = S31 = n3 · Sn1 = n3 · λ1 n1 = 0 and S23 = S32 = n3 · Sn2 = n3 · λ2 n2 = 0
S = λ1 p ⊗ p + λ2 q ⊗ q + S33 n3 ⊗ n3 .
Since the first invariant is equal to sum of eigenvalues. Hence, we get S33 = λ2 . Substi-
tuting S33 in above equation, we obtain
S = λ1 p ⊗ p + λ2 q ⊗ q + λ2 n3 ⊗ n3
We can easily see that not only p and q are eigenvectors but also n3 is an eigenvector.
Furthermore, the linear combination of q and n3 is also an eigenvector. Therefore, we
have two eigensubspaces: (i) subspace generated by vector p, (ii) orthogonal plane to p.
Replacing q with n2 in preceding equation, we get
S = λ1 p ⊗ p + λ2 (n2 ⊗ n2 + n3 ⊗ n3 ).
Problem 5. If all eigenvalues are equal then we have following representation for sym-
metric tensor S.
S = λI, (4)
where λ is the eigenvalue and I is the identity tensor.
Proof. Let λ1 , λ2 and λ3 be eigenvalues. Since eigenvalues are equal, we choose a scalar λ
such that λ ≡ λ1 = λ2 = λ3 . Let p be an unit eigenvector corresponding to the eigenvalue
λ, i.e., Sp = λp. Let {n1 , n2 , n3 } be an orthonormal basis such that n1 = p. The general
representation of S in the basis {n1 , n2 , n3 } is shown in Eq. (2).
Similarly, S13 = S31 = n3 · Sn1 = n3 · λn1 = 0 and S23 = S32 . Substituting these
coefficients in general form, we get
We know that all components of tensor S are real. Therefore, equation implies S23 = 0
and S22 = S33 . From the trace of tensor, we conclude that S22 = S33 = λ. Substitution
of the coefficients in the general form yields
Above three problems can be summarized into the following spectral decomposition of
symmetric tensor.
Proof. Problems (3), (4) and (5) show the existence of at least an orthonormal set of
vectors such that they are eigenvectors of symmetric tensor S. Let e∗1 , e∗2 and e∗3 be
three orthonormal eigenvectors. Then we have the following result from the preceding
problems.
S = λ1 e∗1 ⊗ e∗1 + λ2 e∗2 ⊗ e∗2 + λ3 e∗3 ⊗ e∗3 ,
The unique representation of S is shown in previous three problems for the three different
cases.
Problem 7. There is an orthogonal tensor Q such that every symmetric tensor S assumes
diagonal form (i.e., matrix form of S is diagonal). The orthogonal transformation is given
by Q = ei ⊗e∗i , where {ei } is canonical basis and {e∗i } is another set of orthonormal basis.
Proof. Let us consider the form of S as shown in preceding problem of spectral decom-
position, i.e.,
3
λi e∗i ⊗ e∗i ,
X
S=
i=1
Let {ei }, i = 1, 2, 3, be the canonical basis. Then we can define an orthogonal tensor (see
Problem-4 in Lecture-9)
Q = ei ⊗ e∗i .
such that it transforms the orthogonal basis {e∗1 , e∗2 , e∗3 } to the other orthogonal basis
{e1 , e2 , e3 }. It can also be represented by Qe∗i = ei . Upon applying this transformation
rule to Eq. (6), we get
Conversely, for given eigenvalues and eigenvectors, we can recover symmetric matrix S.
Let Λ be the diagonal matrix with eigenvalues on the diagonal. Let Q be the orthogonal
matrix with orthonormal eigenvectors of S as its rows. Then the symmetric tensor is
S = QT ΛQ. Thus, recovering the symmetric tensor.
Proof. Let S be a symmetric tensor. Then it is enough to show that the symmetric tensor
S satisfies the following (characteristic) equation.
S 3 − I1 S 2 + I2 S − I3 = O (8)
Substituting these relations on the left hand side of Eq. (8), we get
P (S) = (λ31 − I1 λ21 + I2 λ1 − I3 )e∗1 ⊗ e∗1 + (λ32 − I1 λ22 + I2 λ2 − I3 )e∗2 ⊗ e∗2
+(λ33 − I1 λ23 + I2 λ3 − I3 )e∗3 ⊗ e∗3
= O.
The last step follows from the characteristic equation. We note that the eigenvalues are
roots of characteristic equation, i.e., λ3i − I1 λ2i + I2 λi − I3 = 0 for i = 1, 2, 3. Thus, the
theorem is proved.
We can prove an interesting result that eigenvector extremizes the quantity n · Sn,
where n is a unit vector and S is a symmetric tensor. Furthermore, the extremum value
is an eigenvalue. The proof of this result is presented in the properties of stress tensor
which will be discussed later.
Example 1. Consider following matrix representation of shear transformation S. Find
the eigenvalues and eigenvectors of shear transformation. Show that the eigenvectors are
mutually orthogonal and graphically depict the extremum directions.
" √ #
3/2 1/2
S= √
1/2 3/2
Solution: Given shear transformation is a symmetric tensor and it can be easily observed
from above matrix representation. The given shear transformation can be graphically
represented as shown in Fig. (1).
e1 = (1,0)
u2 e2 = (0,1)
u1 e2 Se2 Su1
e2 S Su2
-e 1 e1 Se1
-e 2 e1
-Se1
-Se2
It is easy to see that eigenvectors are mutually orthogonal as u1 · u2 = 0. This can also
be observed from the Fig. (1) that the eigenvectors are mutually orthogonal and u1 · Su1
is maximum and u2 · Su2 is minimum. We note that the saddle point may also exist as
an extremum in three-dimensional case along with maximum and minimum.
Reference
Here, we discuss the positive definiteness and polar decomposition of second-order tensors.
The polar decomposition plays an important role in defining generalized strain and also
in the applications of principle of material frame-indifferent.
Definition of positive definite tensor:
A second-order tensor S is said to be positive definite if
Any second-order tensor T can be uniquely decomposed into a symmetric part Ts , and a
skew-symmetric part Tss , i.e., T = Ts + Tss where Ts = 12 (T + T T ) and Tss = 21 (T − T T )
(see Lecture-4). Therefore, we have
Problem 1. Given a symmetric tensor S is positive definite if and only if all eigenvalues
are positive.
Proof. Let us assume that the given tensor S is positive definite tensor. Then we have
Let λ and n be an eigenvalue and its corresponding eigenvector of S. Then we have the
relation Sn = λn. Taking inner product with n on both sides give rise to
(n, Sn)
λ= (2)
(n, n)
Since (n, n) > 0 and S is positive definite tensor (i.e., (n, Sn) > 0), Eq. (2) implies
λ > 0. Thus, positive definite property of tensor S implies positive nature of eigenvalues.
Conversely, let all eigenvalues of symmetric tensor S be positive. Then using spectral
decomposition (see Problem-6 in Lecture-10), we can have
where λ1 , λ2 , λ3 are eigenvalues and e∗1 , e∗2 , e∗3 are corresponding orthonormal eigenvectors.
Let u be a non-zero vector. Then
where u∗1 , u∗2 , u∗3 are components vector in the basis {e∗1 , e∗2 , e∗3 }.
The positive property of eigenvalues implies (u, Su) > 0. Suppose, (u, Su) = 0 then
from preceding equation u∗i = 0, which implies that u = 0. Thus, S is positive definite
tensor if and only if all eigenvalues are positive.
Problem 2. All principal invariants of symmetric tensor S are positive if and only if all
eigenvalues are positive.
I1 = λ1 + λ2 + λ3
I2 = λ1 λ2 + λ2 λ3 + λ3 λ1
I3 = λ1 λ2 λ3
It is easy to see that if eigenvalues are positive then principal invariants are also positive.
Conversely, let principal invariants I1 , I2 and I3 be positive. The characteristic equation
of tensor S is given by
λ3 − I1 λ2 + I2 λ − I3 = 0,
where λ is an eigenvalue of tensor S. We know that eigenvalues of symmetric tensor are
real. Hence, if eigenvalue is negative then it cannot satisfy the characteristic equation as
the term λ3 − I1 λ2 + I2 λ − I3 is negative. Therefore, eigenvalue must be positive since the
principal invariants are positive. Thus, all principal invariants are positive if and only if
eigenvalues are positive.
Problem 3 (Square root tensor). Let S be a positive definite symmetric tensor. Then
2
there exist a unique positive definite symmetric tensor U such
√ that U = S. The tensor
U is called square root tensor of S. We usually write U = S.
Proof. If U is not positive definite symmetric (P sym) tensor then there can be many U
satisfying U U = S such that S is P sym. For example, the matrices diag[1, -1, 1] (i.e.
diagonal matrix with entities 1, -1, 1 on the diagonal) and diag[1, -1, -1] are non-positive
definite square roots of the identity tensor I. Therefore, in order to ensure uniqueness,
we stated in the problem that U is the positive definite symmetric tensor. We now show
the relation between the given positive definite symmetric tensor and its positive definite
square root. Consider the spectral decomposition of given symmetric tensor S.
where λ1 , λ2 and λ3 are eigenvalues and e∗1 , e∗2 and e∗3 are corresponding orthonormal
eigenvectors.
The eigenvalues are positive as S is a positive definite symmetric tensor. Therefore,
we can define a tensor
q q q
U= λ1 e∗1 ⊗ e∗1 + λ2 e∗2 ⊗ e∗2 + λ3 e∗3 ⊗ e∗3 . (3)
√ √
Since the set {e∗1 , e∗2 , e∗3 } is orthonormal, it is easy to see U U = S. Clearly, λ1 , λ2
√
and λ3 are eigenvalues and e∗1 , e∗2 and e∗3 are corresponding orthonormal eigenvectors
of U . Thus, there is a positive definite square root of S. We now show the uniqueness of
positive definite square root.
Let λ and n be the eigenvalue and corresponding eigenvector of S. Then we have
0 = (S − λI)n
= (U 2 − λI)n
√ √
= (U + λI)(U − λI)n.
√
Suppose (U − λI)n = ñ then we have
√
(U + λI)ñ = 0.
We have discussed the additive decomposition of arbitrary tensor into symmetric and
skew-symmetric parts. We now discuss the multiplicative decomposition of invertible
tensors given by the polar decomposition theorem.
F = QU = V Q, (4)
Proof. Since given tensor F is invertible, F T F is positive definite symmetric tensor, i.e.,
QT Q = (F U −1 )T (F U −1 )
= U −T F T F U −1
= U −1 (F T F )U −1 (since U is symmetric)
−1 −1
= U (U U )U
= I
Thus, Eq. (5) implies first part of polar decomposition F = QU , where Q is orthogonal
tensor and U is positive definite tensor.
We now show the uniqueness of the decomposition F = QU . Suppose F has second
polar decomposition such that F = Q̆Ŭ where Q̆ is also orthogonal tensor. We can
obtain F T F = Ŭ T Q̆T Q̆Ŭ = Ŭ 2 and we know F T F = U 2 . Hence, uniqueness of square
root tensor implies Ŭ = U and Q̆ = Q. Furthermore, The det(Q) and det(F ) must have
same sign as det(U ) > 0. Clearly, the tensor Q is rotation as det(F ) is positive.
T
We note that invertibility of F implies invertibility
√ of F . Therefore, similar to pre-
vious case we can define a square root tensor V = F F T since F F T is positive defi-
nite symmetric tensor. We now define an orthogonal tensor Q̃ = V −1 F and it implies
F = V Q̃. It is easy to see that Q̃ is an orthogonal tensor, i.e., Q̃T Q̃ = I. We can have
the relation F = Q̃(Q̃T V Q̃) since Q̃Q̃T = I. We now show that Q̃ = Q. Since F = QU
and F = V Q̃, we can write
QU = V Q̃ =⇒ U = QT V Q̃.
Since U and V are symmetric tensors, taking transpose on both sides of above equation,
we get
U = Q̃T V Q =⇒ Q̃U = V Q
Reference
It is clear from the definitions that there are no restrictions on the scalar variables of
isotropic functions. Hence, the representation of isotropic functions of vectors and tensors
are presented in the following discussion.
Example 1. Let φ be scalar-valued isotropic function of vector v. Then show that the
function can be represented by φ(v) = φ̃(v · v).
Solution: Let V be an inner product space. If there exists a vector u ∈ V then the inner
product guaranties the existence of a scalar function such that ψ(v) = u · v, ∀v ∈ V. We
now decompose the vector u into two orthogonal parts: one part is along the direction of
the vector v and other part is orthogonal to v. Hence, the vector u can be written as
(u · v)
u= v + (u · v ⊥ )v ⊥ ,
(v · v)
For any constant α the function ψ(v) is an isotropic function, i.e., ψ(Qv) = αQv · Qv =
α(QT Qv) · v = αv · v = ψ(v). Since ψ is an isotropic function, we now define a more
general isotropic function ψ̃(ψ(v)) = φ̃(v · v). It is easy to see that the function φ̃ is an
isotropic function. Thus, the function φ(v) = φ̃(v · v) is isotropic.
Solution Let {e∗1 , e∗2 , e∗3 } be orthonormal eigenvectors of tensor S and λ1 , λ2 , λ3 be the
corresponding eigenvalues. Then we have the spectral representation
i=1
3
λi (Qe∗i ) ⊗ (Qe∗i ))
X
= ψ(
i=1
If the function ψ(S) is isotropic then Q is arbitrary in the previous relation. Hence, we
can write ψ(QSQT ) = ψ̃(λ1 , λ2 , λ3 ) = ψ(S). We know that eigenvalues and principal
invariants have one to one relation. Therefore, we can define a function ψ̃(λ1 , λ2 , λ3 ) =
φ̃(I1 , I2 , I3 ). Furthermore, it is easy to verify that the function φ̃(I1 , I2 , I3 ) is isotropic as
det(S − λI) = det(QSQT − λI). Thus, the function φ(S) = φ̃(I1 , I2 , I3 ) is an isotropic
function.
Example 3. Prove that the scalar-valued isotropic function has representation φ(v, S) =
φ̃(v · v, v · Sv, v · S 2 v, I1 , I2 , I3 ) where I1 , I2 , I3 are invariants of symmetric tensor S and
v is a vector.
Solution: Examples (1) and (2) show that v · v is an invariant of vector v; I1 , I2 and I3
are invariants of tensor S. We can also create the following invariants as combination of
the tensor S and the vector v under every orthogonal transformation.
v · Sv, v · S 2 v, v · S 3 v · · · , v · S n v, · · ·
It is evident from Cayley-Hamilton theorem that not all invariants are independent. Using
Cayley-Hamilton theorem, we have
S 3 − I1 S 2 + I2 S − I3 I = O,
v · S 3 v − I1 v · S 2 v + I2 v · Sv − I3 v · Iv = 0
=⇒ v · S 3 v = I1 v · S 2 v − I2 v · Sv + I3 v · v
It is easy to verify that the function φ̃ is isotropic function, i.e., φ(Qv, QSQT ) = φ(v, S)
for every orthogonal tensor Q.
Solution: Let g(α) be a vector-valued isotropic function of scalar. Then, using the defini-
tion of isotropic function, we can have
The above relation shows that the vector g is an axis of the rotation tensor Q. Arbitrari-
ness of Q implies g(α) = 0. Thus, the isotropic vector-valued function of scalar variable
is zero.
Solution: If v = 0 then the isotropic function g(0) = Qg(0) for all orthogonal tensors Q.
This implies g(0) is an axis to every orthogonal tensor and hence, g(0) = 0. It is easy to
see that g(0) is isotropic function.
We now consider the vector v 6= 0 then we can write the vector g(v) as
where α(v) and β(v) are scalar functions and v ⊥ orthogonal vector to v. We now choose
the orthogonal tensor
(v ⊗ v)
Q=2 − I, (5)
(v · v)
where I is the identity tensor. Using Eqs. (4) and (5), we can write
Since g(v) is an isotropic function, equating above relations it is easy to see that β(v) = 0.
Furthermore, isotropy of g(v) implies
The above relation implies that α(v) is an isotropic function. Hence, based on Example-
1 the function α(v) can be represented by φ̃(v · v). Thus, the vector-valued isotropic
function of vector admits the representation g(v) = φ̃(v · v)v.
Let g(S) be a vector-valued isotopic function of second-order tensor S. Then, using the
definition of isotropic function, we have
g(QSQT ) = Qg(S)
=⇒ g(S) = Qg(S)
=⇒ g(S) = αp (Since p is the axis of Q)
g(S) = βq.
Any non-zero vector cannot be collinear with two orthogonal vectors. Therefore, the
isotropic function g(S) = 0. Conversely, it is easy to see that g(S) = 0 is an isotropic
function. Thus, the vector-valued isotropic function of symmetric tensor is zero.
Solution: Since given function G(α) is isotropic, using the definition of tensor valued
isotropic function, we can write
G(α) = QG(α)QT
=⇒ QG = GQ
Preceding relation shows that G commutes with every proper orthogonal tensor Q. Let
p be an axis of orthogonal tensor Q. Then we can have
We conclude from the preceding equation that Gp is a vector along the axis of Q and
hence, Gp = λp where λ is some scalar. Arbitrariness of Q implies Gp = λp for
every vector p ∈ V. Since whole vector space is an eigenspace for G, we can have the
representation G = λI where I is identity tensor.
Thus, the symmetric tensor-valued isotropic function of scalar has the representation
G(α) = λI.
Solution: If v = 0 then the case is similar to the previous example because G(0) =
QG(0)QT . This implies φ̃0 = λ where λ is some scalar and φ̃1 = 0 in the representation.
Let us consider v 6= 0. Let us define a vector valued function g(v) = G(v)v where
G(v) is given isotropic tensor valued function. We now show that isotropy of G implies
isotropy of g.
g(Qv) = G(Qv)Qv
= QG(v)QT Qv (Since Q is isotropic)
= QG(v)v
= Qg(v).
Furthermore, using spectral form of function G(v) and previously defined orthogonal
tensor Q, we get
α(Qv)
G(Qv) = (Qv) ⊗ (Qv) + β(Qv)q ⊗ q + γ(Qv)r ⊗ r
(Qv) · (Qv)
α(v)
= Q(v ⊗ v)QT + β(v)q ⊗ q + γ(v)r ⊗ r (using isotropy of α, β and γ)
(v · v)
α(v)
QGQT = Q(v ⊗ v)QT + β(v)Q (q ⊗ q) QT + γ(v)Q (r ⊗ r) QT
(v · v)
α(v)
= Q(v ⊗ v)QT + β(v)r ⊗ r + γ(v)q ⊗ q. (using Eqs. (6) and (7))
(v · v)
It is clear from the expressions G(Qv) and QGQT that isotropy of G implies β(v) = γ(v).
Therefore, we can write,
α(v)
G(v) = v ⊗ v + β(v)(q ⊗ q + r ⊗ r)
(v · v)
α(v) − β(v) 1
= v ⊗ v + β(v)( v ⊗ v + q ⊗ q + r ⊗ r)
(v · v) v·v
α(v) − β(v)
= β(v)I + v⊗v (8)
(v · v)
Since α(v) and β(v) are scalar-valued isotopic functions, using Example-1, we can have
the following representation.
α(v) − β(v)
β(v) = φ̃0 (v · v) and = φ̃1 (v · v).
(v · v)
Substituting the representation for β and (α − β)/(v · v) in Eq. (8), yields following final
form for symmetric tensor-valued isotropic function of vector
Proof. We first show that eigenvectors of S and G are same if G(S) is an isotropic
function. Let {e∗1 , e∗2 , e∗3 } and {λ1 , λ2 , λ3 } be a set of orthonormal eigenvectors and cor-
responding eigenvalues of symmetric tensor S. Then the spectral decomposition of S is
given by
S = λ1 e∗1 ⊗ e∗1 + λ2 e∗2 ⊗ e∗2 + λ3 e∗3 ⊗ e∗3 .
Choose a proper orthogonal tensor Q = 2e∗1 ⊗ e∗1 − I. The rotation tensor Q commutes
with S, i.e.,
Clearly, the tensors I, S and S 2 belong to the subspace spanned by {e∗1 ⊗ e∗1 , e∗2 ⊗ e∗2 , e∗3 ⊗
e∗3 }. We further know that following determinant of Vandermonde matrix is non-zero.
1 1 1
det λ 1 λ 2 λ 3
= (λ1 − λ2 )(λ2 − λ3 )(λ3 − λ1 ) 6= 0 (since λ1 6= λ2 , λ2 6= λ3 , λ3 6= λ1 )
λ21 λ22 λ23
Therefore, the set {I, S, S 2 } is linearly independent and hence, forms another basis to
the space generated by {e∗1 ⊗ e∗1 , e∗2 ⊗ e∗2 , e∗3 ⊗ e∗3 }. In other words, any tensor in the space
of {e∗1 ⊗ e∗1 , e∗2 ⊗ e∗2 , e∗3 ⊗ e∗3 } can be represented by linear combination of set {I, S, S 2 }.
Consequently, the tensor-valued function shown in Eq. (12) admits the following form.
where α0 (S), α1 (S) and α2 (S) are scalar-valued isotropic functions of S. It is easy to
see G is isotropic function. Furthermore, Example-2 shows that α0 (S), α1 (S) and α2 (S)
can have the following representation.
Taking dot product with e∗2 and e∗3 on both sides of preceding equation, we get µ2 = µ
and µ3 = µ, respectively. Therefore, G(S) has the following representation
Similar to previous case, we now show that {e∗1 ⊗ e∗1 , e∗2 ⊗ e∗2 + e∗3 ⊗ e∗3 } and {I, S} are
two sets of basis to two-dimensional subspace of tensors.
and " #
1 1
det = (λ1 − λ2 ) 6= 0 (since λ1 6= λ2 )
λ1 λ2
Therefore, {e∗1 ⊗ e∗1 , e∗2 ⊗ e∗2 + e∗3 ⊗ e∗3 } and {I, S} are two sets of basis to the two-
dimensional subspace of tensors. Consequently, we can have,
where α0 (S) and α1 (S) are scalar-valued isotropic function of S. Furthermore, Example-2
shows that α0 and α1 can have the following representation
Taking dot product with e∗1 , e∗2 and e∗3 on both sides of preceding equation, we get µ1 = µ,
µ2 = µ and µ3 = µ. Therefore, the tensor-valued function has the following representation,
We note that α0 (S) is a scalar-valued isotropic function of S. Therefore, the scalar func-
tion has the representation α0 (S) = φ0 (I1 , I2 , I3 ), where I1 , I2 , I3 are principal invariants
of S.
Conversely, it is easy to see that the function G(S) = φ0 (IS )I + φ1 (IS )S + φ2 (IS )S 2 is
an isotropic function. Thus, Eq. (9) shows the most general representation of symmetric,
isotropic tensor-valued function.
Proof. Though it is possible to specialize the previous result (i.e., Rivlin-Ericksen repre-
sentation theorem), we proceed directly to prove the result as it is easy. Let V1 be the
set of all unit vectors. We now choose a tensor e ⊗ e where e ∈ V1 . It is easy to see that
We can use the following identity in proving the preceding isotropic relation.
tr(QSQT ) = tr((QS)QT ) = tr((QS)T Q) (Since tr(RT T ) = tr(RT T ))
= tr((S T QT )Q) = tr(S T (QT Q)) = tr(S T )
= tr(S)
Thus, the linear symmetric tensor-valued isotropic function of symmetric tensor has the
representation G(S) = λ(tr S)I + 2µS, where λ and µ are constants.
References
Here, we discuss the representation of third-, fourth- and nth -order tensors. The fourth-
order tensors play an important role in constitutive relations as they relate stress with
strain or stress with rate of deformation. Similar to symmetry in case of second-order
tensors, some symmetries can be defined for fourth-order tensors that are discussed in
this lecture. Finally, we discuss an interesting concept called isotropic tensor.
Third-order tensors:
A third-order tensor B is a linear transformation from the vector space V to the space of
second-order tensors V 2 , i.e.,
It is easy to see that the input is a vector and output is a second-order tensor in the
operation. Furthermore, it is also easy to see the linearity property. Therefore, the object
u ⊗ v ⊗ w is a third-order tensor. This representation is useful in extracting components
of tensor once basis is known.
Representation of third-order tensor in component form:
Let {e1 , e2 , e3 } be a set of orthonormal basis to the vector space V. Then the third-order
tensor B is represented by
B = Bijk ei ⊗ ej ⊗ ek , (3)
where Bijk = ei · (Bek )ej .
The representation can be verified by the following procedure. We know from Eq. (1)
that the action of third-order tensor result in a second-order tensor. We now consider the
action of B on an arbitrary vector a, then we can write
Ba = (Ba)ij (ei ⊗ ej )
= {ei · ((Ba)ej )} (ei ⊗ ej )
= {ei · (B(ak ek )ej )} (ei ⊗ ej )
= ak {ei · ((Bek ) ej )} (ei ⊗ ej ) (Since B is linear transformation)
= {ei · ((Bek )ej )} (ek · a)(ei ⊗ ej ) (Since ak = ek · a)
= {ei · ((Bek )ej )} ((ei ⊗ ej ⊗ ek )a)
Clearly, there are 27 components in third-order tensor. In other words, the dimension
of space of third-order tensors is 27. Similar to third-tensor, the representation using
dyadic product can be extended to nth -order tensor.
(αT )x = α(T x) ∀x ∈ V,
Using these two operation (addition and scalar multiplication), we can easily show that
the collection of all nth -order tensors forms a vector space over the field of real numbers.
Furthermore, 3n components appear in nth -order tensor. Thus, the dimension of space of
nth -order tensors is 3n .
Third-order tensor:
Let Bijk ei ⊗ ej ⊗ ek and B 0 e0i ⊗ e0j ⊗ e0k are two representations of third-order tensor B.
Fourth-order tensors:
In continuum mechanics, fourth-order tensors play an important role along with second-
order tensors. For example, the tensor that relates stress and stain or stress and rate of
deformation can be represented by fourth-order tensors. We now present a convenient
definition of fourth-order tensor.
(T , CS) = (CT , S) ∀T , S ∈ V 2 .
(T , CS) = (T , Csym(S)) ∀T , S ∈ V 2 .
1
where sym(S) = S + ST .
2
Left minor symmetry: A fourth-order tensor C is said to have left minor symmetry if
Cijkl = Cklij .
Cijkl = Cijlk .
Cijkl = Cjikl .
We note the major and right minor symmetries of fourth-order tensor imply the left
minor symmetry. Similarly, the major and left minor symmetries also imply right minor
symmetry.
Isotropic tensors:
An isotropic tensor is defined as a tensor whose components are invariant under the change
of orthonormal basis, i.e.,
T = Ti1 i2 ···in ei1 ⊗ ei2 · · · ⊗ ein = Ti1 i2 ···in e0i1 ⊗ ie0i2 · · · ⊗ e0in ,
Therefore, the alternate tensor is a third-order isotropic tensor. Of course, scalar mul-
tiple of alternate tensor is also isotropic. We now want to show that any third-order
isotropic tensor should be scalar multiple of alternate tensor.
Let B = Bijk ei ⊗ ej ⊗ ek be an isotropic third-order tensor. The components of tensor
B are given by Bijk = ei ·(Bek )ej . Consider a rotation tensor Q = ek ⊗ek +ej ⊗ei −ei ⊗ej .
Since B is isotropic tensor, we get
0
Bijk = Bijk = (Qei ) · (BQek )(Qej )
= (ej ) · (Bek )(−ei ) (Since Qei = ej , Qej = −ei , Qek = ek )
= −Bjik
Bijk = −Bkji
Using preceding relations, we get Bjik = Bkji . Combining all the relations, we get Bijk =
Bjki = Bkij = −Bjik = −Bikj = −Bkji . Furthermore, we get Bijk = 0 if i = j 6= k or
i 6= j = k or j 6= i = k or i = j = k from the relations Bijk = −Bjik and Bijk = −Bkji .
Let B123 = λ then B = λ. Thus, third-order isotropic tensor should be in the form λ
where λ is a scalar.
where λ, µ and γ are scalars. The proof for this result can be found in a book by
Chandrasekharaiah and Debnath (1994).
References
In tensor calculus, we present differentiation and integration of scalar, vector and second-
order tensors. We will show in later discussion that the concept of differentiation and
integration play an important role in obtaining governing equations. We assume all func-
tions are sufficiently smooth to perform differentiation. In this lecture, we discuss the
concept of directional derivative (Gâteaux derivative) and Fréchet derivative.
The scalar, vector and tensor fields:
The term field is used to denote a function defined over the three-dimensional Euclidean
space <3 . The scalar field refers to a scalar-valued function φ : <3 → <. Let V be a vector
space. Then the vector field refers to a vector-valued function g : <3 → V. Similarly, the
second-order tensor field refers to a tensor-valued function G : <3 → V 2 .
The directional derivatives indeed known as partial derivatives if the directions are
chosen to be basis vectors. If the directions along the basis in rectangular three dimen-
sional Cartesian space, i.e., u1 = (1, 0, 0), u2 = (0, 1, 0) and u3 = (0, 0, 1), then the partial
∂φ ∂φ ∂φ
derivatives are denoted by , and . These partial derivatives can be obtained
∂x1 ∂x2 ∂x3
∂φ ∂φ ∂φ
by the relation = Dφ(x)[u1 ], = Dφ(x)[u2 ], = Dφ(x)[u3 ].
∂x1 ∂x2 ∂x3
The directional derivative for a vector field:
Similar to the scalar field, we can define directional derivative of vector fields. Let g be a
given vector field and u be a unit vector in <3 . Then the directional derivative of g along
u at a position x ∈ <3 is defined by
It can be observed that the directional derivatives of scalar-, vector- and tensor-valued
functions are scalar, vector and tensors, i respectively.
φ(T + U ) − φ(T ) d
Dφ(T )[U ] = lim = φ(T + U )|=0 . (5)
→0 d
g(T + U ) − g(T ) d
Dg(T )[U ] = lim = g(T + U )|=0 . (6)
→0 d
is defined by
G(T + U ) − G(T ) d
DG(T )[U ] = lim = G(T + U )|=0 . (7)
→0 d
Solution: Given scalar field φ(x) = x · x. Let u be a unit vector. Then we get
Example 2. Show that the directional derivative of vector field g(x) = x × v along the
direction of unit vector u is u × v where v is a given vector.
Solution: Given vector field g(x) = x × v. Let u be a unit vector. Then we get
Example 3. Show that the directional derivative of G(x) = x ⊗ x along a unit vector u
is x ⊗ u + u ⊗ x.
Solution: Consider given function, φ(T ) = α(tr T )2 + βtr(T 2 ). Let U be a tensor with
unit norm, i.e., U : U = 1. Then we get
φ(T + U ) − φ(T )
=⇒ lim = α(tr T )(tr U ) + βtr(T U + U T )
→0
= α(tr T )(tr U ) + 2βtr(T U )
We require the following notion of limit to define Fréchet derivative. This notion of
limit furthermore elaborated in the following examples. Let f (u) be a function. We say
that f (u) approaches zero faster than u, write f (u) = O(u), if
kf (u)k
lim = 0,
kuk→0 kuk
Dφ(x)[u] = Dφ(x) · u.
Dg(x)[u] = Dg(x)u.
DG(x)[u] = DG(x)u.
kG(T + U ) − G(T ) − HU k
lim = 0, (13)
kU k→0 kU k
φ(v + u) = (v + u) · T (v + u)
= v · Tv + v · Tu + u · Tv + u · Tu
= φ(v) + (T T + T )v · u + o(u).
φ(v + u) − φ(v) − (T T + T )v · u
=⇒ lim =0
|u|→0 |u|
φ(T + U ) = v · (T + U )v = v · T v + v · U v = φ(T ) + (v ⊗ v) : U
φ(T + U ) − φ(T ) − (v ⊗ v) : U
=⇒ lim = 0.
kU k→0 kU k
φ(T + U ) = u · (T + U )v = u · T v + u · U v = φ(T ) + (u ⊗ v) : U
φ(T + U ) − φ(T ) − (u ⊗ v) : U
=⇒ lim = 0.
kU k→0 kU k
Example 9. Show that the Fréchet derivatives of functions φ(S) = u · Sv and ψ(W ) =
u · W v are 21 (u ⊗ v + v ⊗ u) and 12 (u ⊗ v − v ⊗ u) respectively if S belongs to the space
of symmetric tensors and W belongs to a space of skew-symmetric tensors.
Solution: Given function, φ(T ) = det(T ). Let {p, q, r} be orthonormal set of vectors
then we have,
φ(T + U ) = det(T + U )
= det(T + U )[p, q, r] (Since [p, q, r] = 1)
= [(T + U )p, (T + U )q, (T + U )r]
= [T p + U p, T q + U q, T r + U r]
= [T p, T q, T r] + [U p, T q, T r] + [T p, U q, T r] + [T p, T q, U r]
+[T p, U q, U r] + [U p, T q, U r] + [U p, U q, T r] + [U p, U q, U r]
= det(T )[p, q, r] + [(cof T )T U p, q, r] + [p, (cof T )T U q, r]
+[p, q, (cof T )T U r] + o(U )
= φ(T ) + tr(cof T )T U ) + o(U )
φ(T + U ) − φ(T ) − (cof T ) : U
=⇒ lim = 0.
kU k→0 kU k
Thus, the Fréchet derivative, Dφ(T ) = cof T = det(T )T −T .
References
Here we discuss the gradient, divergence and curl. These definitions play an important
role in continuum mechanics.
Gradient of a function:
The Fréchet derivative is synonym for the gradient of a function. If the gradient of a
function exists then it is unique. Since the gradient is a linear transformation, we can
represent it in the component form for a known basis. We now present gradient of scalar,
vector and tensor fields in canonical basis.
Gradient of scalar field:
Let φ : <3 → < be a scalar field and the set {e1 , e2 , e3 } be an orthonormal basis to the
three-dimensional Euclidean space. Then the gradient of scalar field is represented by
∂φ(x)
∇φ(x) = ei , (1)
∂xi
where x is a position vector in <3 .
Proof: Let ∇φ(x) be the gradient (Fréchet derivative) of given function φ. As stated in
Lecture-14, we can have the following directional derivative along a unit vector u
Dφ(x)[u] = ∇φ(x) · u
Dg(x)[u] = (∇g)u.
Choosing u = ej , we get
∂gi
Therefore, ij th component of gradient of vector field can be written as (∇g)ij = .
∂xj
Clearly, ij th component of gradient implies the representation shown in Eq. (2).
Gradient of second-order tensor field:
Let G : <3 → V 2 be a second-order tensor field and the set {e1 , e2 , e3 } be an orthonormal
basis to the three-dimensional Euclidean space. Let <3 be the vector space V. Then the
gradient of tensor field has the representation,
∂Gij
∇G(x) = ei ⊗ ej ⊗ ek , (3)
∂xk
Proof: Since the gradient is indeed Fréchet derivative, the directional derivative of G
along unit vector u is given by
DG(x)[u] = ∇Gu.
Choosing u = ek , we get
Proof: The Fréchet derivative Dφ(T ) is known as derivative. Therefore, the directional
derivative along unit tensor U can be written as
Divergence:
Here, we define divergence of a vector and second-order tensor fields.
The divergence of vector field:
Let g : <3 → V be a vector field. Then the divergence of a vector field is defined by
∇ · g = tr(∇g). (6)
It is clear from the definition that the divergence of a vector field is scalar-valued function.
Curl:
We now define new quantities called curl of vector field and second-order tensor field. The
curl of vector field appears in Stokes’ theorem which is discussed later.
The curl of vector field:
Let g : <3 → V be a vector field. Then the curl of vector field, denoted as ∇ × g, is
defined by
(∇ × g) × v = (∇g − ∇g T )v ∀ constant v ∈ V. (10)
It is easy to see that ∇g − ∇g T is a skew-symmetric tensor and ∇ × g is corresponding
axial vector. Using Eq. (9) in Lecture-7, the curl of vector can be written as
1 1
∇ × g = − ijk (∇g − ∇g T )jk ei = − ijk ((∇g)jk − (∇g)kj )ei = ijk ∇gkj ei
2 2
∂gk
= ej × ek
∂xj
It is easy to see from the definition that ∇ × G is a second-order tensor. The curl of
tensor can be expressed in the basis {e1 , e2 , e3 } by
Laplacian:
We now define Laplacian of scalar, vector and tensor field. These quantities involve
second-order partial differentials and arise in differential form of governing equations of
solids and fluids.
∇2 φ = ∇ · (∇φ). (12)
It is clear from the definition that the Laplacian of scalar field is a scalar function which
can be expressed as
∂ 2φ
∇2 φ = .
∂xi ∂xi
∂(∇g)ij ∂ 2 gi
∇2 g = (∇ · (∇g))i ei = ei = ei .
∂xj ∂xj ∂xj
(∇2 G) : H = ∇2 (G : H) ∀ constant H ∈ V 2 .
We now see that this is similar to product rule what we encounter in the case of one
variable calculus. Let x ∈ < then it represents one variable case. Replacing x with t in
product rule, we get
! !
dh(t) dg(t) df (t)
= π f (t), +π , g(t) .
dt dt dt
Clearly, this formula is consistent with one variable case. The proof of this result can be
found in a book by Gurtin (1981).
We now present an example to understand the concept of product rule.
Solution: Given function g = φh. Let u be a unit vector. Then, using product rule, we
get
Chain rule:
Let h(x) be a composite function of f and g, i.e., h = f ◦ g. In addition, let g : Ω1 → Ω2
be a differentiable function at x ∈ Ω1 and, f : Ω2 → Ω3 be a differentiable function at
y = g(x). If the function h is differentiable at x then the directional derivative of h
along u ∈ Ω1 is given by
Similar to product rule, this can also be deduced to one variable case. Let x ∈ < to
represent one variable case. Replacing x with t in chain rule, we get
Example 2. Show that the directional derivative of tr(T −1 ) along the direction of unit
tensor U is −tr(T −1 U T −1 ).
Solution: Given function can be written as φ(G(T )) = tr(T −1 ) and G(T ) = T −1 . There-
fore, we can write, φ(G) = tr(G).
The directional derivative of function φ(G(T )) can be obtained using chain rule. There-
fore, we first want to get directional derivative of G(T ). Using the relation T −1 T = I
and product rule, we get
Let H be a second order tensor then the directional derivative of φ(G) along H can be
written as
tr(G + H) − tr(G)
φ(G)[H] = tr(G)[H] = lim = tr(H).
→0
Using chain rule, we get directional derivative of function φ(G(T )) along U as
References
In continuum mechanics, scalar, vector and tensor functions are dependent not only on
space but also on time. Therefore, we need to define the time derivative of scalar, vector
and tensor functions. Since the time t ∈ <+ , the definition of time derivative is easier
than the spatial derivatives that are discussed earlier.
Time derivative of scalar-, vector- and tensor-valued functions:
Let φ : < → <, g : < → V and G : < → V 2 be scalar, vector and tensor functions of time
t, respectively. Then the time derivatives of these functions, denoted as φ̇, ġ and Ġ, are
defined by
dφ φ(t + ) − φ(t)
φ̇ = = lim
dt →0
dg g(t + ) − g(t)
ġ = = lim
dt →0
dG G(t + ) − G(t)
Ġ = = lim .
dt →0
The product rule and the chain rule can be extended the time derivative. We present an
example to understand these rules.
Example 1. Show that time derivative of function tr(T )det(T ) is
! !
dT dT
tr(T ) (cof T ) : + tr det(T ).
dt dt
Solution: Given function φ(T ) = tr(T )det(T ). Then we can apply product rule to get
the derivative. We now calculate derivative of tr(T ) and det(T ).
d(tr T ) tr(T (t + )) − tr(T (t)) tr(T (t + ) − T (t))
= lim = lim
dt →0 !
→0
!
T (t + ) − T (t) dT
= tr lim = tr ,
→0 dt
alternatively, using chain rule
" # !
d(tr T ) dT dT dT
= Dtr(T ) =I: = tr .
dt dt dt dt
Using chain rule, we can get time derivative of det(T ) as
" #
d(det T ) dT dT
= D(detT ) = (cof T ) : (see Example-10 in Lecture-14)
dt dt dt
where X(t) is a tensor function, 0 ≤ t < ∞. The existence theorem for linear differential
equations guarantees a unique solution to the initial value problem, which we write in the
form
X(t) = eT t .
It is easy to see from the series expansion that
Tt
T
eT = eT t ∀T ∈ V 2 ,
Proof. Let X(t) = eT t and X(0) = I then we want to show that det X(t) > 0 for every
t ∈ [0, ∞).
Suppose that det X(τ ) = 0 for some finite time τ . Since X is continuous and
det X(0) = 1, there is some nonempty interval [0, τ ) such that det X(t) > 0 for every
t ∈ [0, τ ). Consequently, X(t) is invertible in nonempty interval and we get
Using initial condition det X(0) = 1, we have the following unique solution
T )t
det X(t) = e(tr ,
for 0 ≤ t < τ . The equation shows that det X(t) > 0 for every finite t ≥ 0 because X(t) is
continuous. Therefore, this is contradicting the initial assumption that τ is finite. Thus,
there is no finite τ such that det X(τ ) = 0 and hence, det X(t) > 0 for every t ∈ [0, ∞).
Furthermore,
det eT t = e(tr T )t (Since X(t) = eT t ).
Problem 2. Let A and be B are two second-order tensors. Then e(A+B)t = eAt eBt if and
only if A and B are commuting tensors.
XA XB = e(A+B)t .
Thus, e(A+B)t = eAt eBt if and only if A and B are commuting tensors.
Proof. Let X(t) = eW t for t ≥ 0 then, by definition of previously stated initial value
problem, we have
Ẋ = W X, X(0) = I.
Let Z = XX T . Then we get
Ż = ẊX T + X Ẋ T = W XX T + XX T W T .
Since W is skew-symmetric tensor, we can write
Ż = W XX T − XX T W .
Therefore, Z satisfies
Ż = W Z − ZW , Z(0) = I.
This initial value problem has unique solution, Z(t) = I, ∀t ≥ 0. Thus, X(t)X(t)T = I
implies X(t) is an orthogonal tensor. Furthermore, det X(t) > 0 and hence, X(t) is a
rotation tensor.
Problem 4. If Q(t) is an orthogonal tensor then show that Q̇QT is a skew-symmetric
tensor.
The integral theorems play an important role in the balance laws. We now discuss
some integral theorems that are useful for our discussion in continuum mechanics.
Integral theorems:
Here we discuss divergence, Stokes’ and localization theorems.
Divergence theorem or Gauss divergence theorem:
Let Γ be a boundary of regular domain Ω in three-dimensional Euclidean space <3 . Let
φ : <3 → <, v : <3 → V, S : <3 → V 2 be scalar, vector, tensor fields, respectively. Then
Z Z
∇φ ∂Ω = φn ∂Γ (1)
Z Ω ZΓ
∇ · v ∂Ω = v · n ∂Γ (2)
ZΩ ZΓ
∇ · S ∂Ω = Sn ∂Γ (3)
Ω Γ
= u · (n × v) ∂Γ = u · (n × v) ∂Γ.
Γ Γ
∇ × v ∂Ω = n × v ∂Γ.
R R
Arbitrariness of u implies Ω Γ
= (u · n)v ∂Γ = (n · u)v ∂Γ = (v ⊗ n) u ∂Γ
Γ Γ Γ
Z
= v ⊗ n ∂Γ u
Γ
∇v ∂Ω = v ⊗ n ∂Γ.
R R
arbitrariness of u implies Ω Γ
Stokes’ theorem:
Let Γ be a surface in three-dimensional Euclidean space <3 with piecewise smooth bound-
ary curve C and v be a vector field. Then
Z I
(∇ × v) · n ∂Γ = v · dx (4)
Γ C
R
(∇ · v)n − (∇v)T n ∂Γ = v × dx.
H
Arbitrariness of u implies Γ C
Potential theorem:
Let Ω be a simply-connected region, and u = ∇φ be a vector field over Ω. Then
∇ × u = 0.
(∇ × u)i = (∇ × (∇φ))i
!
∂ ∂φ
= ijk
∂xj ∂xk
∂ 2φ
= ijk
∂xj ∂xk
∂ 2φ
= ikj (interchanging indices j and k)
∂xk ∂xj
∂ 2φ
= −ijk
∂xj ∂xk
It is clear that
∂ 2φ ∂ 2φ
(∇ × u)i = ijk = −ijk = −(∇ × u)i .
∂xj ∂xk ∂xj ∂xk
Thus, ∇ × u = ∇ × (∇φ) = 0.
Thus, the value of integral xx0 u(y) · dy along any curve in Ω depends only on x if we
R
Localization theorem:
Let φ be a continuous scalar, vector or tensor field, and B be an arbitrary open subset of
domain Ω. If Z
φ(x) ∂Ω = 0, ∀B ⊆ Ω,
B
Then
φ(x) = 0, ∀x ∈ Ω.
References
Module-3: Kinematics
Lecture-17: The Lagrangian and
Eulerian Descriptions
In previous lectures, we presented a detailed treatment of tensor algebra and calculus that
is necessary for the study of continuum mechanics. Since this course is about understand-
ing the beahviour of continuum under action of forces and thermal effects, the content
is divided into three major parts: (i) Kinematics, (ii) Balance laws, (iii) Constitutive
equations.
Kinematics refers to the study of the continuous medium under purely geometric con-
siderations. In other words, we study the displacement, velocity, acceleration, strain, and
rate of deformation fields of continuum body. Application of fundamental laws of me-
chanics and thermodynamics to the continuum is discussed in the topic balance laws. An
important physical quantity called stress is introduced via the balance of linear momen-
tum. Finally, the kinematical quantities such as strain and rate of deformation are related
with the quantities such as stress and rate of stress through the constitutive relations.
The task of continuum mechanics finishes with the understanding of restrictions imposed
by second law of thermodynamics, principle of material frame-indifference and material
symmetry on constitutive relations.
In this module, kinematics of continuous medium is presented. We make the following
assumptions in the study of continuous media.
• The boundary of continuum body should not make self contact or penetrate.
• Topology of the body should be invariant under the deformation, i.e., new holes
cannot be generated or existing holes cannot be removed in process of deformation.
Ω0 Ω
u
e3 X x
e2
e1
deformation of a body during its motion. These two are known as Lagrangian and Eulerian
descriptions.
Lagrangian and Eulerian descriptions:
Since the map χ and its inverse χ−1 are smooth, every field defined over the body can
be expressed either in terms of reference variable X ∈ Ω0 or in terms of spatial variable
x ∈ Ω. Expressing all quantities in terms of reference variable X is known as Lagrangian
formulation. Similarly, expressing all quantities in terms of spatial variable x is known
as Eulerian formulation. The Lagrangian and Eulerian formulations are also known as
material description and spatial description, respectively. Of course both formulations
are equivalent as it is just a change of variable. However, depending on convenience
and available experimental observations, we choose the suitable formulation. Usually,
Lagrangian framework is preferred for the solid mechanics problem since the reference
configuration of continuum body is known. On the other hand, Eulerian framework is
preferred for the fluid mechanics as fluid motion is observed in a fixed region of space. For
example the study of channel flow is done by observing water flow in a fixed region of a
channel and the study of air flow is done by observing air flow in a wind-tunnel. We now
present an example of bar with temperature distribution to understand the Lagrangian
and Eulerian formulations.
Example 1 (Temperature distribution of bar under uniaxial motion). Consider a bar of
initial length of four units, i.e., the reference domain Ω0 = [0, 4]. Let the bar is extending
with time t and the extension is given by the mapping function x = (1 + t)X. The bar is
experiencing the temperature distribution in Lagrangian description θ = X(4 − X)t2 or
in the Eulerian description θ = x(4 + 4t − x)t2 /(1 + t)2 (Fig. (2)).
It is clear from Fig. (2) that the material description X is always associated with
particle in reference configuration. We can visualize material description in this problem
as thermometer fixed to the bar and measuring the temperature under the extension of
bar. It can be noted that thermometer moves with the bar in material description as
shown in Fig. (3). On the other hand, spatial description can be visualized as measuring
the temperature by fixing the thermometer to the axis x, i.e., thermometers are not moving
with the bar as shown in Fig. (4). It can be noted that the particles of bar occupy different
spatial positions at different time frames.
T1 , T2 , T3 Thermometers
T1 T2 T3
t=3
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 x
T1 T2 T3
t=2
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 x
T1 T2 T3
t=1
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 x
T1 T2 T3
t=0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 x
t=3
1
T₁
2
T₂
3
T₃
4
T₄
5 6
T₅
7 8
T₆
9
T₇
10 11 12
T₈
13 14 15 16 x
t=2
1
T₁
2
T₂
3
T₃
4
T₄
5 6
T₅
7 8
T₆
9
T₇
10 11 12
T₈
13 14 15 16 x
t=1
1
T₁
2
T₂
3
T₃
4
T₄
5 6
T₅
7 8
T₆
9
T₇
10 11 12
T₈
13 14 15 16 x
t=0
1
T₁
2
T₂
3
T₃
4
T₄
5 6
T₅
7 8
T₆
9
T₇
10 11 12
T₈
13 14 15 16 x
We observed from the example that material point X do not vary with time under the
deformation of continuum body. Therefore, the velocity of material point X is defined by
∂(x − X) ∂(χ(X, t) − X) ∂χ
u̇ = = = . (4)
∂t ∂t ∂t
∂ 2χ
ü = . (5)
∂t2
We will present more detailed discussion on spatial description of velocity and acceleration
in Lecture-20. We now define an important tensor field called deformation gradient which
plays a critical role in defining the measure of strain.
o(U )
lim = 0.
kU k→0 kU k
The deformation gradient F (X) characterizes the local behavior of χ(X, t) at X. Since
χ is the smooth vector-valued function of vector, the deformation gradient F is a second-
order tensor. The components of deformation gradient are given by
∂χ1 ∂χ1 ∂χ1
" # ∂X ∂X2 ∂X3
∂χ ∂χ21 ∂χ2 ∂χ2
[F ] = =
∂X1 ∂X2 ∂X3
(6)
∂X ∂χ3 ∂χ3 ∂χ3
∂X1 ∂X2 ∂X3
We note that the deformation with constant gradient F (i.e., F is independent of material
variable X) is known as homogeneous deformation.
Using the definition of displacement field u(X, t) = χ(X, t) − X, we can write the
deformation gradient as
F = I + ∇u, (7)
where ∇u is known as displacement gradient. The relation can be expressed in the
component form as
∂ui
Fij = δij + . (8)
∂Xj
We will see in the next lecture that the deformation gradient play an important role
in defining the strain measures.
References
Module-3: Kinematics
Lecture-18: The Strain and Stretch Tensors
Here, we discuss the strain measure that is related to the local change in length. Since
the generalized strain measure require the definition of stretch tensors, so their definition
also discussed in this lecture.
Strain tensors:
Let Ω0 and Ω be the reference and deformed configurations respectively (recall from
Lecture-17). Let χ(X, t) be a map from Ω0 to Ω and χ−1 (x, t) be its inverse map.
Let dX be a small fiber in the reference configuration and its transformation during
deformation be dx. Then we have the relation
dx = F dX, (1)
where F is the deformation gradient. As pointed out previously (see Lecture-17), the
deformation gradient is invertible tensor. Therefore, we have
dX = F −1 dx. (2)
C = FTF (3)
B = FFT. (4)
ds2 − dS 2 = dx · dx − dX · dX
= F dX · F dX − dX · dX
= dX · F T F dX − dX · dX
= dX · (C − I)dX
= 2dX · EdX.
It can be noted that the Green strain tensor is a strain measure in Lagrangian description.
Almansi strain tensor:
We now define another measure of strain known as Almansi strain tensor. Expressing dX
in terms of dx in ds2 − dS 2 yields
ds2 − dS 2 = = dx · dx − dX · dX
= dx · dx − F −1 dx · F −1 dx
= dx · dx − dx · F −T F −1 dx
= dx · (I − B −1 )dx
= 2dx · Ẽdx.
We can observe that the Green strain and the Almansi strain are related by
E = F T ẼF (7)
Clearly, both the Green strain and the Almansi strain measures are zero whenever there
is no change in length. Similar to Green and Almansi strain tensors, there are many
definitions for strain measure are possible such that the measure is zero when there is no
change in length. Of course the strain is non-zero when there is deformation experienced
by the body. We now define stretch tensors as they play an important role in definition
of generalized strain.
Stretch tensors:
The deformation gradient F is not only invertible but also det(F ) > 0 (see discussion in
Lecture-17). Therefore, using polar decomposition theorem, we can write the deformation
gradient as
F = RU = V R, (8)
where U and V are positive-definite symmetric tensors and R is the rotation tensor.
The positive-definite symmetric tensor U is known as right stretch tensor as it is the
square-root tensor of right Cauchy-Green strain C. Similarly, V is known as left stretch
tensor as it is the square-root tensor of left Cauchy-Green strain B. Using Eqs. (3), (4)
and (8), the left and right Cauchy-Green strains can be expressed by the stretch tensors,
C = U2
B = V 2,
Equation (8) implies V = RU RT and hence, the eigenvalues of U and V are same. But
eigenvectors are related by the rotation R.
The Green and Almansi strain tensors can have the following representation in terms of
principal values and principal directions of stretches.
3
1 1 1 2
(C − I) = (U 2 − I) =
X
E = (λi − 1)ri ⊗ ri (12)
2 2 i=1 2
3
1 1 1
(I − B −1 ) = (I − V −2 ) = (1 − λ−2
X
Ẽ = i )li ⊗ li . (13)
2 2 i=1 2
We now show that Green and Almansi strain tensors are particular cases of generalized
strain defined in the following section.
The strain tensor E (n) is called material strain tensor and it is used in Lagrangian de-
scription. On the other hand, Ẽ (n) is called spatial strain tensor and it is used in Eulerian
description.
For every integer n the measure qualifies as strain measure. Therefore, these measure
of strains are known as generalized strains. The special cases of n = 0, 1 and 2 lead to
known strain measures.
The logarithmic strain tensors E (0) and Ẽ (0) are also known as Hencky strain tensors.
Furthermore, the case n = 1 is known as Biot strain, i.e.,
3
E (1) = U − I =
X
(λi − 1) ri ⊗ ri . (19)
i=1
For the case n = 2, Eqs. (14) and (15) reduces to Green and Almansi strains shown in
Eqs. (12) and (13)
Problem 1 (Characterization of rigid motions-I). Let Ω0 and Ω be the reference and
deformed configurations. Then show that the motion of the body is rigid if and only if the
mapping function χ : Ω0 → Ω is in the form
where Q(t) is a rotation tensor and c(t) is a vector. (Physically, the rigid motion of
reference configuration consists of only rotation Q(t) and translation c(t).)
Solution: The rigid motion is defined as a motion of body in which the length between
any two points is preserved. Let X1 and X2 be two points in the body. Since χ(X, t) is
given mapping between reference configuration Ω0 and the deformed configuration Ω, the
rigid motion can be expressed by
Since X1 and X2 are arbitrary, choosing X1 = X2 , Eq. (21) implies the orthogonality of
F (X, t) at every point X ∈ Ω0 . Since det F (X, t) > 0 ∀t ∈ [0, ∞), F (X, t) is a rotation.
We now assume that Q(t) = F (X, t) as F (X, t) is independent of position X. Inte-
grating the relation ∇χ = Q(t), we get
Conversely, if the motion is in the form χ(X, t) = Q(t)X + c(t), we can write following
expression for every X1 and X2 in Ω0 .
Problem 2. Let Ω0 and Ω be reference and deformed configurations. The Green strain
E(X, t) = O, i.e., C(X, t) = I for all X ∈ Ωo if and only if the motion is rigid.
where Q(t) is the rotation tensor and c(t) is the translation vector.
Using the definition of deformation gradient, we get F (X, t) = Q(t). Therefore, this
implies the Green strain tensor,
1
E(X, t) = (F T F − I) = O, ∀X ∈ Ω0 .
2
Conversely, we now show that E(X, t) = O, ∀X ∈ Ω0 implies rigid motion. Let dS be
infinitesimal length of vector dX at a point X in the reference configuration and ds be
the transformed infinitesimal length in deformed configuration at x = χ(X, t). Locally,
dS = ds if the Green strain E(X, t) = O. We now consider a straight line between two
points X1 and X2 in the reference configuration such that the length is L. Suppose the
straight line is transformed into a curve in the deformed configuration with end points
x1 = χ(X1 , t) and x2 = χ(X2 , t). At an instant of time t, the condition dS = ds implies
L = |X1 − X2 | ≥ |x1 − x2 |.
We now consider a curve in reference configuration which gets transformed into a straight
line. At an instant of time t, the condition dS = ds implies
|X1 − X2 | ≤ |x1 − x2 |.
References
Module-3: Kinematics
Lecture-19: Area and Volume Transformation
where J is det F .
If two second-order tensor fields, T (X, t) and τ (x, t) stated above are related by the
Piola transformation then we have the following results:
where ∂Γ0 is infinitesimal area element on the boundary of reference domain Γ0 and
∂Γ is the corresponding infinitesimal area element on the boundary of deformed
domain Γ.
∇X · (JF −T ) = ∇X · cof F = 0,
T n0 ∂Γ0 = τ n ∂Γ
∂Γ = |(cof F )n0 | ∂Γ0
(cof F )n0
n = .
|(cof F )n0 |
Solution: We use the relation ∂Ω = J∂Ω0 where J is det F in following proof. The proof
for the relation ∂Ω = J∂Ω0 is presented in the following section. We can obtain the
following relation
Z Z
0
T n ∂Γ0 = ∇X · T ∂Ω0 (Using divergence theorem)
Γ0 Ω0
Z
= ∇x · τ ∂Ω (Using Eq. (9) and ∂Ω = J∂Ω0 )
ZΩ
= τ n ∂Γ. (Using divergence theorem)
Γ
∂Ω = J ∂Ω0 , (10)
where J is det F .
Proof: The volume can be defined as the absolute value of triple scalar product. Let dX1 ,
dX2 and dX3 are linearly independent infinitesimal vectors that form a parallelepiped in
the reference configuration as shown in the Fig. (1). Let us consider infinitesimal vectors
dX1 , dX2 and dX3 which are being transformed into dx1 , dx2 and dx3 , respectively. The
infinitesimal vectors dx1 , dx2 and dx3 in deformed configuration form a parallelepiped in
the deformed configuration as shown in Fig. (1).
x = χ(X,t)
dx₃ Ω
e3 dx₂ dx₁
Ω0
e1 e2
∂Ω = |[dx1 , dx2 , dx3 ]| = |[F dX1 , F dX2 , F dX3 ]| = |det(F )||[dX1 , dX2 , dX3 ]|
= J ∂Ω0 ,
where ∂Ω0 is the volume of parallelepiped in undeformed configuration. Thus, the volume
elements are related by the determinant of the deformation gradient.
Reference
Module-3: Kinematics
Lecture-20: Material and Spatial Time Derivatives
The velocity, acceleration and velocity gradient are important quantities of kinematics.
Here, we discuss the description of these kinematic quantities in both Lagrangian and
Eulerian framework.
Material and spatial description of velocity field:
Let Ω0 be a reference configuration and Ω be a deformed configuration at an instant of
time t. Let x = χ(X, t) be a mapping that represents the motion from Ω0 to Ω. Since
χ is invertible for all X ∈ Ω0 at every instant of time t, the inverse map X = χ−1 (x, t)
exists from Ω to Ω0 which associates every spatial point x to a material point X.
The material coordinate X is fixed to a particle which is independent of time (see
Example-1 in Lecture-17). On the other hand, the spatial position of particle x can vary
with time. The displacement of particle is defined by the relation u = x − X. Therefore,
the velocity of particle
∂(x − X) ∂x
u̇ = = (1)
∂t ∂t
Substituting the map x = χ(X, t) in u̇, we get velocity ṽ in terms of X as
∂x ∂χ(X, t)
ṽ(X, t) = = (2)
∂t ∂t
This represents the velocity field in Lagrangian description (material description) as it is
a function of material variable X.
Using the inverse map X = χ−1 (x, t), we can obtain the following Eulerian description
(spatial description) of velocity field
v(x, t) = ṽ(χ−1 (x, t), t). (3)
In general, the function forms of ṽ and v are different while the value of the functions
are equal at a material point X and corresponding spatial point x = χ(X, t). We now
present an example to understand the material and spatial description of velocity fields.
Example 1. Let the mapping function for motion x = χ(X, t) is given by
x1 = X1 (1 + t2 ), x2 = X2 (1 + t), x3 = 0.
Then find the velocity in material and spatial descriptions.
Using the motion representation, we can obtain X1 = x1 /(1 + t2 ) and X2 = x2 /(1 + t).
Substituting in the material description, we get the following spatial velocity field
2x1 t x2
v1 = , v2 = , v3 = 0.
1 + t2 1+t
It is clear from the example that ṽ and v are having different forms whereas both the
forms give same value of the velocity corresponding material and spatial coordinates, i.e.,
ṽ(X, t) = v(x, t)
For example, material description of velocity ṽ = (4, 2, 0) at a material point X =
(1, 2, 3) at an instant of time t = 2. We now see that x = (5, 6, 0) is a spatial coor-
dinate corresponding to the material point X = (1, 2, 3) at an instant of time t = 2.
Substituting x in spatial description of velocity, we get v = (4, 2, 0). Therefore, the
values of velocity are equal both in material or spatial at corresponding points, i.e.,
ṽ(X, t)|(1,2,3) = v(x, t)|(5,6,0)
We now introduce the concept of material and spatial time derivatives of more general
fields. Later, we apply this result to get the acceleration in both descriptions.
Material and spatial time derivative of general fields:
There is a possibility of confusion in taking material and spatial time derivatives of general
fields. Hence, we introduce the following notations to avoid the confusion in taking the
differentiation of fields.
Let φ̃(X, t), g̃(X, t), G̃(X, t) be scalar, vector and tensor fields in the Lagrangian
(material) description, respectively. Then the material gradient of scalar, vector and
tensor fields are denoted by ∇X φ̃, ∇X g̃, ∇X G̃, the material divergence of vector and
tensor field are denoted by ∇X · g̃, ∇X · G̃, and the material time derivative of scalar,
Dφ̃ ˙ Dg̃ ˙ DG̃
vector and tensor fields are denoted by φ̃˙ or , g̃ or , G̃ or . In component
Dt Dt Dt
form
∂ φ̃ ∂ g̃i ∂ G̃ij
(∇X φ̃)i = , (∇X g̃)ij = , (∇X G̃)ijk = ,
∂Xi ∂Xj ∂Xk
∂ g̃i ∂ G̃ij
∇X · g̃ = , (∇X · G̃)i = ,
∂Xi ∂Xj
Dφ̃ ∂ φ̃ ˙ Dg̃ ∂ g̃ ˙ = DG̃ = ∂ G̃ .
φ̃˙ = = , g̃ = = , G̃
Dt ∂t Dt ∂t Dt ∂t
Let φ(x, t), g(x, t), G(x, t) be scalar, vector and tensor fields in the Eulerian or spatial
description. Then the spatial gradient of scalar, vector and tensor fields are denoted by
∇x φ, ∇x g, ∇x G, the spatial divergence of vector and tensor field are denoted by ∇x · g,
∇x · G, and the spatial time derivative of scalar, vector and tensor fields are denoted by
φ0 , g 0 , G0 . In component form
∂φ ∂gi ∂Gij
(∇x φ)i = , (∇x g)ij = , (∇x G)ijk = ,
∂xi ∂xj ∂xk
∂gi ∂Gij
∇x · g = , (∇x · G)i = ,
∂xi ∂xj
∂φ ∂g ∂G
φ0 = , g0 = , G0 = .
∂t ∂t ∂t
We now derive the relation between the material and spatial gradients.
Relation between material and spatial gradients:
Let Ω0 be a reference configuration and Ω be a deformed configuration. Let x = χ(X, t)
be a map from Ω0 to Ω and F = ∇X χ be the deformation gradient. Let φ(x, t) and
g(x, t) be scalar and vector fields defined over Ω. Then we have
∂φ ∂φ ∂χj ∂φ
(∇X φ)i = = = Fji
∂Xi ∂xj ∂Xi ∂xj
∂gi ∂gi ∂χk ∂gi
(∇X g)ij = = = Fkj
∂Xj ∂xk ∂Xj ∂xk
or, equivalently,
∇X φ = F T ∇x φ (4)
∇X g = ∇x gF . (5)
We now use the relation between material and spatial gradients to represent Green and
Almansi strain tensors in terms of displacement gradients. We know that the displacement
u = x − X. Furthermore, the deformation gradient ∇X u = F − I. Therefore, the Green
strain tensor,
1 T 1
E= F F −I = ∇X u + ∇X uT + ∇X uT ∇X u . (6)
2 2
The Green strain tensor in component form,
!
1 ∂ui ∂uj ∂uk ∂uk
Eij = + + . (7)
2 ∂Xj ∂Xi ∂Xi ∂Xj
The relations between spatial and material gradients are shown in Eqs. (4) and (5). We
now want to derive relation between material and spatial time derivatives. In order to
get the relation between time derivatives, we need to have the following chain rule.
Chain rule:
Let ξ(t) be a scalar valued function of time t. Let ψ(ξ(t), t) be a scalar valued function
which depends explicitly and implicitly, through ξ, on t. Then the total derivative with
respect to time t can be written as
dψ ∂ψ ∂ψ dξ
= + . (11)
dt ∂t ∂ξ dt
Dφ̃
• the material time derivative of scalar field and the spatial time derivative of
Dt
∂φ
scalar field are related by
∂t
Dφ̃(X, t) Dφ(x, t) ∂φ ∂φ ∂χi ∂φ
= = + = + (∇x φ) · v.
Dt Dt ∂t ∂xi ∂t ∂t
Dg̃
• the material time derivative of vector field and the spatial time derivative of
Dt
∂g
vector field are related by
∂t
Dg̃(X, t) Dg(x, t) ∂g ∂g ∂χi ∂g
= = + = + (∇x g)v.
Dt Dt ∂t ∂xi ∂t ∂t
DG̃
• the material time derivative of tensor field and the spatial time derivative of
Dt
∂G
vector field are related by
∂t
DG̃(X, t) DG(x, t) ∂G ∂G ∂χi ∂G
= = + = + (∇x G)v.
Dt Dt ∂t ∂xi ∂t ∂t
We note that the material time derivative in reference domain can be obtained by
Example 2. Consider a bar of initial length of four units, i.e., the reference domain
Ω0 = [0, 4]. Let the bar is extending with time t and the extension is given by the mapping
function x = (1+t)X. The bar is experiencing the temperature distribution in Lagrangian
description θ̃(X, t) = X(4 − X)t2 or in the Eulerian description θ(x, t) = x(4 + 4t −
x)t2 /(1 + t)2 shown in Fig. (1).
Dθ̃(X, t) ∂ θ̃
= = 2X(4 − X)t.
Dt ∂t
We now consider the spatial description of temperature, θ = x(4 + 4t − x)t/(1 + t)2 . Using
the definition, we get the spatial time derivative as
∂θ 2xt
= 3
(4 + 6t − x + 2t2 ).
∂t (1 + t)
Dθ̃ Dθ
= .
Dt Dt
Thus, the material time derivative is equal to the total derivative in spatial description.
In general, material time derivative and spatial time derivative are not equal.
L = ∇x v, (15)
Using Eq. (5), we can obtain the following relation between spatial and material gradient
of velocity.
DF
= ∇X v = (∇x v) F = LF .
Dt
Since the tensor F is invertible, we have F F −1 = I. Taking the material time derivative
on both sides, we get
D (F F −1 ) DF −1 DF −1
= F +F =O
Dt Dt Dt
DF −1 DF −1
=⇒ = −F −1 F = −F −1 L.
Dt Dt
The material derivative of J (i.e., det(F )) in can be written as
DJ DF
= (cof F ) : (see an epression in Example-1 of Lecture-16)
Dt Dt
= (cof F ) : (LF )
= F (cof F )T : LT
= JI : LT
= J tr(L)
= J ∇x · v.
Since J represents locally the ratio of deformed volume to reference volume, the isochoric
motion (J = 1) can be characterized by
1 DJ
= ∇x · v = 0. (17)
J Dt
In summary, we have the relations
DF
= LF (18)
Dt
DF −1
= −F −1 L (19)
Dt
1 DJ
= tr(L) = ∇x · v (20)
J Dt
References
Module-3: Kinematics
Lecture-21: The Rate of Deformation and Spin
In kinematics, the rate of deformation and spin tensors are also useful in addition to
previously defined stretch and strain tensors. In particular, these are important in fluid
mechanics, visco-elasticity and plasticity. Therefore, we now define the rate of deformation
and spin tensors.
Rate of deformation and spin tensors:
Let Ω0 and Ω be reference and deformed configurations, respectively. Let v be a velocity
field over Ω at an instant of time t. Let y be a point in Ω and x ∈ Ω be a neighborhood
of y, i.e., |y − x| < for a given ∈ <. Let L be the spatial velocity gradient. Then,
using the definition of gradient, we can write
v(x, t) − v(y, t) = L(y, t)(x − y) + o(|x − y|), (1)
where o(|x − y|) approaches zero faster than |x − y|. As |x − y| → 0, the second-order
tensor L(y, t) represents the velocity gradient at a point y at an instant of time t. Let
D and W be the symmetric and skew-symmetric parts of L, i.e.,
1 1
D = (L + LT ) = (∇x v + ∇x v T ), (2)
2 2
1 1
W = (L − L ) = (∇x v − ∇x v T ).
T
(3)
2 2
Substituting D and W in the above expansion, we get
v(x, t) − v(y, t) = W (y, t)(x − y) + D(y, t)(x − y) + o(|x − y|).
Since W is skew-symmetric tensor, we show in the following problem that v(y, t) +
W (y, t)(x − y) represents the velocity field of rigid motion. Therefore, W is called spin
tensor. The other part of velocity field
D(y, t)(x − y)
represents the deformation with time, i.e., the rate of stretch of fibers of continuum body
locally at an instant of time t. Therefore, the tensor D is called rate of deformation
tensor or stretching tensor. It should be noted that the stretch tensor is different from
the stretching tensor as stretch is not rate of deformation (see definition of stretch tensor
in Lecture-18).
We now derive the relation between right and left stretch tensors of the deformation
gradient F , i.e., U and V , and the velocity gradient L. Using the polar decomposition
theorem (see Lecture-18), we have
F = RU = V R
where U is right stretch tensor, V is left stretch tensor and R is the rotation tensor. We
have the following relation between L and F (see Lecture-20)
DF −1 DF
L= F (Since = LF ).
Dt Dt
Substituting right polar decomposition F = RU , we get
D(RU )
L = (RU )−1
Dt !
DR DU
= U +R U −1 RT
Dt Dt
!
DR T DU −1
= R +R U RT .
Dt Dt
where
! !
DU −1 1 DU −1 DU −1
sym U = U + U , and
Dt 2 Dt Dt
! !
DU −1 1 DU −1 DU −1
skw U = U − U .
Dt 2 Dt Dt
It is interesting to see that W has two contributions: one due to rotation R denoted by
Wrot and other due to stretch tensor denoted by Wstr , i.e.,
where
DR T
Wrot = R
Dt !!
DU −1
Wstr = R skw U RT .
Dt
The motion is said to be rigid if the distance between two arbitrary material points is
preserved during the motion, i.e.,
∂
|χ(X1 , t) − χ(X2 , t)| = 0, ∀X1 , X2 ∈ Ω0 .
∂t
Taking differentiation, we get
We now prove a result in differential equations that relates skewsymmetric and rotation
tensors. Note that following result is converse to Problem-4 of Lecture-16.
Problem 1. Let Q(t) be a second-order tensor which is continuous with respect to time.
Let Ŵ (t) be a skewsymmetric tensor related to Q(t) by
DQ
= Ŵ Q, (8)
Dt
and Q(0) = I. Then show that Q(t) is rotation tensor 1 .
Proof. The tensor Ŵ is skewsymmetric in Eq. (8). Therefore, taking transpose, yields
DQT
= −QT Ŵ . (9)
Dt
Taking material time derivative of QT Q, we get
D DQT DQ
(QT Q) = Q + QT
Dt Dt Dt
= −QT Ŵ Q + QT Ŵ Q (Substituting Eqs. (8) and (9))
= O
The unique solution Q(t)T Q(t) = I, ∀ t ∈ [0, ∞) to the differential equation follows from
the initial condition Q(0) = I.
The continuity of Q implies det(Q(t)) = 1 as Q(0) = I. Thus, Q(t) is not just an
orthogonal tensor but a rotation tensor.
Proof. We showed in Lecture-18 (see Problem-1 in Lecture-18) that the rigid motion
should have the form
χ(X, t) = Q(t)X + c(t), (11)
1
The proof for this problem is taken from Bar-Itzhack, I. Y., and Markley, F. L., “Minimal parameter
solution of the orthogonal matrix differential equation”, IEEE Transactions on Automatic Control, 1990,
35(7), 314-317
where Q(t) is rotation tensor and c(t) is a vector. We now show that Eq. (11) implies
Eq. (10). Let X1 and X2 be two material points in Ω0 . Then we have
or, alternatively,
x2 − x1 = Q(t)(X2 − X1 ). (12)
Taking material derivative, we get
D DQ
(χ(X1 , t) − χ(X2 , t)) = (X1 − X2 )
Dt Dt
DQ −1
=⇒ v(x1 , t) − v(x2 , t) = Q (χ(X1 , t) − χ(X2 , t))
Dt
DQ T
=⇒ v(x1 , t) − v(x2 , t) = Q (x1 − x2 ).
Dt
DQ T DQ T
Since QQT = I implies Q is a skewsymmetric tensor. Let us denote Q with
Dt Dt
Ŵ . Then we have
v(x1 , t) − v(x2 , t) = Ŵ (t)(x1 − x2 ).
Clearly, rigid motion implies Eq. (10).
Conversely, we now show that Eq. (10) implies Eq. (11), i.e., Eq. (10) also represents
rigid motion. Applying Eq. (11) to both material points X1 and X2 yields Eq. (12).
Therefore, it is enough to show that Eq. (10) implies Eq. (12).
Let z(t) = x1 − x2 = χ(X1 , t) − χ(X2 , t). Then we have
Dz
= Ŵ (t)z, z(0) = X1 − X2 . (13)
Dt
Integrating above differential equation, we get
Z t
z(t) = z(0) + Ŵ (τ 0 )z(τ 0 )dτ 0 (14)
0
Let
" Z t Z t Z τ0 !
0 0 0 00 00
R(t) = I+ Ŵ (τ )dτ + Ŵ (τ ) Ŵ (τ )dτ dτ 0
0 0 0
Z t (Z 0 Z τ 00 ! ) #
τ
0 00 000 000 00 0
+ Ŵ (τ ) Ŵ (τ ) Ŵ (τ )dτ dτ dτ + · · · . (18)
0 0 0
Comparing above equation with Eq. (12), it is enough to show that R(t) is a rotation
tensor.
Problem 3. The motion is rigid if and only if the rate of deformation D(x, t) = O, ∀
x ∈ Ω and t ∈ [0, ∞).
Proof. We showed in previous problem (i.e., Problem-2) that the motion is rigid if and
only if the motion follows Eq. (10). Let ĉ(t) be a constant vector. Then Eq. (10) can be
written as
Since x1 and x2 are arbitrary in above equation, we can have the following equation of
motion for continuum.
v(x, t) = Ŵ (t)x + ĉ(t) ∀x ∈ Ω.
Taking the gradient on both sides of above equation, we get velocity gradient L = Ŵ .
Since velocity gradient is skewsymmetric tensor, the rate of deformation D(x, t) = O.
Conversely, we now show that D(x, t) = O for all x ∈ Ω implies the motion is rigid.
We have the following relation in indicial form as D = O.
∂vi ∂vj
=− .
∂xj ∂xi
∂vj ∂vk
Since =− , we can write above equation
∂xk ∂xj
! ! !
∂ 2 vi ∂ ∂vj ∂ ∂vk ∂ ∂vk ∂ 2 vi
=− = = =−
∂xk ∂xj ∂xi ∂xk ∂xi ∂xj ∂xj ∂xi ∂xk ∂xj
∂vk ∂vi
Last step follows from =− . In the above equation, right hand side term is same
∂xi ∂xk
as that of left hand side term with negative sign. Therefore, above equation yields
∂ 2 vi
= 0,
∂xk ∂xj
and it implies ∇x v is a constant skewsymmetric tensor. Thus, the motion is rigid.
which is the classical formula for velocity field in the rigid body mechanics, where w is
known as angular velocity.
Rate of strain tensor:
Usually, in the literature of fluid mechanics, the rate-of-deformation D is referred to as
rate-of-strain. We may infer that D is rate of some strain tensor but it is not true (see
Jog (2007)). Furthermore, if displacements and displacement gradients are very small
then D is approximately equal to the rate of small strain tensor. However, the rate of
deformation measure is usually applied in the cases where the displacements are not small
(for example fluid flow, metal forming, etc.)2 . Hence, the rate of deformation does not
coincide with the rate of strain in most of the applications. It is important to note that
there is no assumption made about displacements and its gradients in the definition of
rate of deformation. Hence, D is applicable for general flow fields. We now show that
rate-of-deformation is not a time derivative of any strain tensor.
If displacements are also small then the rate of strain approximately equal to D.
Rate of Almansi strain tensor:
The Almansi strain tensor and the Green strain tensor are related by
1
Ẽ = (I − F −T F −1 ) = F −T EF −1
2
Taking the material time derivative, we get
!
DẼ DE
= D − LT Ẽ − ẼL. Since = F T DF
Dt Dt
F = RU = V R,
DE (0)
= RT DR
Dt
DẼ (0)
= D.
Dt
Clearly, if all three principal stretches of stretch tensors are equal then the material time
derivative of spatial logarithmic strain is equal to rate of deformation. The equality of
all principal stretches is very strong condition and it is not applicable in general motion
of fluid. Thus, in general, the rate of logarithmic strain tensors are also not equal to the
rate of deformation.
In conclusion, none of the rate of strains are equal to the rate of deformation. Although
rate-of-deformation is referred as rate-of-strain in fluid mechanics literature, the usage is
misnomer.
Reference
3
Hoger, A., “The material time derivative of logarithmic strain”, International Journal of Solids and
Structures, 1986, 22(9), pp. 1019-1032.
Module-3: Kinematics
Lecture-22: The Reynolds’ Transport Theorem
Here, we discuss the transformation between material and spatial integrals. We also
present the famous Reynolds’ transport theorem which plays a crucial role in the deriva-
tion of differential form of governing equation from the integral form of balance laws.
Transformation of integrals from spatial to material and vice versa is equivalent to
the change of variable in the case of one-variable calculus. Therefore, we first present an
example to understand the change of variable in the integration.
Example 1. Let {e1 , e2 } be orthonormal basis to the two dimensional vector space and
x = (x1 , x2 ) be a position vector. Let g(x) = (−x1 , x2 ) be a given vector field. Let C be a
quarter circle with unit radius in the first quadrant of plane as shown in Fig. (1). Then
show that the line integral Z
g(x) · dx = −1,
C
on given curve of quarter circular arch using direct method and also change of variables.
Consider the starting and ending points of the curve as (0, 1) and (1, 0) (see Fig. (1)).
x2
(0,1)
x1
(0,0) (1,0)
Solution: We first present the direct solution. Since the given curve x21 + x22 = 1, taking
the differentiation, we get
x2 dx2 = −x1 dx1 .
Given vector valued function g(x) = (−x1 , x2 ) with (0, 1) and (1, 0) as starting and ending
points, respectively. Therefore, the integration along the curve
Z Z
g(x) · dx = (−x1 dx1 + x2 dx2 )
C C
Z 1
= −2x1 dx1 (Since x2 dx2 = −x1 dx1 )
0
h i1
= −x21
0
= −1
We now use change of variables to evaluate the integration. Let x1 = cos ξ and x2 = sin ξ.
Then x1 and x2 satisfy the curve x21 + x22 = 1 automatically. The parameter ξ = π/2 and
ξ = 0 represent the starting and ending points of the curve as (x1 , x2 ) = (0, 1) at ξ = π/2
and (x1 , x2 ) = (1, 0) at ξ = 0. Since both x1 and x2 are functions of ξ, we can have
dx1 = − sin ξ and dx2 = cos ξ. Using change of variables, we can evaluate integral along
the curve
Z 0
Z
dx
g(x) · dx = g(x(ξ)) ·
dξ
C π
2
dξ
Z 0
dx1 dx2
= π (−x1 (ξ) + x2 (ξ) ) dξ
2
dξ dξ
Z 0
= π
((cos ξ)(sin ξ) + (sin ξ)(cos ξ)) dξ
2
Z 0
= π
sin(2ξ) dξ
2
" #0
cos(2ξ)
= −
2 π
2
= −1
Clearly, the evaluation of integration by direct method and change of variables are
equivalent. Hence, we observe the following points from this simple example:
These three points need to be accounted in order to transform integration from the refer-
ence domain to spatial domain or vice versa.
In the transformation of the integrals, let Ω0 be a reference configuration and Ω be
a deformed configuration at an instant of time t. Let us also consider x = χ(X, t)
be a transformation from Ω0 to Ω and F (X, t) be the deformation gradient. Then the
transformation of line, surface and volume integrals are stated below.
Transformation of line integrals:
The line integrals are always evaluated along a curve for a given vector field. Let g(x, t)
be a vector field defined over Ω and Ct be a curve in the deformed configuration. Then
the line integral is given by Z
IL = g(x, t) · dx.
Ct
In order to transform the spatial line integral IL , we need to transform spatial vector field
g(x, t), the differential line element dx and the spatial curve Ct into material description.
Let g(x, t) = g̃(X, t) for every x = χ(X, t), i.e.,
Since F (X, t) is the deformation gradient, the transformation of differential line element
can be written as
dx = F (X, t) dX.
Let material curve C in Ω0 be a transformation of spatial curve Ct . Then the transfor-
mation of line integral can be written as
Z Z
g(x, t) · dx = g̃(X, t) · F dX
Ct
ZC
= F T g̃(X, t) · dX. (1)
C
where n is outward unit normal to the surface Γ. Similar to previous case, we can
transform g(x, t) = g̃(X, t) for x = χ(X, t). From our earlier discussion (see Problem-3
in Lecture-19), the transformation to area element is given by
If G(x, t) is a tensor field defined over Ω and G(x, t) = G̃(X, t) for x = χ(X, t) then we
can have the following relation for surface integral of tensor field.
Z Z
G(x, t)n ∂Γ = G̃(X, t)(cof F )n0 ∂Γ0 . (4)
Γ Γ0
We now want to prove an important result called Reynolds’ transport theorem which
plays an important role in balance laws. However, before going to show the general
result, we want to discuss the Leibniz rule of integration which is one dimensional version
of Reynolds’ transport theorem.
Leibniz rule of integration:
Let f (ξ, t) be a function. Let α(t) and β(t) be direct function of time t. Then we have
the following result known as Leibniz rule of integration.
d Z β(t) Z β(t)
∂f dβ dα
f (ξ, t) dξ = dξ + f (β(t), t) − f (α(t), t) (8)
dt α(t) α(t) ∂t dt dt
1 Z α(t) Z β(t)
= lim f (ξ, t + ) dξ + f (ξ, t + ) dξ
→0 α(t)+ dα +o() α(t)
dt
Z β(t)+ dβ +o() Z β(t) !
dt
+ f (ξ, t + ) dξ − f (ξ, t) dξ
β(t) α(t)
dα
1 Z α(t)+ +o()
dt
= lim − f (ξ, t + ) dξ
→0 α(t)
Z β(t)+ dβ +o() Z β(t) !
dt
+ f (ξ, t + ) dξ + f (ξ, t + ) − f (ξ, t) dξ
β(t) α(t)
dβ Z α(t)+ dα +o() !
1 Z β(t)+ dt +o() dt
= lim f (ξ, t + ) dξ − f (ξ, t + ) dξ
→0 β(t) α(t)
!
Z β(t)
f (ξ, t + ) − f (ξ, t)
+ lim dξ
α(t) →0
! !!
1 dβ dα
= lim f (β(t), t + ) + o() − f (α(t), t + ) + o()
→0 dt dt
Z β(t)
∂f
+ dξ
α(t) ∂t
Z β(t)
∂f dβ dα
= dξ + f (β(t), t) − f (α(t), t)
α(t) ∂t dt dt
This completes the proof.
∂f dα dβ
= ξ, = cos t, and = − sin t.
∂t dt dt
Substituting in Leibniz rule, we get
d Z cos t Z cos t
(1 + tξ)dξ = ξ dξ + (1 + t cos t)(− sin t) − (1 + t sin t)(cos t)
dt sin t sin t
" #cos t
ξ2
= − sin t − t cos t sin t − cos t − t sin t cos t
2 sin t
1
= (cos2 t − sin2 t) − 2t sin t cos t − − sin t − cos t
2
1
= cos(2t) − t sin(2t) − sin t − cos t.
2
dα dβ
The solution obtained in both ways is same. In case of Leibniz rule and represent
dt dt
velocity of boundaries. Similarly, we now see that the effect of boundary velocity appears
in the following Reynolds’ transport theorem.
Reynolds’ transport theorem:
Let Ω0 be a reference configuration with Γ0 as its boundary and Ω be a deformed con-
figuration with Γ as its boundary. Let φ(x, t) be a scalar field defined over Ω. Then the
R
material time derivative of integral Ω φ(x, t) ∂Ω is given by
d Z Z
∂φ Z
φ ∂Ω = ∂Ω + φ v · n ∂Γ, (9)
dt Ω Ω ∂t Γ
where n is normal to Γ.
Proof. We note that the deformed configuration Ω changes its shape with time. This is
similar to the earlier integral (i.e., the integral on the left hand side of Eq. (8)) where
the integration limits are function of time. Analogous to earlier integration, the time
derivative cannot be taken inside the spatial integration as Ω changes with time. On
the other hand, the undeformed domain that is independent of time. Therefore, we
follow three steps to evaluate integral: (i) transform the spatial integration to material
integral, (ii) take the total time derivative inside the integration as reference domain (i.e.,
undeformed domain) is independent of time (iii) transform the material integration again
back to spatial integration.
Clearly, the second term on the right hand side of equation represents contribution from
velocity of the boundary.
Let w(x, t) be a vector field. Then, following similar steps as that of previous result,
we can prove the following Reynolds’ transport theorem for the vector field.
d Z Z
∂w Z
w ∂Ω = ∂Ω + w(v · n) ∂Γ. (10)
dt Ω Ω ∂t Γ
Reference
Module-3: Kinematics
Lecture-23: The Vorticity and Circulation
We now introduce vorticity vector and circulation that are important in the study of
fluid mechanics. In addition, we discuss the Cauchy’s vorticity formula and the Kelvin’s
circulation theorem.
Vorticity vector:
Let v be a spatial velocity field defined over the deformed body Ω at an instant of time
t. Then the vorticity vector is defined by
w = ∇x × v. (1)
Since the spin tensor W is a skew-symmetric part of the velocity gradient L, we can write
1
W = (∇x v − ∇x v T ).
2
It is clear from the definition that w/2 is an axial vector of the spin tensor W . Therefore,
the vorticity vector at a point gives twice the rate of rotation or angular velocity. We now
define circulation and also the relation between circulation and vorticity.
Circulation:
Let Ω0 be a reference configuration and Ω be a deformed configuration at an instant of
time t. Let x = χ(X, t) be a mapping from Ω0 to Ω. Let C be a closed curve in Ω0 and
Ct be a corresponding closed curve in Ω. Then the circulation { is defined by
I
{= v · dx. (2)
Ct
Let Γ be a surface that passes through the closed curve Ct . Then, using Stokes’ theorem
(see Lecture-16), we get I Z
v · dx = (∇x × v) · n ∂Γ, (3)
Ct Γ
where n is the outward unit normal to the surface Γ. Thus, the circulation at a point is
directly related to the vorticity vector at that point.
Transport of circulation:
Let C be a closed curve in the reference configuration Ω0 and Ct be a corresponding closed
curve in deformed configuration Ω. Then the material time derivative of circulation is
given by
d I I
Dv
v · dx = · dx. (4)
dt Ct Ct Dt
Proof. Let x = χ(X, t) be a map from Ω0 to Ω. Let F (X, t) be the deformation gradient.
Let ṽ(X, t) = v(x, t) for x = χ(X, t), i.e., v(x, t) = v(χ(X, t), t) = ṽ(X, t). Then
d I d I
v · dx = ṽ · F dX
dt Ct dt C
I
∂ ṽ ∂F
= · F dX + ṽ · dX.
C ∂t ∂t
Let L be a velocity gradient. Then
d I I
∂ ṽ
v · dx = · F dX + ṽ · LF dX
dt Ct C ∂t
I
Dv I
= · dx + v · L dx
Ct Dt Ct
I
Dv I
= · dx + v · dv
Ct Dt Ct
I
Dv I
1
= · dx + d(v.v)
Ct Dt Ct 2
Dv
I I
= · dx since d(v.v) = 0
Ct Dt Ct
The material time derivative of circulation of the transport of circulation (see Eq. (4)),
d{ I Dv
= · dx.
dt Ct Dt
d{ I I
= ∇x φ · dx = dφ = 0.
dt Ct Ct
The Kelvin’s circulation theorem shows that the motion is irrotational at t = 0, i.e.,
∇x × v(x, 0) = 0, then the motion is irrotational forever, i.e., ∇x × v(x, t) = 0, ∀ t ∈
[0, ∞).
ω = J −1 F ω0 , (5)
ω = ∇x × v.
Let ω̃(X, t) be Lagrangian description of vorticity vector, i.e., ω(x, t) = ω(χ(X, t), t) =
ω̃(X, t). Since ω0 is vorticity vector field at time t = 0, we have ω0 = ω̃(X, 0).
Let Ct be a arbitrary closed curve in Ω and C be a corresponding curve in Ω0 . Let Γ
be surface in Ω bounded by Ct and Γ0 be a surface corresponding to Γ in Ω0 . Let n be
a unit normal field to Γ and n0 be a unit normal field to Γ0 . Then consider total time
derivative of circulation,
d I d Z
v · dx = (∇x × v) · n ∂Γ
dt Ct dt Γ
d Z
= ω · n ∂Γ
dt Γ
d Z
= ω̃ · JF −T n0 ∂Γ0
dt Γ0
Z
∂(ω̃ · JF −T n0 )
= ∂Γ0
Γ0 ∂t
Z
∂ ω̃ ∂J −T 0 ∂F −T 0
= · JF −T n0 + ω̃ · F n + ω̃ · J n ∂Γ0
Γ0 ∂t ∂t ∂t
Z
∂ ω̃
= · JF −T n0 + ω̃ · (tr L)JF −T n0 + ω̃ · J(−LT F −T )n0 ∂Γ0
Γ0 ∂t
!
Z
∂ ω̃
= + ω̃(tr L) − Lω̃ · J(−LT F −T )n0 ∂Γ0
Γ0 ∂t
Z
Dω
= + ω(tr L) − Lω · n ∂Γ.
Γ Dt
Applying Stokes’ theorem to Eq. (4), we get
d I I Z
v · dx = a · dx = (∇x × a) · n ∂Γ.
dt Ct Ct Γ
D(JF −1 ω)
= 0.
Dt
Since Ω0 is the configuration at time t = 0, we have F = I, J = 1 and ω = ω0 . We
get following relation by integrating above differential equation and substituting initial
conditions F = I and ω = ω0 .
ω0 = JF −1 ω.
This relation is known as Cauchy’s vorticity formula.
Similar to previous result of Kelvin’s circulation theorem, we can conclude from the
Cauchy’s vorticity formula that the motion is irrotational once then it is irrotational
forever.
References
Classical continuum mechanics assumes the following laws of mechanics and thermody-
namics as axioms.
• Conservation of mass
Since the classical continuum mechanics assumes Newtonian space-time framework (i.e.,
non-relativistic setup), the conservation of mass is an axiom.
Except second law of thermodynamics all others are balance laws. We will see in the
forthcoming discussion that the second law of thermodynamics is an inequality which im-
poses restrictions on constitutive relations. These balance laws are applied to continuum
in order to obtain governing equations in integral and differential forms. We now define
three types of regions of continuum body called reference volume, material volume and
control volume to apply balance laws.
Let B0 be the reference configuration and B be the deformed configuration at an
instant of time t. Let x = χ(X, t) be a map from B0 to B that represent the motion.
Let Ω0 be a part of B0 and its associated part in B be Ω, i.e., for every X ∈ Ω0 there is
a unique x ∈ Ω.
• Material volume Ω convects with body. In this case the material does not flow into
and out of Ω during motion, i.e., for every x ∈ Ω there is a X ∈ Ω0 and vise versa.
• Control volume Ωc is usually a fixed region1 in space that lie in the deformed body
B for some non-zero time interval. In general material flows into and/or out of a
control volume.
1
In general the control volume Ωc need not be fixed in space and it can have the motion and change
its shape with time. The control volume Ωc allows the flow of material into and out of control surface
whereas the material volume Ω does not allow, i.e., Ω consists of fixed set of particles with time.
The material volume Ω0 , the spatial volume Ω and the control volume Ωc are depicted in
Fig. (1).
Figure 1: Material volume (Ω0 ), spatial volume (Ω) and control volume (Ωc )
d Z Z
∂φ Z
φ ∂Ω = ∂Ω + φ(v · n) ∂Γ,
dt Ω Ω ∂t Γ
where Γ is the bounding surface of Ω and n is the outward normal of Γ. Similarly, RTT
for vector field w can be written as
d Z Z
∂w Z
w ∂Ω = ∂Ω + w(v · n) ∂Γ,
dt Ω Ω ∂t Γ
We refer the Reynolds’ transport theorem as first transport theorem, since another
convenient form of transport theorem is stated in fourth coming lecture in order to apply
directly to the conservation of momenta and energy.
Lagrangian framework:
Let ρ(x, t) be a density in the deformed configuration B. Let ρ̃(X, t) be a Lagrangian
description of ρ(x, t), i.e., ρ(x, t) = ρ(χ(X, t), t) = ρ̃(X, t). We denote the density field
of reference configuration by ρ0 (X), i.e., ρ0 (X) = ρ̃(X, 0).
The axiom of mass conservation states that the material volume Ω0 evolves to Ω with
time t but the mass remains constant (assuming that there are no mass sources within
Ω), i.e., Z Z
ρ0 (X) ∂Ω0 = ρ(x, t) ∂Ω. (1)
Ω0 Ω
d Z
ρ ∂Ω = 0. (3)
dt Ω
Let Γ be the boundary surface of Ω. Then, using Reynolds’ transport theorem, we get
Z
∂ρ Z
∂Ω + ρ(v · n) ∂Γ = 0, (4)
Ω ∂t Γ
The medium is said to be incompressible if all possible deformations are isochoric. The
isochoric constraint implies J = 1 (i.e., det(F ) = 1). Taking material time derivative of
J (see Eq. (20) in Lecture-20), we get
∇x · v = 0.
From Eq.(7), the mass conservation in Eulerian framework for incompressible medium
Dρ
is given by = 0, while in the Lagrangian framework (see Eq. (2)), we get the
Dt
condition ρ(x, t) = ρ0 (X). Thus, the density at any material point remains the same in
an incompressible material. The density of deformed configuration is same at every point
if the density of the reference configuration is same at every point.
It is clear that the rate of change of mass in control volume Ωc is balanced by the mass
flux into and out of the surface Γc .
Since Ωc is fixed in space, Eq. (8) can also be written as
d Z Z
ρ ∂Ω + ρ(v · n) ∂Γ = 0. (9)
dt Ωc Γc
Reference
The linear and angular momenta balance are known as laws of dynamics. These laws are
postulated to the continuous medium by Euler (C.E. 1707 - 1783). Therefore, these laws
of dynamics for massive (continuum) bodies are also known as Euler laws of motion. We
require the Cauchy’s hypothesis for the surface forces and second transport theorem in
order to obtain the differential form of equation of momenta balance. Therefore, in this
discussion, we introduce second transport theorem and the Cauchy’s hypothesis.
The second transport theorem:
Let B0 be the reference configuration and B be the deformed configuration at an instant
of time t. Let x = χ(X, t) be a map from B0 to B that represent the motion. Let Ω0
be a part of B0 and Ω be a convection1 of Ω0 with B, i.e., Ω is part of B corresponding
to Ω0 .
Let φ(x, t) be a scalar field defined over B and φ̃(X, t) = φ(x, t) for every x ∈ B.
Let ρ(x, t) be a density defined over B and ρ̃(X, t) be a material description of density,
i.e., ρ(x, t) = ρ(χ(X, t), t) = ρ̃(X, t). Let ρ0 (X) = ρ̃(X, 0) be a density of reference
configuration B0 . Then the time derivative of integral Ω ρφ ∂Ω is given by
R
!
d Z Z
Dφ Z
∂φ
ρφ ∂Ω = ρ ∂Ω = ρ + v · (∇x φ) ∂Ω. (1)
dt Ω Ω Dt Ω ∂t
We know that the density of reference configuration is not a function of time. Hence, we
have
d Z Z
∂ φ̃
ρφ ∂Ω = ρ0 ∂Ω0
dt Ω Ω0 ∂t
Z
∂ φ̃
= ρ̃ J ∂Ω0
Ω0 ∂t
Z
Dφ
= ρ ∂Ω
Ω Dt
The material time derivative of scalar field can be expressed as (see Lecture-20)
Dφ ∂φ
= + v · ∇x φ,
Dt ∂t
Therefore, we get
!
d Z Z
Dφ Z
∂φ
ρφ ∂Ω = ρ ∂Ω = ρ + v · (∇x φ) ∂Ω.
dt Ω Ω Dt Ω ∂t
Let w(x, t) be a vector field defined over B. Then the following second transport
theorem for vector field can be proved using the similar steps as that of previous result.
!
d Z Z
Dw Z
∂w
ρw ∂Ω = ρ ∂Ω = ρ + (∇x w)v ∂Ω. (2)
dt Ω Ω Dt Ω ∂t
In classical continuum mechanics, we assume that the moment is defined as the moment
of a force. Therefore, we require the definition of force for further study. The detailed
axiomatic set up for forces can be found in a book by Truesdell (1977)2 . Here, we present
a brief introduction to forces.
The forces on continuum bodies:
In classical continuum mechanics, the forces can be classified into two types: (i) body
forces (ii) surface forces.
1. Body forces
The body force acts on every particle. The source of these body forces usually lies
outside the body. The examples for these forces are the gravitational forces and
electromagnetic forces. Let ρ(x, t) be a density of the medium and b(x, t) be a
body force field per unit mass then the body force over the domain Ω is given by
R
Ω ρb ∂Ω.
2. Surface forces
The forces that are acting on the boundary of region are known as surfaces forces.
These forces are usually exerted by one body onto another through the contact. We
assume that these forces are related to the contact surface. Unlike body forces, the
surface forces are independent of mass of the body.
In any case the surface force are dependent on the surface that is in contact. We make
the following assumption on surface forces known as Cauchy’s hypothesis.
The Cauchy’s hypothesis:
The distribution of the force over the surface area is known as traction field t. In other
words, the force per unit area on a given surface is known as traction field. The Cauchy’s
hypothesis states that the traction vector field t on the surface Γ in B depends only on
the unit normal to the surface at a point. In other words, the traction field is a function
of position, time and outward normal to the surface, i.e., t(x, t, n) where n is outward
unit normal.
all gradients of surface then it is equivalent to knowing the full surface. Among all the
gradients, the first gradient ∇x ψ is known as normal and the second gradient ∇x (∇x ψ) is
known as curvature. Although all gradients contain the information of surface, we assume
that the traction t at a point depends on surface only through the unit normal and not
on higher gradients such as curvature. This assumption is known as Cauchy’s hypothesis
which is depicted in Fig. (1). It can be observed in Fig. (1) that there are three different
surfaces having common normal n but having different curvatures passing through a point
x have the same traction vector.
Let y be a position vector in the cylinder C . Then using linear momentum balance to
If cylinder approaches the disc, i.e., tends to zero, then the volume integral goes to zero
as ρa and ρb are bounded. Furthermore, the surface integral remains finite. Therefore,
we can write Z
lim t(y, t, n) ∂Γ = 0.
→0 ΓC
Let Γ is curved surface, Γ+ is flat surface with n as outward normal and Γ− is flat surface
with −n as outward normal to the cylinder. Then ΓC = Γ ∪ Γ+ ∪ Γ− . Let D be a disc
passing through x in the cylinder with same radius as that of cylinder (see Fig. (2)).
Then Γ vanishes and Γ+ and Γ− approaches D as → 0. Therefore, above integral can
be written as Z
t(y, t, n) + t(y, t, −n) ∂Γ = 0.
D
Since the radius of cylinder is arbitrary, the continuity of integrand implies that the
integrand must vanish at x, i.e.,
t(x, t, n) + t(x, t, −n) = 0,
which proves the Cauchy’s lemma.
This result is equivalent to Newton’s law of action and reaction as shown in Fig. (3).
References
The concept of stress is central to the continuum mechanics. In this lecture, we show the
existence of stress tensor. We also derive the differential form of linear momentum balance
which is known as Cauchy’s equation of motion-I (first Cauchy’s equation of motion).
The Cauchy’s theorem and the stress tensor:
Let the traction field t(x, t, n) be a continuous function of x at any instant of time t for a
given unit vector n. Let the density ρ(x, t), acceleration a(x, t) and body force field per
unit mass b(x, t) be bounded for every x in B at any time t. Then the linear momentum
balance implies the existence of stress tensor τ such that
Proof. Let {e1 , e2 , e3 } be a set of orthonormal basis to the 3-D Cartesian space. Consider
a tetrahedron shown in Fig. (1) with three faces lying on Cartesian coordinate planes pass
through O and an inclined plane pass through A, B and C. Let x be a position vector
of O. Let Γn , Γ1 , Γ2 and Γ3 be four bounding surfaces ∆ABC, ∆OCB, ∆OAC ∆OBA,
respectively. Outward normals to the surfaces ∆ABC, ∆OCB, ∆OAC and ∆OBA are
n, −e1 , −e2 and −e3 , respectively.
Applying the surface integral to four faces of tetrahedron (see Fig. 1), we can write
Z Z Z Z Z
ρ(a − b) ∂Ω = t(y, t, n) ∂Γ + t(y, t, n) ∂Γ + t(y, t, n) ∂Γ + t(y, t, n) ∂Γ.
Ω Γn Γ1 Γ2 Γ3
Let AS be surface area of Γn (i.e., ∆ABC) and h be the distance from O to Γn . Let n1 ,
n2 and n3 be components of normal n along e1 , e2 and e3 , respectively. Then the surface
area of Γ1 , Γ2 and Γ3 (i.e., area of ∆OCB, ∆OAC and ∆OBA) are given by n1 AS , n2 AS
and n3 AS , respectively. If h is small then, using the mean value theorem1 , we get
1
ρ(a − b)|x∗∗ As h = (t(x∗ , t, n) + t(x∗i , t, ei )ni ) As
3
where x∗∗ is centroid of tetrahedron and x∗ is centroid of Γn and x∗i is centroid of Γi .
Let h tends to zero such that the normal of Γn continues to be n. Then the continuity of
traction t and boundedness of ρ(a − b) implies
Following similar steps one can show Eq. (2) for the tetrahedron lying in other octants
of 3-D Cartesian space. We now define the Cauchy stress tensor τ by
Equations (3) and (2) imply τ (x, t)n = t(x, t, n). We note that one should use the tensor
product definition given in Eq. (16) of Lecture-4 to see the equivalence between τ (x, t)n
and t. Thus, there is a spatial tensor field known as Cauchy stress tensor.
Problem 1. Show that linear momentum balance along with Cauchy’s hypothesis imply
the following governing differential equation.
Dv
ρ = ∇x · τ + ρb,
Dt
where n is normal to the boundary Γ and τ is Cauchy stress tensor. Since b(x, t) is body
force, v(x, t) is velocity, ρ(x, t) is density and t(x, t) is traction, we can write the linear
momentum balance for material volume Ω as
d Z Z Z
ρv ∂Ω = ρb ∂Ω + t ∂Γ.
dt Ω Ω Γ
Substituting stress tensor and applying second transport theorem to the above equation,
we get Z
Dv Z Z
ρ ∂Ω = ρb ∂Ω + τ n ∂Γ.
Ω Dt Ω Γ
Applying the divergence theorem to the surface integral and grouping terms, yields
Z
Dv
ρ − ∇x · τ − ρb ∂Ω = 0.
Ω Dt
Since the material volume Ω is arbitrary, application of the localization theorem give rise
to the first Cauchy’s equation of motion shown in Eq. (4). We now show that the Cauchy
stress and differential form of equation of motion (first Cauchy’s equation of motion)
also imply the linear momentum balance. Considering the linear momentum of material
volume Ω, we obtain
d Z Z
Dv
ρv ∂Ω = ρ ∂Ω (Using transport theorem-II)
dt Ω ZΩ Dt
= (∇x · τ + ρb) ∂Ω (Using first Cauchy’s equation of motion)
ZΩ Z
= τ n ∂Γ + ρb ∂Ω (Using divergence theorem)
ZΓ Z Ω
= t ∂Γ + ρb ∂Ω (Since τ n = t).
Γ Ω
Thus, the linear momentum balance along with Cauchy’s hypothesis imply the existence
of Cauchy stress tensor. Conversely, the Cauchy stress tensor with first Cauchy’s equation
of motion imply the linear momentum balance.
ti = τij nj ,
!
∂vi ∂vi ∂τij
ρ + vj = + ρbi .
∂t ∂xj ∂xj
Applying divergence theorem to the second term on the right hand side, we get
!
Z
∂(ρv) ∂ρ Z Z
− v + (∇x v)ρv ∂Ω = ρb ∂Ω + τ n ∂Γ,
Ωc ∂t ∂t Ωc Γc
Applying the divergence theorem to the second term on the left hand side of equation,
we get the following linear momentum balance for the control volume.
Z
∂(ρv) Z Z Z
∂Ω + ρv(v · n) ∂Γ = ρb ∂Ω + t ∂Γ. (5)
Ωc ∂t Γc Ωc Γc
Since Ωc is fixed in space, we can also write the linear momentum balance as
d Z Z Z Z
ρv ∂Ω + ρv(v · n) ∂Γ = ρb ∂Ω + t ∂Γ. (6)
dt Ωc Γc Ωc Γc
The term Γc ρv(v · n) ∂Γ represents the rate of change of linear momentum due to the
R
We now show the implication of angular momentum balance on stress tensor. Let B0
be a reference configuration and B be a deformed configuration at an instant of time t.
Let Ω0 be a part of B0 and Ω be a corresponding material volume in B. Let x0 be a
fixed point in the space. Then the principle of angular momentum balance states that the
rate of change of angular momentum of material volume Ω is equal to the net moment of
forces acting on Ω with respect to x0 , i.e.,
d Z Z Z
ρ(x − x0 ) × v ∂Ω = ρ(x − x0 ) × b ∂Ω + (x − x0 ) × t ∂Γ, (1)
dt Ω Ω Γ
Let u be a constant arbitrary vector. Taking dot product on both sides of equation, we
obtain
Z Z
x × (∇x · τ ) ∂Ω · u = x × (τ n) ∂Γ · u
Ω
Z Γ
= (x × (τ n)) · u ∂Γ.
Γ
Using a property of scalar triple product from Lecture-3, we can have [x, τ n, u] =
[u, x, τ n] = τ T (u × x) · n. Substituting this relation in above equation, yields
Z Z
x × (∇x · τ ) ∂Ω · u = τ T (u × x) · n ∂Γ.
Ω Γ
Cancelling the second term in right hand side, above relation reduces to
Z
τ : ∇x (u × x) ∂Ω = 0
Ω
(x × τ n) · u = (τ n) · (u × x) = τ T (u × x) · n,
we can write
Z Z
(x × τ n) · u ∂Γ = τ T (u × x) · n ∂Γ
Γc
ZΓc
= ∇x · τ T (u × x) ∂Ω
ZΩc
= (τ : ∇x (u × x) + (u × x) · (∇x · τ )) ∂Ω
Ωc
Let t be a traction vector and v be a velocity field. Then the linear momentum balance
implies
Dv
Z Z
x × t ∂Γ = x× ρ − ρb ∂Ω
Γc Ωc Dt
Z
Dv Z
= ρx × ∂Ω − ρx × b ∂Ω.
Ωc Dt Ωc
Using divergence theorem to the second term on right hand side of equation, we get
Z Z Z
∂ Z
x × t ∂Γ + ρx × b ∂Ω = (ρx × v) ∂Ω + (ρ(x × v) ⊗ v) n ∂Γ
Γc Ωc Ωc ∂t Γc
We obtain the following form of the conservation of angular momentum for control volume
using the definition of tensor product between two vectors.
Z
∂ Z Z Z
(ρx × v) ∂Ω + ρ(x × v)(v · n) ∂Γ = x × t ∂Γ + ρx × b ∂Ω. (4)
Ωc ∂t Γc Γc Ωc
Since Ωc is fixed region in space, the conservation of angular momentum for control volume
can also be written in the following form.
d Z Z Z Z
ρx × v ∂Ω + ρ(x × v)(v · n) ∂Γ = x × t ∂Γ + ρx × b ∂Ω. (5)
dt Ωc Γc Γc Ωc
2. At any point in continuum body at least a set of three orthogonal planes exist such
that the traction vector is collinear with the normal n, i.e., t = λn.
Proof follows from the symmetry of stress tensor and the relation between traction
and stress tensor, i.e., t = τ n.
3. The plane where unit normal n and traction t are collinear is called principal plane
and the direction n is called principal direction. Since t = λn, the scalar λ is called
principal stress. We note that all principal stresses are real. The principal directions
corresponding to distinct principal stresses are mutually orthogonal.
Proof follows from the symmetry of stress tensor (see Lecture-10).
4. The principal directions are the directions corresponding to extremum normal stress.
Tn = n · τ n. (6)
It is easy to see that the extremum of normal stress n · τ n is given by the Lagrange
multiplier as λ = n · τ n. Thus, extremum of the quantity is equal to eigenvalue or
principal stress and the extremum directions are eigenvectors or principal directions.
5. The angle between maximum shear stress plane and maximum normal stress is 45◦
and the angle between maximum shear stress plane and minimum normal stress is
also 45◦ . In other words, let λ1 , λ2 and λ3 be principal stresses such that λ1 ≥ λ2 ≥
Similar to previous proof, the condition for extremization of shear stress can be
stated as
∂Ts2 ∂(1 − ñ21 − ñ22 − ñ23 )
+γ = 0.
∂ ñk ∂ ñk
Taking partial derivative, we get
(λ1 − λ2 )2 ñ22 + (λ3 − λ1 )2 ñ23 − γ ñ1 = 0
(λ1 − λ2 )2 ñ21 + (λ2 − λ3 )2 ñ23 − γ ñ2 = 0
(λ1 − λ3 )2 ñ21 + (λ2 − λ3 )2 ñ22 − γ ñ3 = 0.
In order to get maximum shear stress, we need to solve above equations along with
constraint equation ñ21 + ñ22 + ñ23 = 1. It is easy to verify that the following sets are
solutions to above equations (Chandrasekharaiah and Debnath, 1994):
(i) ñ1 = ±1, ñ2 = 0, ñ3 = 0.
(ii) ñ1 = 0, ñ2 = ±1, ñ3 = 0.
(iii) ñ1 = 0, ñ2 = 0, ñ3 = ±1
(iv) ñ1 = ñ2 = ± √12 , ñ3 = 0.
(v) ñ1 = ñ3 = ± √12 , ñ2 = 0.
Among all sets of solutions the maximum shear stress planes are given by solution
set (v), i.e., ñ1 = ñ3 = ± √12 , ñ2 = 0. Furthermore, the maximum shear stress is
given by
1
Ts = (λ1 − λ3 ).
2
References
It is convenient to work with Lagrangian description for solid mechanics as usually ref-
erence configuration is known. Therefore, we present the transformation of differential
form of linear momentum balance from spatial description to material description. The
new stress measure called first Piola-Kirchhoff stress is defined, in this lecture, which can
take care of transformation of Cauchy stress. To retain the symmetry, we define another
stress measure called second Piola-Kirchhoff stress which requires the concept of power
conjugate and it is discussed. We now start with transformation of linear momentum
balance from spatial description to material description.
The linear momentum balance and the first Piola-Kirchhoff stress tensor:
Let B0 be a reference configuration and B be a deformed configuration at an instant
of time t. Let x = χ(X, t) be a map from B0 to B. Let ρ(x, t), v(x, t), b(x, t) and
τ (x, t) be density, velocity, body force per unit mass and Cauchy stress fields over B,
respectively. Then recall from Lectures 26 and 27 that the equations of motion can be
written as
Dv
ρ = ∇x · τ + ρb ∀x ∈ B, (1)
Dt
and
τ = τT. (2)
In solid mechanics, the goal of the problem is to find the deformation of the body and
the state of stress under the action of applied forces. However, it can be noted that the
governing equations (Cauchy’s equations of motion) consists of the variable x which itself
is unknown. Thus, we transform the equations into the reference configuration B0 . It is
clear from above equations that the following terms need to be transformed in order to
achieve the target.
Let F be a deformation gradient. Then we define the first Piola-Kirchhoff stress tensor
T (X, t) as Piola transformation1 of Cauchy stress, i.e.,
T = Jτ F −T = τ cof F , (3)
where J is determinant of F .
Using Piola identity, i.e., ∇X · cof F = 0 (see Problem-1 in Lecture-19), we can show
that
∇X · T = J∇x · τ , ∀x = χ(X, t). (4)
The proof is presented in Problem-2 of Lecture-19.
Let Ω0 be a part of B0 and Ω be corresponding material volume in B. Let Γ0 and Γ
be boundaries of Ω0 and Ω, respectively. Let n0 and n be field of unit normals to Γ0 and
Γ, respectively. Then, using Eq. (4) and ∂Ω = J∂Ω0 , we get
Z Z
∇x · τ ∂Ω = ∇X · T ∂Ω0
Ω Ω0
Since the traction field in spatial description is given by t = τ n, using first Piola-Kirchhoff
stress we can have traction vector in material description
t0 (X, t, n0 ) = T n0 . (5)
The relation ρ0 (X) = Jρ(x, t), where ρ0 (X) is the density field of reference configuration
B0 , follow from conservation of mass (see Eq (2) in Lecture-24).
T T = F −1 T F T . (8)
1
See the Piola transformation in Lecture-19
Clearly, the first Piola-Kirchhoff stress is not a symmetric tensor. In summary, the dif-
ferential form of equations of motion in the reference configuration can be stated as
∂ 2χ
ρ = ∇X · T + ρ0 b0
∂t2
FTT = TFT
T n0 = t0 .
Now the goal is to find the stress measure in Lagrangian description such that it is
symmetric.
Mechanical energy balance and the second Piola-Kirchhoff stress tensor:
Let v̄ be a velocity of particle and f be a force exerted on the particle. Then, in classical
mechanics, the power expended on particle f · v̄ is balanced by the rate of change of kinetic
energy of particle. We can have the counterpart of this notion in continuum mechanics
and it can be obtained from the equations of motion.
Taking dot product with velocity field v on both sides of equation of motion, i.e., Eq.
(1), we get
Dv
ρv · = v · (∇x · τ ) + ρv · b.
Dt
Let L be a velocity gradient, i.e., L = ∇x v, and D be a rate of deformation i.e., D =
(∇x v + ∇x v T )/2. Then the above equation can be written as
D v·v
ρ = v · (∇x · τ ) + ρv · b
Dt 2
= ∇x · (τ T v) − τ : ∇x v + ρv · b
= ∇x · (τ T v) − τ : L + ρv · b
= ∇x · (τ T v) − τ : D + ρv · b (since τ is symmetric).
Let Ω be a material volume in B. Then taking the integration of above equation over Ω,
we get following mechanical energy balance
D v·v
Z Z Z Z
T
ρ ∂Ω = ∇x · (τ v) ∂Ω − τ : D ∂Ω + ρv · b ∂Ω.
Ω Dt 2 Ω Ω Ω
v · t ∂Γ +
ρv · b ∂Ω represents the net power
R R
Clearly, right hand side of equation Γ Ω
d R v·v
expended by the external forces on the material volume Ω and the term ρ ∂Ω
dt Ω 2
R
represents the rate of change of kinetic energy of material volume. The term Ω τ : D ∂Ω
is known as stress power or internal power.
Thus, the external power expended by the force field on material volume is balanced
by the sum of internal power and rate of change of kinetic energy.
R
Since Ω τ : D ∂Ω is stress power (internal power), the Cauchy stress τ is said to
be power conjugate to the rate of deformation tensor D. We now examine the power
conjugate to first Piola-Kirchhoff stress by transforming the stress power to material
description.
Let Ω0 be a volume in reference configuration corresponding to the material volume
Ω. Let T be first Piola-Kirchhoff stress. Let ρ0 be density of reference configuration. Let
b0 be body force per unit mass in material description in and t0 be first Piola-Kirchhoff
traction vector. Then using Eq. (9), we get
!
d Z 1 ∂χ ∂χ Z Z
∂χ 0 Z
∂χ 0
ρ0 · ∂Ω0 + T F T : L ∂Ω0 = · t ∂Γ0 + ρ0 · b ∂Ω0 .
dt Ω0 2 ∂t ∂t Ω0 Γ0 ∂t Ω0 ∂t
DF
Substituting the relation, = LF , we obtain the following mechanical energy balance
Dt
in Lagrangian framework.
!
d Z 1 ∂χ ∂χ Z
DF Z
∂χ 0 Z
∂χ 0
ρ0 · ∂Ω0 + T : ∂Ω0 = · t ∂Γ0 + ρ0 · b ∂Ω0 . (10)
dt Ω0 2 ∂t ∂t Ω0 Dt Γ0 ∂t Ω0 ∂t
DF
The term T : represents the stress power per unit mass. Thus, the material time
Dt
DF
derivative of deformation gradient is power conjugate to the first Piola-Kirchhoff
Dt
stress.
We now want to find the stress measure that is power conjugate to the rate of Green
strain tensor. Let E be the Green strain tensor. Then
1
τ :D = τ : (L + LT )
2
1 DF −1 DF T
= τ :( F + F −T )
2 Dt Dt
1 DF DF T
= τ : F −T (F T + F )F −1
2 Dt Dt
1 −1 DF DF T
= (F τ F −T ) : (F T + F)
2 Dt Dt
1 −1 DC
= (F τ F −T ) : (Since the left Cauchy-Green strain C = F T F )
2 Dt
1 −1 −1 DC
= J (F T ) :
2 Dt
DE 1
= J −1 (F −1 T ) : Since E = (C − I) .
Dt 2
S = F −1 T . (11)
Substituting for first Piola-Kirchhoff stress in terms of Cauchy stress (i.e., Eq. (3)), we
get the following relation between second Piola-Kirchhoff stress and Cauchy stress.
S = JF −1 τ F −T . (12)
Upon substituting Eq. (11) in Eq. (10), we get the following mechanical energy balance
in terms of second Piola-Kirchhoff stress.
!
d Z 1 ∂χ ∂χ Z
DE Z
∂χ 0 Z
∂χ 0
ρ0 · ∂Ω0 + S: ∂Ω0 = · t ∂Γ0 + ρ0 · b ∂Ω0 .
dt Ω0 2 ∂t ∂t Ω0 Dt Γ0 ∂t Ω0 ∂t
Clearly, the second Piola-Kirchhoff stress tensor S is power conjugate to rate of Green
strain tensor. The second Piola-Kirchhoff stress tensor is symmetric stress measure in
material description unlike the first Piola-Kirchhoff stress tensor. The constitutive equa-
tions also take simple form due to symmetric nature of this stress measure. Thus, the
equations of motion in reference configuration using the second Piola-Kirchhoff stress can
be written as
∂ 2χ
ρ = ∇X · (F S) + ρ0 b0 , (13)
∂t2
S T = S, (14)
0 0
F Sn = t. (15)
Reference
The energy conservation is known as the first law of thermodynamics which is one of
the basic axioms of continuum mechanics. This brings the effect of heat energy into the
continuum mechanics.
First law of thermodynamics: energy conservation:
Every continuum body always interact with its surroundings and in the process of interac-
tion there can be heat and work exchange between continuum body and its surroundings.
The interaction of heat and work is governed by the first law of thermodynamics also
known as law of conservation of energy. In previous lecture, we discussed the mechanical
energy balance of continuum body by ignoring the heat energy interaction with its sur-
roundings. We now account the heat interaction along with mechanical energy (or work)
in the first law of thermodynamics.
Rate of work transfer:
Let Ω be a material volume in deformed configuration B. Let ρ(x, t), b(x, t) and v be
density, body force per unit mass and velocity fields over Ω, respectively. Let Γ be the
boundary of Ω and n be its normal field. Let t(x, t, n) be a traction on Γ. Then the
kinetic energy of Ω is defined by
1Z
K(Ω) = ρv · v ∂Ω, (1)
2 Ω
and the power expended by external forces on Ω is defined by
Z Z
P (Ω) = ρb · v ∂Ω + t · v ∂Γ. (2)
Ω Γ
dW
We now define the net rate of work transfer as the delivery of expanded power to the
dt
continuum body without spending on the motion, i.e.,
dW dK
= P (Ω) − . (3)
dt dt
Internal energy:
The energy of continuum body excluding the kinetic energy is known as internal energy
which is denoted by U . Let e(x, t) be a specific internal energy (i.e., internal energy per
unit mass). Then the internal energy of material volume is given by
Z
U= ρe ∂Ω. (4)
Ω
Substituting Eqs. (3), (6) and Eq. (4) in Eq. (5), we get the following integral form of
first law of thermodynamics:
d Z Z Z
1d Z Z Z
ρe ∂Ω = ρb · v ∂Ω + t · v ∂Γ − ρv · v ∂Ω + ρQh ∂Ω − q · n ∂Γ. (7)
dt Ω Ω Γ 2 dt Ω Ω Γ
Let us consider the sum of internal energy and kinetic energy as total energy of Ω. Then
the rate of work done on the body plus rate of heat supplied to the body equal to rate of
change of total energy of the body. We now derive the differential form of conservation of
energy based on the statement of first law of thermodynamics, i.e., the energy expended
(work + heat) by the environment on the body is equal to the energy stored in the body.
Applying the second transport theorem to above equation (i.e., Eq. (8)), we get
D v·v
Z Z Z Z Z
ρ + e ∂Ω = ρb · v ∂Ω + t · v ∂Γ + ρQh ∂Ω − q · n ∂Γ. (9)
Ω Dt 2 Ω Γ Ω Γ
Let τ (x, t) be a Cauchy stress tensor field. Then using divergence theorem, we can have
Z Z
q · n ∂Γ = ∇x · q ∂Ω,
ZΓ ZΩ Z Z
T
t · v ∂Γ = (τ n) · v ∂Γ = τ v · n ∂Γ = ∇x · τ T v ∂Ω.
Γ Γ Γ Ω
D v·v
Z Z Z Z Z
ρ + e ∂Ω = ρb · v ∂Ω + ∇x · τ T v ∂Ω + ρQh ∂Ω − ∇x · q ∂Ω.
Ω Dt 2 Ω Ω Ω Ω
(∇x v + ∇x v T )
Since the Cauchy stress is symmetric, we have τ : ∇x v = τ : .
2
Dv
Substituting the linear momentum balance ρ = ∇x · τ + ρb and the relation τ :
Dt
T
∇x v = τ : (∇x v + ∇x v )/2, in above equation, we get the following differential form of
first law of thermodynamics.
De
ρ = τ : D − ∇x · q + ρQh , (10)
Dt
where D is the rate of deformation, i.e., D = (∇x v + ∇x v T )/2.
Conservation of energy in reference configuration:
Let Ω0 be part of reference configuration and Ω be its corresponding material volume.
Let Γ0 be boundary of Ω0 . Let ρ0 be density of reference configuration. Let e0 (X, t) and
Q0h (X, t) be Lagrangian description of specific internal energy and heat generation per unit
mass, i.e., e(x, t) = e(χ(X, t), t) = e0 (X, t) and Qh (x, t) = Qh (χ(X, t), t) = Q0h (X, t).
Using Piola transform (see Lecture-19), we get
Dv
Upon substituting the linear momentum balance, i.e., ∇x · τ = ρ − ρb and applying
Dt
divergence theorem, we obtain
De Dv
Z Z Z Z Z
ρ ∂Ω = τ T v · n ∂Γ − ρ − ρb · v ∂Ω − q · n ∂Γ + ρQh ∂Ω,
Ωc Dt Γc Ωc Dt Γc Ωc
where n is unit normal field to Γc . We can have the following form by rearranging terms.
D v·v
Z Z Z Z Z
ρ + e ∂Ω = (τ n) · v ∂Γ + ρb · v ∂Ω − q · n ∂Γ + ρQh ∂Ω.
Ωc Dt 2 Γc Ωc Γc Ωc
D v·v
Z Z Z Z Z
ρ + e ∂Ω = t · v ∂Γ + ρb · v ∂Ω − q · n ∂Γ + ρQh ∂Ω.
Ωc Dt 2 Γc Ωc Γc Ωc
Using the divergence theorem, we have the following energy balance for control volume:
∂ v·v v·v
Z
Z
ρ +e ∂Ω + ρ + e (v · n) ∂Γ
Ωc ∂t
Z
2 Z Γc
Z
2 Z
= t · v ∂Γ + ρb · v ∂Ω − q · n ∂Γ + ρQh ∂Ω.
Γc Ωc Γc Ωc
Reference
The second law of thermodynamics is another basic axiom of continuum mechanics. Un-
like other basic axioms, the second law is not a conservation law. However, this brings
new information on the direction of heat transfer and also imposes some restrictions on
constitutive relations.
The second law and the entropy:
The first law of thermodynamics states that the energy is conserved and does not put
any restriction on the direction of the process. For example, consider a hot body in
contact with a cold body. Naturally, the hot body releases heat energy and cold body
gains same amount of heat to get thermal equilibrium. The reverse process, i.e., hot
body becoming hotter and cold body becoming colder by exchanging equal amount of
heat energy, is not possible but does not violate the first law. Thus, the second law of
thermodynamics is stated to account the direction of the thermodynamic process. The
second law has an interesting history1 and it has many equivalent statements. Here, we
adopt a form of the second law of thermodynamics called the Clausius-Duhem inequality.
A new thermodynamic quantity called entropy is introduced to account the directionality
of thermodynamic processes.
The entropy physically represents a measure of the disorderness of the system. We know
from basic thermodynamics that any system can possess internal entropy. In addition,
the system can also exchange the entropy with its surroundings through the interaction
of energy and/or mass. Thus, the entropy production of the system can be defined by the
difference between the internal entropy and the entropy influx. The second law asserts that
the entropy production universe (system + surroundings) is nonnegative. Considering
the material volume as system, we can apply the second law of thermodynamics to the
continuous medium.
Mathematical statement of entropy imbalance:
Let Ω be a material volume of continuum body. Let S be net internal entropy of the
material volume Ω. Let J be the rate of entropy flow to the material volume from its
surroundings. Let H be the rate of entropy production. Then
dS
H= −J. (1)
dt
The second law of thermodynamics states that
H ≥ 0. (2)
1
D. Kondepudi and I. Prigogine, “Modern thermodynamics: From heat engines to dissipative struc-
tures”, 1998, John Wiley & Sons.
The non-negativity of entropy production is also known as entropy imbalance. The inte-
gral form of entropy imbalance can be stated by introducing field variables.
where Γ is boundary of material volume Ω and n is unit normal field to Γ. The negative
sign appears for the first term on the right hand side of equation as f is outflux (not
influx).
Let η(x, t) be specific entropy, i.e., entropy per unit mass. Then the net internal
entropy of material volume is defined by
Z
S = ρη ∂Ω. (4)
Ω
d Z Z Z
ρη ∂Ω ≥ − f · n ∂Γ + ς ∂Ω. (5)
dt Ω Γ Ω
where e(x, t) is specific internal energy. Taking material time derivative, we obtain
Dψ De Dη Dθ
= − θ−η . (11)
Dt Dt Dt Dt
De De
Substituting from the fist law of thermodynamics, i.e., ρ = τ : D − ∇x · q + ρQh ,
Dt Dt
we get
Dψ 1 1 Dη Dθ
= τ : D − ∇x · q + Qh − θ−η . (12)
Dt ρ ρ Dt Dt
Rearranging terms, we obtain
Dη 1 Dψ 1 Dθ
− θ + Qh − ∇x · q = − τ :D+η .
Dt ρ Dt ρ Dt
Substituting this relation in Eq. (9), we get the following Clausius-Duhem inequality in
terms of free-energy.
Dψ Dθ 1 q · ∇x θ
+η − τ :D+ ≤ 0. (13)
Dt Dt ρ ρθ
We now define the dissipation δ per unit volume by
!
Dψ Dθ 1
δ =τ :D−ρ +η − q · ∇x θ. (14)
Dt Dt θ
Thus, the dissipation δ ≥ 0.
The Clausius-Duhem inequality in Lagrangian description:
Let ρ0 be the density of reference configuration. Let F (X, t) be deformation gradient
and J be its determinant. Let T (X, t) and S(X, t) be first and second Piola-Kirchhoff
stress tensors. Let θ0 (X, t), ψ 0 (X, t), η 0 (X, t), Q0h (X, t) and q 0 (X, t) be material de-
scription of temperature, free-energy, entropy, heat generation per unit mass and heat
flux, respectively. Then we have the following relationships:
ρ0 = ρJ,
1 DF
τ :D=τ :L = T : ,
J Dt
q 0 = JF −1 q
∇X · q 0 = J∇x · q
∇X θ 0 = F T ∇x θ
Substituting all the terms in Eq. (13), we get the Clausius-Duhem inequality with respect
to reference configuration
Dψ 0 Dθ0 1 DF q 0 · ∇X θ 0
+ η0 − T : + ≤ 0. (15)
Dt Dt ρ0 Dt ρ0 θ 0
Since the total time derivative and material time derivative are equal, we can also write
∂ψ 0 ∂θ0 1 ∂F q 0 · ∇X θ 0
+ η0 − T : + ≤ 0. (16)
∂t ∂t ρ0 ∂t ρ0 θ 0
Let E(X, t) be a Green strain tensor. Then we have the following Clausius-Duhem
inequality in Lagrangian description.
∂ψ 0 ∂θ0 1 ∂E q 0 · ∇X θ0
+ η0 − S: + ≤ 0. (17)
∂t ∂t ρ0 ∂t ρ0 θ0
Dρ
Applying the conservation of mass, i.e., = −ρ(∇x · v), we get
Dt
Z
D(ρη) Z
Qh Z
1
+ ρ(∇x · v)η ∂Ω ≥ ρ ∂Ω − q · n ∂Γ.
Ωc Dt Ωc θ Γc θ
D(ρη) ∂(ρη)
Since = + ∇x (ρη) · v, we can write
Dt ∂t
Z
∂(ρη) Z
Qh Z
1
+ ∇x (ρη) · v + ρ(∇x · v)η ∂Ω ≥ ρ ∂Ω − q · n ∂Γ.
Ωc ∂t Ωc θ Γc θ
Using divergence theorem, we obtain the following Clausius-Duhem inequality for control
volume: Z
∂(ρη) Z Z
Qh Z
1
∂Ω + ρηv · n ∂Γ ≥ ρ ∂Ω − q · n ∂Γ. (18)
Ωc ∂t Γc Ωc θ Γc θ
We use the inequality statement of second law of thermodynamics to impose some restric-
tions on constitutive relations.
References
We need a frame of reference to describe any physical quantity. For example, the earth
velocity is measured with respect to the sun. Unfortunately, the frame of reference is not
necessarily a fixed object (i.e., object at rest). For example, the frame of reference can
be attached either to the earth or to the moon. Clearly, the frames of reference can have
a relative motion with respect to each other. The description of motion of one frame
with respect to other is known as Euclidean transformation. This transformation resem-
bles rigid body motion. Furthermore, the physical quantities that are observed in two
frames of reference are related by the Euclidean transformation. In particular, kinematical
quantities as well as governing equations can be transformed from one frame of reference
to other by the Euclidean transformation. In this lecture, we discuss transformation of
kinematical quantities.
Frame of reference:
A frame of reference is fixed to the observer1 who makes observations in terms of position
and time. We now consider two observers who are in relative motion with each other and
start observing the same event using their rulers and clocks. Since both observers posses
their individual clocks and rulers, they can make different measurement of distances and
times of the same event. However, we assume that both observers are measuring length
and time in same units. In other words, both the observers can have different time
settings in their clocks although they have same units of measurements. We now derive
the relationship between observed kinematical quantities.
Let B0 be a reference configuration and B be a deformed or observed configuration.
Let F ∗ and F be two reference frames which are in relative motion. Without loss of
generality, let a fixed-star in the universe be a frame of reference F ∗ and F be a moving
frame with respect to fixed star. Let O∗ be a origin of F ∗ and O be a origin of F . Let
{e∗1 , e∗2 , e∗3 } and {e1 , e2 , e3 } be orthonormal basis to the frame F ∗ and F , respectively,
as shown in Fig. (1). Since both clocks of observers can set to different initial time, Let
t∗ and t be time setting in F ∗ and F , respectively. Let s be difference in time settings.
Then the position of a point in B and times are related by
x∗ = Q(t)x + c(t),
t∗ = t − s, (1)
where Q(t) represents rotation of frame F with respect to F ∗ and c(t) represents position
1
The observer could be any object, i.e., human, animal, instrument etc.
of O with respect to O∗ . As noted, the position and time relationship shown in Eq. (1)
is known as Euclidean transformation.
The change of frame of reference should not be confused with a change of basis. In
any given frame, we can choose infinitely many orthonormal basis. Transformation from
one basis to another basis can be done using a rotation tensor which is time independent.
Whereas, the rotation tensor appearing in change of frame is a function of time. Since the
rotation tensor in frame change is function of time, we can have a skewsymmetric tensor
DQ T
Wf = Q . (2)
Dt
The skewsymmetric tensor W f is known as frame-spin and it represents the rate of spin
of frame F with respect to fixed frame F ∗ .
For simplicity, we set Q(0) to be identity and c(0) to be zero vector. One can draw
all the conclusions that we make in this discussion, without making any assumption on
Q(0) and c(0). Clearly, the change of frame affects the quantities that are pertaining to
the deformed configuration but it does not affect reference configuration.
• Material points and material vectors are invariant, i.e., points and vector in refer-
ence configuration are not affected by the change of frame.
Frame-indifferent fields:
Let φ(x, t), v(x, t) and T (x, t) be scalar, vector and tensor fields over the deformed
configuration B by the frame F and φ∗ (x∗ , t∗ ), v ∗ (x∗ , t∗ ) and T ∗ (x∗ , t∗ ) be same scalar,
tensor and vector fields observed by the frame F ∗ . If these fields follow the relation
then they are said to be frame-indifferent or objective. We now show that some kine-
matical quantities are frame-indifferent while some others are not. If the quantity is
frame-indifferent then the magnitude of quantity observed by observers in both frames
is same. In addition, the frame-indifferent second-order tensors have same eigenvalues in
both the frames of reference.
where Q is rotation tensor, i.e., relative rotation between frames, and c is relative trans-
lation of frames. Of course t∗ = t − a as noted in Eq. (1).
We now determine the relation between various kinematical quantities using Eq. (4). If
the quantity is a function of time alone then we indicate time derivative with superposed
dQ
dot, e.g., Q̇ ≡ . We note that time derivative of any quantity with respect to t∗ and
dt
t are same as t∗ = t − a.
Deformation gradient:
Let F ∗ and F be deformation gradients observed in frame F ∗ and F . Taking material
gradient on both sides of Eq. (1), we get
Thus, the deformation gradient depends on frame. However, it can be observed that
det(F ∗ ) = det(F ) as det(Q) = 1. Hence, the transformation of volume element is invariant
under the change of frame.
Right and left Cauchy-Green strain tensors:
Let C ∗ and B ∗ be right and left Cauchy-Green strain tensors observed in frame F ∗ ; C
and B be right and left Cauchy-Green strain tensors observed in frame F . Then, using
Eq. (5) and definitions, we have
C ∗ = (F ∗ )T F ∗ = (QF )T QF = F T QT QF = F T F = C,
B ∗ = F ∗ (F ∗ )T = QF (QF )T = QF F T QT = QBQT .
Thus, the right Cauchy-Green strain tensor is invariant and the left Cauchy-Green strain
tensor is frame-indifferent.
Right and left stretch tensors:
Let U ∗ and U be right stretch tensors observed from the F ∗ and F frames; V ∗ and V
be left stretch tensors observed from the F ∗ and F frames. Then, using deformation
gradient in F ∗ and F , we get
F ∗ = R∗ U ∗ = V ∗ R∗ , F = RU = V R,
F ∗ = R∗ U
R∗ = QR, V ∗ R∗ = QV R.
V ∗ = QV QT . (7)
Thus, the right stretch tensor is invariant and the left stretch tensor is frame-indifferent.
Green strain and Almansi strain tensors:
Recall the definition of Green strain E and Almansi strain Ẽ from Lecture-18. Using
right and left Cauchy-Green strain in F ∗ and F frames, we get
E ∗ = E, (8)
∗ T
Ẽ = QẼQ . (9)
Similar to right and left Cauchy-Green strain tensors, the Green strain tensor is invariant
whereas Almansi strain tensor is frame-indifferent.
Velocity field:
Let v ∗ and v be velocity fields observed from F ∗ and F . Then taking material time
derivative on both sides of Eq. (4), we get
Dχ∗
v∗ =
Dt
Dχ DQ Dc
= Q + χ+
Dt Dt Dt
= Qv + Q̇x + ċ. (10)
Clearly, the velocity is neither invariant nor frame-indifferent. Physically, it can be seen
that velocity is not frame-indifferent as different observers can record different velocities.
For example consider a small ball moving radially outward in a straight groove on spinning
disc. Let F ∗ be fixed to the center of spinning disc and F be fixed to the ball. Then
velocity observed in frame F is zero whereas non-zero velocity is observed in frame F ∗ .
Thus, velocity is not frame-indifferent, i.e., velocity in one frame cannot be obtained by
simple orthogonal transformation of velocity in another frame.
Acceleration field:
Let a∗ and a be acceleration field observed from F ∗ and F ∗ . Then taking the material
derivative on both sides of Eq. (10), we get the following relation between accelerations:
Similar to velocity, the acceleration field is also neither invariant nor frame-indifferent.
Velocity gradient:
The velocity gradient is defined as spatial gradient of velocity. Let L∗ and L be velocity
gradient observed from F ∗ and F . Then using relation between deformation gradient
DF
Substituting the relation = LF , we get
Dt
L∗ = QLQT + Q̇QT . (12)
Thus, the velocity gradient is neither invariant nor frame-indifferent. Since the tensor
Q̇QT is skewsymmetric, the trace of velocity gradient tensor, i.e., spatial divergence of
velocity, is frame-indifferent.
∇x∗ · v ∗ = tr L∗ = tr L = ∇x · v. (13)
DẼ ∗
= D ∗ − (L∗ )T Ẽ ∗ − Ẽ ∗ L∗
Dt
= QDQT − (QLQT + Q̇Q)T (QẼQT ) − (QẼQT )(QLQT + Q̇Q)
= Q(D − LT Ẽ − ẼL)QT − QQ̇T QẼQT − QẼQT Q̇QT
DẼ T
= Q Q + Q̇ẼQT + QẼ Q̇T (Since Q̇QT = −QQ̇T ).
Dt
Thus, material time derivative of Green strain is invariant and the material time derivative
of Almansi strain is neither invariant nor frame-indifferent.
In summary, we have the following transformation laws for the kinematical quantities:
F ∗ = QF ,
C ∗ = C,
B ∗ = QBQT ,
U∗ = U,
V ∗ = QV QT ,
E ∗ = E,
Ẽ ∗ = QẼQT ,
v ∗ = Qv + Q̇x + c,
a∗ = Qa + 2Q̇v + Q̈x + c̈,
L∗ = QLQT + Q̇QT ,
D ∗ = QDQT ,
W ∗ = QW QT + Q̇QT ,
DE ∗ DE
= ,
Dt Dt
DẼ ∗ DẼ T
= Q Q + Q̇ẼQT + QẼ Q̇T .
Dt Dt
These quantities can be grouped into following three categories:
• The tensors B, V , Ẽ and D and the scalars J (i.e., det F ) and tr(L) (i.e., ∇x · v),
are frame-indifferent.
DE
• The tensors C, U , E and are invariant.
Dt
DẼ
• The tensors F , L, W and and the vectors v and a are neither frame-indifferent
Dt
nor invariant.
Reference
The principle of frame-indifference states that the form of governing equations must be
independent of frame of reference. In other words, there is no preferred frame such as
‘inertial’ frame in classical mechanics setup. In this lecture, we shall see the assumptions
that are required to make mass balance and laws of dynamics independent of frame of
reference.
x∗ = Q(t)x + c(t),
t∗ = t − s,
where Q(t) represents relative rotation, c(t) represents relative translation and s repre-
sents the time difference in clocks.
ρ0 = ρ∗ J ∗ = ρJ. (2)
ρ∗ = ρ. (3)
Thus, the density is frame-indifferent. Substituting Eqs. (3) and (1)2 in conservation of
mass, we get
Dρ∗ Dρ
+ ρ∗ ∇x∗ · v ∗ = 0 =⇒ + ρ∇x · v = 0. (4)
Dt Dt
Thus, conservation of mass has same form in all frames.
Conservation of linear momentum:
We now postulate that traction vector is frame-indifferent. Consequently, the Cauchy
We note that the first Cauchy’s equation of motion in F ∗ and F can be written as
b = QT b∗ − 2ω × v − ω̇ × x − ω × (ω × x) − QT c̈.
The terms other than QT b∗ are arising due to relative motion between reference frames.
These terms are usually known as
• 2ω × v Coriolis force,
• ω̇ × x Euler force,
• ω × (ω × x) centrifugal force,
Galilean transformation:
The Galilean transformation is a special case of Euclidean transformation where Q and
ċ are constants. Physically, in Galilean transformation the frames move at constant lin-
ear velocity with respect to each other. It can be noted that Coriolis force, Euler force,
centrifugal force and inertial force of relative translation becomes zero in Galilean trans-
formation. Hence, the acceleration and body force per unit mass are frame-indifferent,
i.e,
b∗ = Qb and a∗ = Qa.
Thus, the linear momentum balance is naturally frame-indifferent in Galilean transfor-
mation. In other words, the linear momentum balance has same form in all frames that
are related by Galilean transformation.
Conservation of angular momentum:
We have shown in Lecture-27 that the conservation of angular momentum is equivalent
to symmetry of Cauchy stress tensor. Let τ ∗ and τ be Cauchy stress fields observed by
the frames F ∗ and F , respectively. Using frame-indifference nature of Cauchy stress and
symmetry property in frame F , we have
T
(τ ∗ )T = Qτ QT = Qτ T QT = Qτ QT = τ ∗ .
W f Qτ QT = Qτ QT W f .
Qτ QT = λI,
where λ is some constant. Hence, Cauchy stress tensor must have the form τ = λI if
its material time derivative is frame-indifferent. However, in general the Cauchy stress
tensor need not have the form of λI. Thus, in general, the material time derivative of
Cauchy stress is not frame-indifferent.
The rate of stress is required to represent the constitutive relations of rate-dependent
materials. These constitutive relations demand the frame-indifferent stress rates as they
are necessarily frame-indifferent. Clearly, as shown above, the rate of Cauchy stress is not
frame-indifferent. Thus, we now construct a stress rate that is frame-indifferent using the
invariance nature of second Piola-Kirchhoff stress tensor.
S ∗ = J (F )−1 τ ∗ (F )−T
= JF −1 Q−1 Qτ QT Q−T F −T
= JF −1 τ F −T
= S
Since the Piola-Kirchhoff stress is invariant, the material time derivative is also invariant,
i.e.,
DS ∗ DS
= .
Dt Dt
Taking material derivative of relation S = JF −1 τ F −T , we get
DS D −1
= JF τ F −T
Dt Dt
DJ −1 DF −1 Dτ −T DF −T
= F τ F −T + J τ F −T + JF −1 F + JF −1 τ
Dt Dt Dt Dt
Dτ
= (tr L)F −1 τ F −T + JF −1 Lτ F −T + JF −1 F −T + JF −1 τ LT F −T
Dt
Dτ
= JF −1 − Lτ − τ LT + (tr L)τ F −T ,
Dt
where
Dτ
τ◦ = − Lτ − τ LT + (tr L)τ . (9)
Dt
is known as Truesdell stress rate. It is easy to see that the Truesdell stress rate τ ◦ is
frame-indifferent. Using material rate of second Piola-Kirchhoff stress tensor, we can
write
1 DS T
τ◦ = F F .
J Dt
Let (τ ◦ )∗ and τ ◦ be Truesdell stress rates which are observed by frames F ∗ and F . Then
1 ∗ DS ∗
(τ ◦ )∗ = ∗
F (F ∗ )T
J Dt
1 DS T T
= QF F Q
J Dt
= Qτ ◦ QT .
We can obtain alternative frame-indifferent stress rates using Lie derivative. The Lie
derivative of spatial tensor field G with respect to another tensor field P is defined by
D −1
LP (G) = P P GP −T P T .
Dt
Many frame-indifferent stress rates can be obtained using the Lie derivative by appropri-
ately choosing G and P in above definition. We note that the Truesdell stress rate can
also be defined using Lie derivative.
Truesdell stress rate:
Let LF (Jτ ) be the Lie derivative of Jτ with respect to F . Then the Truesdell stress rate
τ ◦ is defined by LF (Jτ )/J. Hence, we can write
1 D −1
τ◦ = F F Jτ F −T F T .
J Dt
DF −1
Substituting = −F −1 L, we get Eq. (9).
Dt
Oldroyd stress rate:
The Oldroyd stress rate τ is defined as Lie derivative of τ with respect to F , i.e.,
D −1
τ =F F τ F −T F T .
Dt
DF −1
Using the relation = −F −1 L, we have
Dt
Dτ
τ = − Lτ − τ LT . (10)
Dt
G∗ = QGQT .
P ∗ = QP ,
where P ∗ and P are observed by F ∗ and F . Then the frame-indifferent rate can be
DG DP
obtained in terms of G, P , and using the Lie derivative
Dt Dt
D −1
LP (G) = P P GP −T P T . (14)
Dt
Thus, various frame-indifferent rates of both stresses and strains can be obtained.
References
Here, we show that the principle of frame-indifference plus the first law of thermodynamics
is equivalent to all other conservation laws (i.e., conservation of mass, linear momentum
and angular momentum). As discussed in Lecture-29, the first law of thermodynamics
states that rate of change of total energy (i.e., the kinetic and internal energy) is equal to
the rate of work done by external forces plus rate of change of heat energy, i.e.,
d Z 1
Z Z
ρ e + v · v ∂Ω = (t · v + γ) ∂Γ + ρ (b · v + Qh ) ∂Ω. (1)
dt Ω 2 Γ Ω
• The principle of frame-indifference states that the first law of thermodynamics as-
sumes same form in all frames of reference.
Let density (ρ), internal energy (e), heat flux across the surface (γ), heat supply per unit
mass (Qh ) and traction vector (t) be frame-indifferent. Let b be a body force that follows
transformation rule stated in principle of frame-indifference for linear momentum balance
(see Lecture-32). Then we have the following result.
The result of frame-indifference applied to first law of thermodynamics:
The principle of frame-indifference applied to the first law of thermodynamics implies:
and heat flux across normal to the Γ in the frame F ∗ , respectively. Let ρ(x, t), t(x, t, n),
a(x, t), b(x, t), e(x, t), Qh (x, t) and γ(x, t, n) be density, traction, acceleration, body
force per unit mass, specific internal energy, heat generation per unit mass and heat flux
across normal to the Γ in the frame F . We note that γ(x, t, n) is analogous to traction
t(x, t, n).
As mentioned in previous lecture, the Euclidean transformation between the frames
F and F is given by
∗
The assumption we considered for the density, internal energy, heat flux, heat supply,
traction and body force can be written as
Let us defined the energy density (total energy per unit volume) of continuum body f
defined by
v·v
f =ρ e+ .
2
Since Ω is a part of deformed configuration B, the balance of energy for Ω observed by
the frame F
d Z Z Z
f ∂Ω = (t · v + γ) ∂Γ + ρ (b · v + Qh ) ∂Ω.
dt Ω Γ Ω
Using first transport theorem, we get
!
Z
Df Z Z
+ f ∇x · v ∂Ω = (t · v + γ) ∂Γ + ρ (b · v + Qh ) ∂Ω. (3)
Ω Dt Γ Ω
v ∗ = v + ċ, and a∗ = a.
Using the transformation law for the field of body force per unit mass, we have
b∗ = b.
Since traction is frame-indifferent, substituting the above relations in Eq. (5), we get
!
λ2 Dρ
Z
Z Z
λv · û + + ρ∇x · v ∂Ω + λρ(a − b) · û ∂Ω = λt · û ∂Γ. (6)
Ω 2 Dt Ω Γ
The argument similar to the Cauchy’s theorem (see Lectures 26 and 27) yields the exis-
tence of Cauchy stress tensor τ (x, t) such that t = τ n.
Using the similar arguments that were used in Cauchy’s theorem, we get the existence of
heat flux vector q(x, t) such that γ = q · n.
Conservation of mass:
Substituting the relation t = τ n in Eq. (6), we get
!
λ2 Dρ
Z
Z Z
λ(v · û) + + ρ∇x · v ∂Ω + λρ(a − b) · û ∂Ω = λτ n · û ∂Γ.
Ω 2 Dt Ω Γ
Using the relation τ n · û = n · τ T û, the divergence theorem and ∇x · (τ T û) = (∇x · τ ) · û,
we can write
!
λ2 Dρ
Z Z
λ(v · û) + + ρ∇x · v ∂Ω = λ (∇x · τ + ρ(b − a)) · û ∂Ω. (7)
Ω 2 Dt Ω
∇x · τ + ρ(b − a) = 0. (9)
Using the transformation law between frames, we have (a∗ − b∗ ) = Q(a − b) and t∗ = Qt.
Substituting these relations in above equation, we can write
Z Z
∗
(ρQ(a − b) · v − ρ(a − b) · v) ∂Ω = (Qt · v ∗ − t · v) ∂Γ.
Ω Γ
Using the property of dot product and the divergence theorem, we can write
Z Z
∗
T
ρ(a − b) · (Q v − v) ∂Ω = ∇x · τ T (QT v ∗ − v) ∂Ω
Ω ZΩ
= (∇x · τ ) · (QT v ∗ − v) + τ : ∇x (QT v ∗ − v) ∂Ω.
Ω
Reference
Here, we discuss the need of constitutive relations and also the restrictions imposed by
the principle of material frame-indifference.
Necessity of constitutive relations:
We observe in physical world that different materials respond differently for the same
forces. For example, steel and rubber with same geometry undergoes different deforma-
tions for the same amount of forces. Similarly, response of water or gas is different from
the solids under the action of forces. Though all materials follow the same basic principles
but they do exhibit different behaviors. It shows that the basic principles that we dis-
cussed are not sufficient to characterize the behavior of materials. In fact, we now show
the same argument mathematically.
We have the following basic laws, i.e., conservation of mass, conservation of momenta,
conservation of energy and second law of thermodynamics:
Dρ
+ ρ∇x · v = 0, (1)
Dt
Dv
ρ = ∇x · τ + ρb, and τ T = τ , (2)
Dt
De
ρ = τ : D − ∇x · q + ρQh , (3)
Dt
Dη ρQh 1 q
ρ ≥ − ∇x · q + 2 · ∇x θ. (4)
Dt θ θ θ
In summary, we have five independent equations as second law of thermodynamics is
inequality. On the other hand, we have following 16 unknowns in the equations,
These unknowns (ρ, v, τ , e, q, θ and η) are also known a state variables as they represent
the state of continuum at every point.
Assuming that the continuum body interaction with external environment is known,
i.e., body forces per unit mass b and heat sources distribution Qh are known1 . Clearly,
the unknowns are more than the equations and hence it is not possible to determine
the unknowns uniquely. Thus, the problem is ill-posed with above governing equations
except for some trivial cases of rigid body motion. In order to make problem well-posed,
we need to specify 11 more equations known as constitutive relations. These constitutive
relations are specified on the basis of mechanical and thermal response of material, i.e.,
these equations are dependent on the internal constitution of the material.
Constitutive relations cannot be chosen arbitrarily and they must follow a set of prin-
ciples and rules imposed by physical laws and material symmetry. Though there is great
reduction in the constitutive relation due to these principles and rules but there are con-
stants in the constitutive relations that cannot be evaluated alone by these principles. So,
finally experiments are needed to obtain the constants in the constitutive relations for a
given material. We now discuss the formulation of constitutive relations.
Constitutive relations and the principle of determinism:
Let B0 be a reference configuration and B be a deformed configuration at an instant of
time t. Let x = χ(X, t) be a map from B0 to B. The constitutive relations can be stated
as eleven implicit functionals of 16 unknown fields in the governing equations, i.e.,
∞
R [ρ(Y , t − p), χ(Y , t − p), τ (Y , t − p), e(Y , t − p), q(Y , t − p),
p=0
∀Y ∈B0
θ(Y , t − p), η(Y , t − p), X, t] = 0. (5)
where p represents the time history and t represent present time. The implicit constitutive
relation indicating that R at material a point X ∈ B0 may also depend on other martial
points Y ∈ B0 . We note that the functional R should be function of velocity, but the
mapping function χ is introduced in the arguments of R as the velocity is material (total)
time derivative of χ.
Notation used in above constitutive relations:
The principle of determinism states that the past history of motion and temperature at
all points of continuum body can determine the present state of the body. Furthermore, it
excludes the dependence of behavior on outside of body and future events. Therefore, we
1
The assumption of known body forces and heat source distribution is criticized by L. C. Woods in “The
bogus axioms of continuum mechanics”, Bulletin of the Institute of Mathematics and its Applications, 17,
pp. 98-102 (1981). However, the assumption is widely accepted. The framework of continuum mechanics
avoiding this assumption can be seen in I-Shih Liu, “Continuum Mechanics”, Springer, 2002.
use explicit functional equations such that they depend on history of motion, temperature
and density. This argument is also consistent with physical perception as density, motion
and temperature can be perceived by our physical senses. Thus, we choose density, motion
and temperature as independent variables whereas the Cauchy stress tensor, heat flux
vector, internal energy and entropy as dependent variables. Eleven constitutive relations
in explicit form can be written as
∞
τ (x, t) = τ̂ (ρ(Y , t − p), χ(Y , t − p), θ(Y , t − p), X, t),
p=0
∀Y ∈B0
∞
q(x, t) = q̂ (ρ(Y , t − p), χ(Y , t − p), θ(Y , t − p), X, t),
p=0
∀Y ∈B0
∞
e(x, t) = ê (ρ(Y , t − p), χ(Y , t − p), θ(Y , t − p), X, t),
p=0
∀Y ∈B0
∞
η(x, t) = η̂ (ρ(Y , t − p), χ(Y , t − p), θ(Y , t − p), X, t),
p=0
∀Y ∈B0
where x = χ(X, t). We note that all constitutive relations τ̂ , q̂, ê and η̂ are dependent
on same set of independent variables. The rule of considering same set of independent
variables in the explicit constitutive relations is known as principle of equipresence.
The constitutive equations need to satisfy following three requirements:
• material symmetry,
These three requirements impose severe restrictions on the constitutive relations. There-
fore, it is useful for practical problems as constitutive relations can be obtained with few
experiments. Before imposing these requirements, we also assume that the material is
simple. We now define simple material or principle of local action.
Simple materials:
In general, the constitutive relations (relation between state variables such as stress, defor-
mation, density, temperature, heat flux, internal energy and entropy) at a point depend
on thermomechanical history of every point in the body. This is known as non-local
property of constitutive relationship. The non-local nature of constitutive relations poses
difficulty in obtaining solution to the thermomechanical problems. However, it is rarely
relevant to many practical problems in the mechanics. Therefore, we assume that the
thermomechanical history of neighbourhood can affect the response at a point but not
the furthest points. This assumption is known as simple material or principle of local
action. Furthermore, we consider first-order approximation of dependent variable to ac-
count for local action. Let X be a material point and Y be a point in the neighbourhood.
Let dX = Y − X. Then, using Taylor’s series approximation, we get
ρ(Y , t − p) = ρ(X, t − p) + ∇X ρ(X, t − p) · dX + o(dX),
χ(Y , t − p) = χ(X, t − p) + F (X, t − p)dX + o(dX), (6)
θ(Y , t − p) = θ(X, t − p) + ∇X (X, t − p) · dX + o(dX),
In addition to simple material, we also assume that constitutive relations are not depen-
dent explicitly on time. In other words, we are assuming that material do not experience
aging2 .
The constitutive relations for the simple material with the assumption of no aging can
be written as
∞
τ (X, t) = τ̂ (χ(X, t − p), F (X, t − p), θ(X, t − p), g(X, t − p), X),
p=0
∞
q(X, t) = q̂ (χ(X, t − p), F (X, t − p), θ(X, t − p), g(X, t − p), X),
p=0
∞
e(X, t) = ê (χ(X, t − p), F (X, t − p), θ(X, t − p), g(X, t − p), X),
p=0
∞
η(X, t) = η̂ (χ(X, t − p), F (X, t − p), θ(X, t − p), g(X, t − p), X).
p=0
Despite the assumptions of principle of local action (simple material) and no aging, the
present model includes the most of constitutive relations which are in practice. For
example Hooke’s law for elastic solids and Newton’s law of viscosity for fluids. As noted
earlier, we impose principle of material frame-indifference on constitutive relations.
Principle of material frame-indifference (MFI):
The principle of material frame-indifference postulates that the constitutive relations have
same form in all frames of reference. This is similar to the principle of frame-indifference
for the governing equations. However, the principle of frame-indifference for the gov-
erning equations has been universally accepted but not the principle of material frame-
indifference. There are some works that show the constitutive relations do not have
2
The assumption of no aging can be deduced from the principle of material frame-indifference, see
I-Shih Liu, “Continuum mechanics”, Springer, 2002.
same form in all frames (see for example Müller, 1972)3 . There are series of discussions
on the aspects of this principle and it has been shown that there are two assumptions
involved in this principle4 . The two assumptions involved are observer-invariance and
form-invariance.
Let τ̂ ∗ and τ̂ be constitutive relations for the quantities τ ∗ and τ , respectively. We
note that the quantities τ ∗ and τ are description of same physical variable from the
frames F ∗ and F , respectively. As stated in previous lectures, the frames of reference
are related by Euclidean transformation. Let l∗ and l be list of independent variables,
that are present in the constitutive relations, observed by F ∗ and F , respectively. Then
the observer-invariance can be written as
Without debating much on controversy of this principle, in the absence of any experi-
mental evidence, we assume the principle of material frame-indifference as an axiom.
We present a simple example to understand the consequence of principle of material
frame-indifference. In the next lecture, we discuss the consequence of principle of material
frame-indifference on general constitutive relations of simple materials.
Example 1. Let τ be a stress response. Let ρ, v, L and D be density, velocity, velocity
gradient and rate of deformation, respectively. Let the constitutive relation of material be
τ = τ̂ (ρ, v, L).
If the constitutive relation follows the principle of material frame-indifference then the
constitutive relation reduces to
τ = τ̂ (ρ, D). (10)
Proof. Let F ∗ and F be fixed and moving frames. Then they are related by the Euclidean
transformation
χ∗ (X, t∗ ) = Q(t)χ(X, t) + c(t), (11)
3
Müller, I., “On the frame dependence of stress and heat flux”, Archive for Rational Mechanics and
Analysis, 45, pp. 241-250, 1972
4
Jog, C. S., “Foundations and applications of mechanics: continuum mechanics”, Narosa Publishing
House, New Delhi, 2007.
where t∗ = t − s, Q(t) represents relative rotation between frames and c(t) represents
translation.
This relation shows that the stress response cannot be a function of velocity as c(t)
is arbitrary. Let D and W be rate of deformation and spin tensors. Then we have
L = D + W . Of course D and W are symmetric and skewsymmetric part of velocity
gradient. Substitution of relation L = D + W in Eq. (12) yields
Note that the velocity v in the argument is removed as we have shown it is independent
of velocity. Since the orthogonal tensor Q(t) is arbitrary, we choose Q(t0 ) = I and
Q̇(t0 ) = −W . Evaluating the above equation at t = t0 , we get
References
where t∗ = t − s, Q(t) represents relative rotation between frames and c(t) represents
translation.
Let G, g and φ be tensor, vector and scalar valued constitutive relations of continuum
body B observed by the frame F . Let G∗ , g ∗ and φ∗ be corresponding constitutive
relations of same body B observed by the frame of reference F ∗ . Then the principle
of material frame-indifference provides the following relation between these constitutive
relations.
G(l∗ ) = QG(l)QT ,
g(l∗ ) = Qg(l), (2)
∗
φ(l ) = φ(l),
Consequently,
∞
ê (Q(t − p)χ(X, t − p) + c(t − p), Q(t − p)F (X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= ê (χ(X, t − p), F (X, t − p), θ(X, t − p), g(X, t − p), X),
p=0
∞
η̂ (Q(t − p)χ(X, t − p) + c(t − p), Q(t − p)F (X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= η̂ (χ(X, t − p), F (X, t − p), θ(X, t − p), g(X, t − p), X).
p=0
Since the stretch tensor, U (X, t) is invertible tensor, we can have R(X, t) = F (X, t)U −1 (X, t).
Substituting this relation in Eq. (8), we get
∞
τ̂ (F (X, t − p), θ(X, t − p), g(X, t − p), X) =
p=0
∞
F (X, t)U −1 (X, t) τ̂ (U (X, t − p), θ(X, t − p), g(X, t − p), X) U −1 (X, t)F T (X, t). (9)
p=0
We now recall the relation between the second Piola-Kirchhoff stress tensor and the
Cauchy stress tensor from Lecture-28, i.e., S = det(F )F −1 τ F −T . Hence, we now de-
fine the constitutive response function for second Piola-Kirchhoff stress
∞
Ŝ (F (X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= det(F (X, t))F −1 (X, t) τ̂ (F (X, t − p), θ(X, t − p), g(X, t − p), X)F −T (X, t). (10)
p=0
Substituting Eq. (9) and the relation det(F (X, t)) = det(U (X, t)) in Eq. (10), we get
∞
Ŝ (F (X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= det(U (X, t))U −1 (X, t) τ̂ (U (X, t − p), θ(X, t − p), g(X, t − p), X) U −1 (X, t), (11)
p=0
q
Since the right stretch tensor U (X, t) = F T (X, t)F (X, t) and the right Cauchy-Green
strain C(X, t) = F T (X, t)F (X, t), we can define a new function
∞ ∞
S̃ (C(X, t−p), θ(X, t−p), g(X, t−p), X) = Ŝ (F (X, t−p), θ(X, t−p), g(X, t−p), X)
p=0 p=0
so that it is a function of C. Using Eq. (11), the new function can explicitly be written
as
∞
S̃ (C(X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= det(U (X, t))U −1 (X, t) τ̂ (U (X, t − p), θ(X, t − p), g(X, t − p), X) U −1 (X, t), (12)
p=0
q
where U (X, t) = C(X, t).
Similar to previous procedure, using polar decomposition and the principle of material
frame-indifference, heat flux vector shown in Eq. (5) can be written as
∞
q̂ (F (X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= R(X, t) q̂ (U (X, t − p), θ(X, t − p), g(X, t − p), X), (13)
p=0
Recall transformation of heat flux into Lagrangian description (see Lecture-29), i.e.,
q 0 (X, t) = det(F )F −1 q(x, t). Therefore, we define a new vector valued function
∞
q̃ 0 (C(X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= det(U (X, t))U −1 (X, t) q̂ (U (X, t − p), θ(X, t − p), g(X, t − p), X), (15)
p=0
q
where U (X, t) = C(X, t).
Applying similar steps to internal energy and entropy response function i.e., to Eqs.
(6) and (7), we get
∞ ∞
ê (F (X, t−p), θ(X, t−p), g(X, t−p), X) = ê (U (X, t−p), θ(X, t−p), g(X, t−p), X),
p=0 p=0
(16)
∞ ∞
η̂ (F (X, t−p), θ(X, t−p), g(X, t−p), X) = η̂ (U (X, t−p), θ(X, t−p), g(X, t−p), X).
p=0 p=0
(17)
In conclusion all constitutive relations can be expressed as right Cauchy-Green strain. In
particular, stress response is related to strain tensor and its history. Hence, the consti-
tutive relation in solid mechanics is known as stress-strain relation. We now discuss the
restrictions imposed by the material symmetry on constitutive relation.
Material symmetry:
Most materials exhibits some material symmetries, i.e., material properties about some
planes are symmetric. The constitutive relations must reflect the symmetry possessed by
the material. The symmetries can be two types: (i) rotational symmetry (ii) reflectional
symmetry. We discuss these symmetries through examples.
Rotational symmetry:
Rotation of reference configuration leaves the indistinguishable deformed configuration
for some angle of rotations. We now present two examples to understand the rotational
symmetry. The angle of rotation corresponding to symmetry transformation is π in first
π
example while in second example.
2
Example-1 (square plate under bi-axial stretch):
Consider a square plate of homogeneous, fiber reinforce, composite material with a size
of a × a as shown in Fig. (1a). Let the fibers are aligned along AB and we assume that
the stiffness is more along the direction of fiber (i.e., along AB) than that of orthogonal
direction (i.e., along BC) as shown in Fig. (1d). We apply a uniform traction tu on
edges AD and BC as shown in Fig. (1b) and let the elongation along the load be ∆ax1 .
Similarly, we apply the uniform traction tu on edges AB and CD and let the elongation
along loading direction be ∆ay2 (see Fig. (1c)). Clearly, the elongation ∆ay2 is more than
∆ax1 as the stiffness along BC is less than AB.
We now consider another loading of bi-axial traction on the reference configuration.
Let the traction in both the directions be equal as shown in Fig. (2). Since the stiffness
is more along the fibers (see distribution of stiffness in Fig. (1d)), the plate deforms less
along fiber direction. If we mark a circle in the center of plate, it deforms into an elliptical
shape as shown in Fig. (2). It can be noted that the rotation of reference configuration by
an angle of π also leaves the same deformed configuration under same loading conditions
as shown in Fig. (2). In other words, if we do not mark A, B, C and D for the corners
then it is not possible to recognize the reference configuration by looking at deformed
configuration. Thus, the angle of symmetry for this problem is π.
Example-2 (circular plate under hydrostatic pressure):
Consider a circular plate of homogeneous material with stiffness distribution that is shown
B A B A
C D C D
(a) (b)
A
B
C
D
(d)
(c)
Figure 1: Square plate of homogeneous, fiber reinforce, composite material: (a) reference
configuration (b) loading along fiber direction (b) loading along orthogonal to fibers (d)
stiffness distribution of plate
B A
C D
Rotation of
reference
configuration
D C
Deformed
configuration
A B
in Fig. (3). The variation in stiffness is responsible for the deformed shape depicted
in Fig. (3) under hydrostatic pressure. It can be observed that rotation of reference
configuration by an angle of π/2 then applying the hydrostatic pressure would also assume
same deformed shape as there is symmetry in stiffness distribution. Similar to earlier
example, if there are no marking on the reference configuration then it is not possible to
distinguish the orientation of reference configuration by looking at deformed configuration.
A
A
B H B H
C G C G
D F
Rotation of E F
D
reference E
configuration
Stiffness distribution
G
H F
A E
B D
C
Deformed configuration
Reflectional symmetry:
We present an example to explain the reflectional symmetry. Let us consider a circular
plate with distribution of stiffness shown in Fig. (4). It can be observed that the reference
configuration can be deformed into a star like shape as shown in Fig. (4) under a hydro-
static pressure. Clearly, the stiffness distribution is symmetric about x2 -axis. Therefore,
if the reference configuration were to make reflection about the axis then the deformation
is same under the hydrostatic pressure as shown in Fig. (4).
A A
B H
B H
C G
C G
D F
Reflection of E
D E F
reference
configuration Stiffness distribution
A
H B
G C
F D
E
Deformed configuration
• Closed under binary operation: For all Q1 , Q2 ∈ G, the element generated by the
operation should also be in G, i.e., Q1 ∗ Q2 ∈ G.
(Q1 ∗ Q2 ) ∗ Q3 = Q1 ∗ (Q2 ∗ Q3 ).
• Existence of inverse element: For each element Q ∈ G, there exist Q−1 ∈ G such
that
Q−1 ∗ Q = Q ∗ Q−1 = I.
Orthogonal group:
Collect all orthogonal tensors and denote with Orth. In other words,
We now define usual product between tensors as binary operation between tensors. Then,
clearly, the product of any two orthogonal tensors is again an orthogonal tensor and also
usual product is associative. We also note that the identity tensor also belongs to the
set. In addition, by definition of orthogonal tensor, the transpose operator is the inverse
and also orthogonal tensor. Thus, the set of all orthogonal tensor follows all the axioms
of group and it is known as orthogonal group.
Material symmetry group:
A transformation defined over a reference configuration is called as material symmetry
transformation if it does not alter the deformed configuration. The collection of these
material symmetry transformations forms a group and this group is known as material
symmetry group. For example the material symmetry group is a subset of orthogonal
group for solids. This fact can be observed from the previous examples. Materials are
classified based on these symmetry groups. We now define the fluid and solid based on
these symmetry groups.
The fluid and the solid:
Fluid:
Take some fluid (water) into one test-tube and let it be a reference configuration. We now
pour the fluid from one test-tube into another test-tube of same size. We can see that
the fluid undergoes large deformation, still the fluid in second test-tube starts behaving
exactly similar to the previous one. The reason for this behavior is that the fluid does not
have any preferred reference configuration. The only similarity in both configuration is
that the density is same at similar locations. In other words, the transformation is density
preserving. Thus, the material symmetry group for fluid is unimoduluar transformations,
i.e., the collection of tensors those have unit determinant. Clearly, the fluid is an isotropic
material.
Solid:
Let us consider a bar and apply loads to deform the bar. We now consider the deformed
configuration as new reference configuration and apply similar loads as that of previous
case. But then the deformation response would be different from the previous case. Thus,
the solids posses a preferred reference configurations unlike fluids. The symmetry group
of solids is subgroup of orthogonal group. Furthermore, if the solid is isotropic then the
symmetry group is proper orthogonal group, i.e., group of rotational tensors.
We now define the material based on degree of symmetry.
Isotropic material:
A material whose symmetry group is orthogonal group is called as isotropic material.
Clearly, the orthogonal group includes all rotations and reflections. A material that
consists of rotation tensors and not reflections as symmetry group is known as hemitropic
material. In our discussion, we don’t distinguish the difference between isotropic and
hemitropic material. We refer these two materials as isotropic material.
Anisotropic material:
A material that has fewer symmetries than isotropic is called anisotropic material.
Transversely isotropic material:
A material that has all rotation tensors about a fixed axis as symmetry group is known
as transversely isotropic material. Of course this is a special case of anisotropic material.
Orthotropic material:
A material that has reflectional symmetry about three mutually orthogonal planes is
known as orthotropic material. Of course this is another special case of anisotropic ma-
terial.
Special class of materials:
In many practical problems, we make the following assumption to simplify the problem.
Homogeneous material:
The continuum body is said to be homogeneous if the constitutive relations are not ex-
plicitly dependent on material point, i.e, material properties are same at all points. In
fact this assumption has been made in previous examples to explain material symmetry.
Elastic material:
The material is said to be elastic if the constitutive relation is independent of the history
of deformation and temperature.
We now see that the material symmetry imposes certain restrictions on constitutive
relations along with principle of material frame-indifference.
∞
η̂ (F (X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= η̂ (F (X, t − p)H, θ(X, t − p), H T g(X, t − p), X).
p=0
Let us choose the orthogonal tensor Q such that Q = H, i.e., Q is in symmetry group.
Applying the principle of material frame-indifference, i.e., Eqs. (4), (5) (6) and (7),
to the right hand side of above equations, we get the following final form of constitutive
equations.
∞
τ̂ (QF (X, t − p)QT , θ(X, t − p), Qg(X, t − p), X)
p=0
∞
=Q τ̂ (F (X, t − p), θ(X, t − p), g(X, t − p), X) QT , (18)
p=0
∞
q̂ (QF (X, t − p)QT , θ(X, t − p), Qg(X, t − p), X)
p=0
∞
= Q q̂ (F (X, t − p), θ(X, t − p), g(X, t − p), X), (19)
p=0
∞
ê (QF (X, t − p)QT , θ(X, t − p), Qg(X, t − p), X)
p=0
∞
= ê (F (X, t − p), θ(X, t − p), g(X, t − p), X), (20)
p=0
∞
η̂ (QF (X, t − p)QT , θ(X, t − p), Qg(X, t − p), X)
p=0
∞
= η̂ (F (X, t − p), θ(X, t − p), g(X, t − p), X). (21)
p=0
Clearly, the orthogonal tensor Q belongs to the material symmetry group. Till now
there are no assumptions made over the material symmetry. We now make assumption
of isotropic material and show that the constitutive relations indeed reduces to isotropic
functions.
Constitutive relations for isotropic materials:
As noted earlier, full orthogonal group is symmetry group for isotropic materials. There-
fore, Q in above equations (i.e., Eqs. (18), (19), (20), and (21)) is arbitrary for isotropic
material.
We now recall that the constitutive equations can be represented as function of right
stretch tensor and thereby the right Green strain tensor as shown in Eqs .(9), (14), (16)
and (17). In other words, there exist constitutive response functions such that
∞
τ̂ (F (X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= F (X, t) τ̃ (U (X, t − p), θ(X, t − p), g(X, t − p), X) F T (X, t), (22)
p=0
∞
q̂ (F (X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= F (X, t) q̃ (C(X, t − p), θ(X, t − p), g(X, t − p), X), (23)
p=0
∞
ê (F (X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= ẽ (C(X, t − p), θ(X, t − p), g(X, t − p), X), (24)
p=0
∞
η̂ (F (X, t − p), θ(X, t − p), g(X, t − p), X)
p=0
∞
= η̃ (C(X, t − p), θ(X, t − p), g(X, t − p), X), (25)
p=0
√
as U = C. Applying these relations to above constitutive responses shown in Eqs. (18)
and (19), we get
∞
QF QT τ̃ (QC(X, t − p)QT , θ(X, t − p), Qg(X, t − p), X) QF T QT
p=0
∞
= QF τ̃ (C(X, t − p), θ(X, t − p), g(X, t − p), X) F T QT ,
p=0
!
∞
T T
QF Q q̃ (QC(X, t − p)Q , θ(X, t − p), Qg(X, t − p), X)
p=0
!
∞
= QF q̃ (C(X, t − p), θ(X, t − p), g(X, t − p), X) .
p=0
Since Q and F are invertible tensors, we get the following form of stress and heat flux
constitutive responses for an isotropic material.
∞
τ̃ (QC(X, t − p)QT , θ(X, t − p), Qg(X, t − p), X)
p=0
∞
=Q τ̃ (C(X, t − p), θ(X, t − p), g(X, t − p), X) QT , (26)
p=0
∞
q̃ (QC(X, t − p)QT , θ(X, t − p), Qg(X, t − p), X)
p=0
∞
= Q q̃ (C(X, t − p), θ(X, t − p), g(X, t − p), X). (27)
p=0
Applying Eqs. (24) and (25) to the constitutive responses (20) and (21), we get the
following form of constitutive responses for internal energy and entropy.
∞
ẽ (QC(X, t − p)QT , θ(X, t − p), Qg(X, t − p), X)
p=0
∞
= ẽ (C(X, t − p), θ(X, t − p), g(X, t − p), X), (28)
p=0
∞
η̃ (QC(X, t − p)QT , θ(X, t − p), Qg(X, t − p), X)
p=0
∞
= η̃ (C(X, t − p), θ(X, t − p), g(X, t − p), X). (29)
p=0
Recall isotropic functions from Lecture-12. Clearly, the constitutive relations for isotropic
material are isotropic functions. More detailed account of material symmetry and its
rigorous treatment can be found in Jog (2007) or in Liu (2002).
References
In this lecture, we discuss the restrictions imposed by the second law of thermodynamics
on constitutive relations of elastic solids. As mentioned in Lecture-35, the constitutive
response of elastic material does not depend on thermomechanical history.
Constitutive relations in Lagrangian framework:
It is convenient to work in Lagrangian framework for the solid mechanics. Therefore, we
consider the constitutive relations that are defined in material description, rather than
the spatial description. The constitutive equations for the first Piola-Kirchhoff stress,
internal energy and entropy are given by
∂ 2 e0 2 0
0 ∂ η
− θ = 0. (8)
∂g 0 ∂θ0 ∂g 0 ∂θ0
∂η 0
= 0. (9)
∂g 0
∂e0
= 0. (10)
∂g 0
ψ 0 = e0 − η 0 θ 0 . (12)
∂ψ 0
T = ρ0 . (13)
∂F
Taking the derivative of free energy ψ 0 with respect to temperature θ0 and substituting
in Eq. (6), yields
∂ψ 0
η0 = − 0 . (14)
∂θ
Substituting Eqs. (4), (5) and (6) in Clausius-Duhem inequality, i.e., in Eq. (3), we get
q0 · g0 ≤ 0 (15)
This shows that heat flux flows in the negative gradient direction, or, alternatively, heat
flows from hot to cold regions. This is a classical statement of second law of thermody-
namics.
Summary of the relations:
In summary, we have the following three observations:
• The constitutive relations of first Piola-Kirchhoff stress, internal energy and entropy
are independent of the gradient of temperature
• There is a free energy ψ 0 = e0 − η 0 θ0 such that the first Piola-Kirchhoff stress and
entropy are given by
∂ψ 0 ∂ψ 0
T = ρ0 , and η 0 = − 0 .
∂F ∂θ
• Heat flows from hot region to cold region, i.e.,
q 0 · g 0 ≤ 0.
We now discuss two special cases of the general constitutive relations: (i) The heat con-
duction in solids (ii) The deformation of hyperelastic solids.
Heat conduction in rigid solids:
Let the heat flux vector q 0 = q̂ 0 (X, F , θ0 , g 0 ). Then the heat conductivity tensor is
defined by
0 ∂ q̂i0 0
0
[K(F , θ )]ij = − 0 (X, F , θ , g ) . (17)
∂gj 0
g =0
If g 0 → 0 then o(|g 0 |) can be neglected. Substitution of Eq. (18) in Eq. (15) yields
We now derive the governing equation of heat conduction for rigid solids, i.e., for the
solids with negligible deformation.
Taking the material time derivative of Helmholtz free energy function ψ 0 (X, F , θ0 )
(see Eq. (12)), we get
∂ψ 0 ∂ψ 0 ∂ψ 0
= : Ḟ + 0 θ̇0 . (21)
∂t ∂F ∂θ
Substituting Eqs. (13) and (14), we obtain
∂ψ 0 1
= T : Ḟ − η 0 θ̇0 . (22)
∂t ρ0
Using Eq. (12), we can write the material time derivative of free energy
∂ψ 0 ∂e0 ∂η 0 0 ∂θ0
= − θ − η0 . (23)
∂t ∂t ∂t ∂t
We can have the following relation by equating total time derivative of free energy in Eqs.
(22) and (23).
∂e0 ∂η 0
ρ0 = T : Ḟ + ρ0 θ0 . (24)
∂t ∂t
Upon substitution of Eq. (24) in the first law of thermodynamics (i.e., in Eq. (11) of
Lecture-29), we get
∂η 0
ρ0 θ 0 = ρ0 Q0h − ∇X · q 0 . (25)
∂t
We now define a specific heat capacity c by
∂ 0
c(X, F , θ0 ) = 0
e (X, F , θ0 ). (26)
∂θ
Combining Eqs. (12) and (14), we can have the following expression for specific heat
capacity.
∂η 0
c = θ0 0 . (27)
∂θ
If the deformations are negligible (i.e., solid is rigid) then Eq. (16)3 give rise to the relation
∂η 0 ∂η 0 ∂θ0
= 0 . (28)
∂t ∂θ ∂t
Substitution of Eqs. (28) and (20) in Eq. (25), we can obtain the following heat conduction
equation.
∂θ0
ρ0 c = ∇X · Kg 0 + ρ0 Q0h . (29)
∂t
Though the conductivity tensor K is not necessarily symmetric, in practice we assume
that the conductivity tensor is symmetric. This assumption is known as Onsager reciprocal
relations. If the medium (continuum body) is isotropic and homogeneous then the heat
conduction equation can be written as
∂θ0
ρ0 c = κ∇X · g 0 + ρ0 Q0h , (30)
∂t
where κ is conductivity constant, i.e., K = κI.
Before discussing hyperelastic solids, we discuss the relation between second Piola-
Kirchhoff stress and Helmholtz free energy.
∂ ψ̂ 0
T = ρ0 , (33)
∂F
then the second Piola-Kirchhoff stress S can be obtained by
∂ ψ̃ 0
S = ρ0 , (34)
∂E
∂ ψ̂ 0
: U = DF ψ̂ 0 (F , θ0 )[U ]
∂F
= DF ψ̃ 0 (E, θ0 )[U ]
= DE ψ̃ 0 (E, θ0 )[DF (E)[U ]] (Using chain rule)
1
= DE ψ̃ 0 (E, θ0 )[F T U + U T F ]
2
1 ∂ ψ̃ 0 T
= : F U + UTF
2 ∂E
" #T
F ∂ ψ̃ 0 ∂ ψ̃ 0
= + :U (Since A : (B T C) = (CAT ) : B)
2 ∂E ∂E
! !
∂ ψ̃ 0 ∂ ψ̃ 0
= F :U Since is symmetric
∂E ∂E
Therefore, the first Piola-Kirchhoff stress
∂ ψ̂ 0 ∂ ψ̃ 0
T = ρ0 = ρ0 F (35)
∂F ∂E
Recall the following relation between first and second Piola-Kirchhoff stress tensors from
Lecture-28.
S = F −1 T . (36)
Substitution of Eq. (35) in Eq. (36) give rise to Eq. (34). This completes the proof.
Since ψ̂ 0 and ψ̃ 0 are having same dependency on θ0 , Eq. (14) implies the following
relation for entropy.
∂ ψ̃ 0
η 0 = η̃ 0 (X, E, θ0 ) = − 0 . (37)
∂θ
In summary, we have the following relations for the second Piola-Kirchhoff stress and
entropy
∂ ψ̃ 0 ∂ ψ̃ 0
S = ρ0 and η 0 = − 0 .
∂E ∂θ
We now specialize these relations to hyperelastic materials.
Hyperelastic solids:
The hyperelastic solid is defined as a elastic solid with a stored energy function called
strain energy density function W (X, E) such that
∂W
S(X, E) = . (38)
∂E
We now show that thermoelastic solid is hyperelastic solid in two special cases. These
special case are (i) isothermal deformation (ii) isentropic deformation. In former case the
temperature is constant during the process of deformation and in later case the entropy
is constant during deformation.
Isothermal case:
Since θ0 is constant in isothermal deformation, the thermoelastic solid is clearly a hy-
perelastic solid with W (X, E) = ρ0 ψ̃ 0 (X, E, θ0 ). Using Eq. (34), we can obtain second
Piola-Kirchhoff stress
∂ ψ̃ 0 ∂W
S = ρ0 = . (39)
∂E ∂E
Isentropic case:
In isentropic deformation η 0 is constant. Taking derivative of Helmholtz free energy with
respect to temperature θ0 (see Eq. (12)), yields
∂ψ 0 ∂e0
= − η0. (40)
∂θ0 ∂θ0
Combining Eqs.(37) and (40), we get
∂e0
= 0. (41)
∂θ0
Clearly, the internal energy e0 is independent of temperature. Thus, the stored energy
function or strain energy density function W (X, E) = ρ0 e0 for isentropic process.
Using the principle of material frame-indifference, we can write the internal energy
∂ ψ̃ 0 ∂ẽ0 ∂W
S = ρ0 = ρ0 = (43)
∂E ∂E ∂E
In conclusion, either isothermal or isentropic cases the thermoelastic solid can be consid-
ered as hyperelastic solid.
Isotropic hyperelastic solid:
From the previous discussion the stress response (second Piola-Kirchhoff stress) for the
hyperelastic solid can be written as
S = S̃(X, E) (44)
For the isotropic solid, using Rivlin-Ericksen representation theorem (see Lecture-12), we
can have
S = α(X, IE )I + γ(X, IE )E + β(X, IE )E 2 , (45)
where IE indicates three principal invariants of E. This is the most general stress-strain
relation for the isotropic hyperelastic solids. Linearized version of this model is known as
St Venant-Kirchhoff model.
St Venant-Kirchhoff model:
The linear relation between second Piola-Kirchhoff stress and Green strain is known as
St Venant-Kirchhoff model. The linearization of Eq. (45) yields
where λ and µ are Lamé constants. The stored energy or strain energy density function
for St Venant-Kirchhoff model can be written as
λ
W (E) = (tr E)2 + µ(tr E 2 ). (47)
2
Limitation of the model: We expect the strain energy density W (X, E) → ∞ when the
material is compressed to a point, i.e., det(F ) → 0. However, this is not true with the St
Venant-Kirchhoff model. Thus, this model is good approximation when strains are small.
We now discuss few other common models for elastic materials such that the shortcoming
in St Venant-Kirchhoff model is accounted.
Other common models:
Neo-Hookean model:
The strain energy density
q
W (E) = a tr(I + 2E) + Γ det(I + 2E) , (48)
q
where Γ(s) = c s2 − d ln(s) and a, c, d > 0. We note that det(F ) = det(F T F ) =
q
det(I + 2E).
Mooney-Rivlin model:
The strain energy density
q q
−1
W (E) = a tr(I + 2E) + b det(I + 2E) tr (I + 2E) +Γ det(I + 2E) , (49)
References
In this lecture, we discuss the theory of linear elasticity. Though there are strong assump-
tions in this theory, due to simplicity, it is widely used for predicting the behaviour of
solids.
Kinematics:
We want to have a strain measure that is linear function of displacements. In order to
obtain such a linear strain, we assume that the displacement gradient ∇X u is small.
Therefore, the quadratic term that appears in the Green strain tensor (see Eq. (6) in
Lecture-20) can be neglected. Consequently, we have a linear strain measure
1
= ∇X u + (∇X u)T .
2
This linear strain is also known as small strain tensor. In addition to small displacement
gradients, if the displacements are assumed to be small then the linearization of Almansi
strain tensor also coincides with the small strain tensor. In other words, if displacement
gradients and displacements are small then we have the linear strain
1 1
= ∇X u + (∇X u)T ≈ ∇x u + (∇x u)T
2 2
We note that there is no difference between material gradient and spatial gradient in
linear elasticity. Therefore, we can write the small strain tensor
1
= ∇u + (∇u)T . (1)
2
We now show that Eq. (3) represents the rigid motion with the assumptions of small
displacements and displacement gradients. Recall the following equation for rigid motion
from Lecture-18.
x = Q(t)X + c(t). (4)
Let W = Ŵ t. Then for small t we have the following relation from Eqs. (4) and (5).
x − X = W X + c. (6)
Since displacement field u = x − X, we get the result stated in Eq. (2). It is easy to see
that W and c can be function of X but they are not function of time t.
As defined in Eq. (1), six components of strain tensor depends on three components of
displacements. Hence, a unique strain field can be obtained for a give displacement field.
But for a given strain components, the displacement field is not necessarily unique and
compatible. In order to ensure a single valued displacement field, the given strain field
should follow compatibility conditions discussed below.
Compatibility conditions:
The strain field should satisfy the following condition known as compatibility equations
to ensure single valued displacement field.
∇ × (∇ × ) = O. (7)
In indicial notation
∂ 2 nq
(∇ × (∇ × ))ij = imn jpq = 0. (8)
∂Xm ∂Xp
Clearly, there are six equations as the strain tensor is symmetric. We now show that every
displacement field satisfies the compatibility conditions.
Thus, for a given displacement field the compatibility conditions are automatically satis-
fied. On the other hand, for a give strain field the compatibility ensures the single valued
displacement field.
We now state two important properties of curl of curl of strain tensor which will be
used for simplifying the compatibility conditions.
• It follows from previous proof that the trace of tensor ∇ × (∇ × ) is zero, i.e.,
tr (∇ × (∇ × )) = 0, (10)
or, alternatively,
(∇ × (∇ × ))ii = 0. (11)
Using − δ identity (see Lecture-3) and Eq. (8), we get
∇2 (tr ) − ∇ · (∇ · ) = 0. (12)
Problem 1. The compatibility stated in Eq. (7), i.e., ∇×(∇×) = O, can be alternatively
expressed as
∇2 + ∇ (∇(tr )) − ∇(∇ · ) − (∇(∇ · ))T = O. (13)
or, alternatively in indicial notation,
Proof. We use indicial notation to prove the result. Consider the ij th component of curl
of curl of strain field
∂ 2 nq
(∇ × (∇ × ))ij = imn jpq (15)
∂Xm ∂Xp
Recall the following general − δ relation from Lecture-3.
δij δip δiq
imn jpq = det δmj δmp δmq
Since ∇ × (∇ × ) = O, we get
T = Jτ F −1 and S = JF −1 τ F −T , (19)
where F and J are deformation gradient and its determinant, respectively. Since F =
I + ∇u (see Lecture-17) and the displacement gradient ∇u is small, F ≈ I. Hence, the
relations shown in Eq. (19) imply
T ≈ τ and S ≈ τ . (20)
We will show the relation between engineering constants (i.e., Young’s modulus E and
Poisson’s ratio ν) and Lamé constants using thought experiments. For isotropic material,
the strains can also be represented in terms of stress tensor
1+ν ν
= τ − tr(τ )I. (23)
E E
We now present relation between engineering constants (Young’s modulus E and Poisson’s
ratio ν) and Lamé constants along with bounds on these constants.
x1 = X1 + γX2 , x2 = X2 , and x3 = X3 .
1 unit
τ12 = µγ.
Since the Cauchy stress is hydrostatic pressure, i.e., τ = −pI, and and p are positive
constants, we have
3λ + 2µ > 0. (25)
The inequalities shown in Eq. (24) and (25) are equivalent to E > 0 and −1 < ν < 0.5,
i.e.,
µ > 0 and 3λ + 2µ > 0 ⇐⇒ E > 0 and − 1 < ν < 0.5.
In stress approach, the compatibility conditions shown in Eq. (7) is expressed in terms
of stress components known as Beltrami-Michell compatibility conditions. The stress field
is guessed such that it satisfies the Beltrami-Michell compatibility conditions and the
boundary conditions.
We now present both the Navier equation of elasticity and Beltrami-Michell compati-
bility conditions.
Reference
τ = τ̂ (X, F , θ, g, Ḟ ),
q = q̂(X, F , θ, g, Ḟ ),
e = ê(X, F , θ, g, Ḟ ),
η = η̂(X, F , θ, g, Ḟ ),
for every unimodular tensor H, i.e., for every H such that det(H) = 1. Let L and ρ be
velocity gradient and density field, respectively. Then, choosing H = J 1/3 F −1 , we get
since ρ0 = ρJ.
Proof. The given constitutive relation τ = τ̃ (X, ρ, θ, g, L). Let F ∗ and F be fixed and
moving frames as discussed in the principle of material frame-indifference (see Lecture-34).
Then
τ̃ (X ∗ , ρ∗ , θ∗ , g ∗ , L∗ ) = Qτ̃ (X, ρ, θ, g, L)QT ,
where Q(t) is relative rotation between frames. Since X ∗ = X, ρ∗ = ρ, θ∗ = θ, g ∗ = g and
L∗ = QLQT + Q̇QT (see Lectures 31, 32 and 35) and the velocity gradient L = D + W
where D is symmetric part and W is skew symmetric part, above equation can be written
as
τ̃ (X, ρ, θ, g, QDQT + QW QT + Q̇QT ) = Qτ̃ (X, ρ, θ, g, L)QT , (2)
where Q is arbitrary rotation tensor. For any given skewsymmetric tensor W , we can
choose an orthogonal tensor such that
Q(0) = I and Q̇(0) = −W .
Substituting these relations in Eq. (2), we get
τ̃ (X, ρ, θ, g, L) = τ̃ (X, ρ, θ, g, D).
Thus, the principle of material frame-indifference implies above relation. Since the given
function is isotropic, the converse is trivial.
Applying similar steps that are discussed in Problem-1 to other constitutive relations,
we can show that they are also function of rate of deformation. In summary, we have
τ = τ̃ (X, ρ, θ, g, D)
q = q̃(X, ρ, θ, g, D)
e = ẽ(X, ρ, θ, g, D)
η = η̃(X, ρ, θ, g, D), (3)
The following Clausius-Duhem inequality (second law of thermodynamics) in terms of
internal energy e and entropy η can be obtained from Eqs. (10) and (13) of Lecture-30.
De Dη 1 q·g
−θ − τ :D+ ≤ 0. (4)
Dt Dt ρ ρθ
Since the constitutive relations are not direct function of time, we have
De ∂e ∂e ∂e ∂e Dη ∂η ∂η ∂η ∂η
= ρ̇ + θ̇ + · ġ + : Ḋ and = ρ̇ + θ̇ + · ġ + : Ḋ.
Dt ∂ρ ∂θ ∂g ∂D Dt ∂ρ ∂θ ∂g ∂D
Substituting these relations in above Clausius-Duhem inequality, we get
! ! !
∂e ∂η ∂e ∂η ∂e ∂η
−θ ρ̇ + −θ θ̇ + −θ · ġ
∂ρ ∂ρ ∂θ ∂θ ∂g ∂g
!
∂e ∂η 1 q·g
+ −θ : Ḋ − τ :D+ ≤ 0.
∂D ∂D ρ ρθ
Let v be a spatial velocity field. Then the conservation of mass can be written as ρ̇ =
−ρ ∇x · v = −ρ tr(L) = −ρ I : L. Substitution of this relation yields
! ! !
∂e ∂η ∂e ∂η ∂e ∂η
−θ θ̇ + −θ · ġ + −θ : Ḋ
∂θ ∂θ ∂g ∂g ∂D ∂D
! !
∂e ∂η 1 q·g
+ − ρ − ρθ I− τ :D+ ≤ 0. (5)
∂ρ ∂ρ ρ ρθ
Clearly, the constitutive relations are not function of θ̇, ġ and Ḋ as shown in Eq. (3).
Since θ̇, ġ and Ḋ are arbitrary, we get
∂e ∂η ∂e ∂η ∂e ∂η
−θ = 0, −θ = 0, and −θ =O (6)
∂θ ∂θ ∂g ∂g ∂D ∂D
! !
∂e ∂η 1 q·g
ρ − ρθ I+ τ :D− ≥0 (7)
∂ρ ∂ρ ρ ρθ
Differentiating Eq. (6)1 with g and Eq. (6)2 with θ, we can write
∂ 2e ∂ 2η ∂ 2e ∂ 2η ∂η
−θ = 0 and −θ − = 0.
∂g∂θ ∂g∂θ ∂θ∂g ∂θ∂g ∂g
Combining these two equations yield
∂η
= 0. (8)
∂g
∂e
= 0. (9)
∂g
Thus, the internal energy and entropy are independent of temperature gradient g. Similar
steps applied to Eq. (6)1 and Eq. (6)3 , we get
∂η ∂e
= O and =O (10)
∂D ∂D
In conclusion, constitutive relation to internal energy and entropy reduces to
δ(ρ, θ, g, D) ≥ 0 (12)
The thermodynamic equilibrium is defined as the process that is independent of time and
also uniform. Physically, the thermodynamic equilibrium refers to mechanical equilibrium
(force balance), thermal equilibrium (no heat transfer between system and surroundings)
and chemical equilibrium (no change in reactants and products with time). For the
thermodynamic equilibrium, we have the following velocity and temperature fields.
D = O and g = 0. (15)
δ(ρ, θ, 0, O) = 0. (16)
It is clear from Eqs. (12) and (16) that the dissipation function δ is minimum at thermody-
namic equilibrium. In order to have minimum dissipation at thermodynamic equilibrium,
we need to have the necessary condition
∂δ ∂δ
= O and =0 (17)
∂D ∂g
and positive definite Hessian (second derivative of dissipation) for sufficiency condition.
The Hessian of dissipation function is given by
∂ 2δ ∂ 2δ
∂D∂D ∂D∂g
H=
. (18)
∂ 2δ ∂ 2δ
∂D∂g ∂g∂g
Taking derivatives to the dissipation function (see Eq. (13)), we get
!
∂δ ∂e ∂η ∂τ 1 ∂q
= τ + ρ2 −θ I+ :D− · g, (19)
∂D ∂ρ ∂ρ ∂D θ ∂D
∂δ ∂τ q 1 ∂q
= :D− − · g, (20)
∂g ∂g θ θ ∂g
where
! !
∂τ ∂τkl ∂q ∂qk
:D = Dkl , ·g = gk ,
∂D ij
∂Dij ∂D ij
∂Dij
! !
∂τ ∂τjk ∂q ∂qj
:D = Djk , and ·g = gj .
∂g i
∂gi ∂g i
∂gi
In above relations the subscript T .E. represents thermodynamic equilibrium. Since the
derivatives should vanish at thermodynamic equilibrium (see Eq. (17)), we have
τ = −pI and q = 0, (22)
T .E. T .E.
Since the free energy ψ = e − ηθ (see Lecture-30), the thermodynamic pressure can be
written as
∂ψ
p = ρ2 . (24)
∂ρ
In other words, the stress tensor can be decomposed into two parts: (i) thermodynamic
equilibrium stress (ii) viscous stress. Therefore, the stress tensor
τ = −pI + σ. (27)
• The constitutive relations for internal energy and entropy are given by
where !
2 ∂e ∂η
p=ρ −θ .
∂ρ ∂ρ
• Let σ = σ̃(X, ρ, θ, g, D) be viscous stress then the total stress is given by
τ = −pI + σ.
σ = σ̃(ρ, θ, g, D),
q = q̃(ρ, θ, g, D),
e = ẽ(ρ, θ),
η = η̃(ρ, θ).
The isotropic functions of viscous stress and heat flux have the following representation1
It is clear that both the viscous stress σ and also the heat flux q are dependent on rate
of deformation D and temperature gradient g.
Uncoupled viscous heat-conducting fluids:
In case of uncoupled system, we assume that viscous stress is independent of temperature
gradient and the heat flux is independent of rate of deformation. The constitutive relations
for uncoupled system can be written as
where ID represents set of three invariants of D. We can have two special cases: (i) rate
of deformation is zero (D = O) and temperature gradient is non-zero (g 6= 0) (ii) rate
of deformation is non-zero (D 6= O) and temperature gradient is zero (g = 0). We note
that later case is also known as Reiner-Rivlin fluid.
Heat conducting fluid:
Since the rate of deformation is zero (D = O), we only have heat conduction inequality
q · g ≤ 0 =⇒ β0 (ρ, θ, g · g, O) g · g ≤ 0. (32)
k = −β0 . (33)
1
Liu, I-S., Continuum Mechanics, Springer-Verlag, Berlin, 2002.
q = −kg. (34)
This relation is known as Fourier law of heat conduction for fluids. Furthermore, the
Fourier law is similar to both solids and also for fluids. Through this law is important for
fluids, there are many instances where convective heat transfer is more than conduction.
Thus, in many practical problems involving fluids the conduction can be ignored. There
are few problems where the conduction is important. For example if the fluid does not have
sufficient space (thin fluid layer between walls) to convect then conduction is important.
Reiner-Rivlin fluids:
In this case, we assume that the temperature gradient is zero (g = 0). The constitutive
relation for the viscous stress can be written as
σ : D ≥ 0. (36)
References
Recall the general constitutive relations for heat conducting fluid from Lecture-38. In
this lecture, we specialize these general constitutive relations to the linear, viscous heat-
conducting fluid. The linear, viscous heat-conducting fluid is also known as classical
heat-conducting fluid. The classical heat-conducting fluid without temperature gradient
is known as Newtonian fluid.
We recall from Lecture-38 that the Reiner-Rivlin fluid is the general heat-conducting
fluid without temperature gradient. Therefore, the Newtonian fluid model can also be
thought of the linearized Reiner-Rivlin fluid model. In conclusion, either ignoring temper-
ature gradient in classical heat-conducting fluid model or linearizing Reiner-Rivlin fluid
model, we get Newtonian fluid model.
From above discussion, clearly, the Newtonian fluid model is a special case of Reiner-
Rivlin fluid model. However, there are no practical fluids that can be model with the
Reiner-Rivlin fluid model but the Newtonian fluid model. On the other hand, the non-
Newtonian fluids like polymer solutions, blood, etc., need to be accounted for long-range
time history through higher time derivatives of deformation gradient. Since the Reiner-
Rivlin fluid model based short-range time history, it is not suitable for non-Newtonian
fluids. Thus, the Newtonian fluid model is an important model in comparison with the
Reiner-Rivlin fluid model. Since linear models are important, in this lecture, the consti-
tutive relations for the classical heat-conducting fluid model are discussed.
Linear (classical) viscous heat-conducing fluid:
We now consider a special class of viscous heat-conducting fluids obeying linear consti-
tutive relationship with respect to rate of deformation D and temperature gradient g.
The following linear constitutive relations for viscous stress and heat flux can be obtained
from the general relations (see Eq. (31) in Lecture-38).
σ = λ (tr D)I + 2µ D, (1)
q = −k g, (2)
where λ = λ̃(ρ, θ) is known as dilatational viscosity, µ = µ̃(ρ, θ) is known as shear viscosity
or dynamic viscosity, and k = k̃(ρ, θ) is known as coefficient of thermal conductivity.
Recall the following dissipation function from Lecture-38.
q·g
δ(ρ, θ, g, D) = σ : D − , (3)
θ
where σ is viscous stress, q is heat flux, D is rate of deformation, θ is temperature and
g is temperature gradient.
∂D∂D ∂D∂g
H=
, (4)
∂ 2δ ∂ 2δ
∂D∂g ∂g∂g
we derive the bounds on λ, µ, and k.
Since the viscous stress σ is independent of g and the heat flux q is independent of D
(see Eqs. (1) and (2)), it is easy to see that the cross derivative terms involved in Hessian
H are zero, i.e., in Eq. (4)
∂ 2δ ∂ 2δ
= 0 and = 0.
∂D∂g ∂g∂D
∂ 2δ ∂ 2δ
Therefore, it is sufficient to have positive definiteness of and to have semi-
∂D∂D ∂g∂g
positive definiteness of Hessian H. The positive definiteness implies
! !
∂ 2δ ∂ 2δ
Sij Skl > 0 and ui uj > 0. (5)
∂Dij ∂Dkl ∂gi ∂gj
for every non-zero arbitrary symmetric second-order
! tensor S and for every non-zero
∂ 2δ
arbitrary vector u. Of course Sij Skl = 0 if and only if S = O and
!
∂Dij ∂Dkl
∂ 2δ
ui uj = 0 if and only if u = 0.
∂gi ∂gj
Since the viscous stress tensor σ is not a function of g, using Eqs. (2) and (3), we get
∂ 2δ
= kI. (6)
∂g∂g
Substituting the relation in Eq. (5)2 , we obtain
k > 0. (7)
Since q is not a function of D, taking derivative of dissipation function δ (see Eq. (3))
with respect to D, we get
!
∂δ ∂δ ∂σpq
= = σij + Dpq . (8)
∂D ij
∂Dij ∂Dij
The symmetric tensor S can be decomposed into spherical and deviatoric parts. Let
deviatoric part of S be S̃. Then
tr(S)
S̃ = S − I. (13)
3
Substituting this relation in above inequality and using property I : S̃ = 0, we get
2
λ + µ (tr S)2 + 2µS̃ : S̃ > 0. (14)
3
Choosing S = I, we get
2
λ + µ > 0, (15)
3
and choosing symmetric tensor S such that tr(S) = 0, we get
µ > 0. (16)
Starting with Eq. (12), we can have an alternative proof for bounds on dilatational and
shear viscosities which is presented in the following problem.
s · M s > 0. (18)
A known result from linear algebra (see Lecture-11) that above positive definiteness of
matrix M , i.e., Eq. (18), implies all eigenvalues are positive. Furthermore, converse is
also true. Thus, we have
3λ + 2µ > 0 and µ > 0.
This completes the proof.
The term (∇v)v contributes to the nonlinearity with respect to velocity in the Navier-
Stokes equation. Furthermore, the greatest difficulty involved in getting analytical solu-
tion is due to this nonlinear term.
Specific volume and specific heat capacity:
We now define two quantities known as specific volume and specific heat at constant
volume. The specific volume v is defined as the reciprocal of density, i.e.,
1
v= . (22)
ρ
It is clear that constant density is also equivalent to constant specific volume. The specific
heat capacity at constant volume is defined by
!
∂e
cv = . (23)
∂θ v
We note that subscript v represents the thermodynamic process with constant volume.
Physically, the specific heat at constant volume indicates the increase of internal energy
required to raise one degree of temperature in Kelvin scale.
Clearly, the specific heat can be function of both temperature and density as internal
energy e = ẽ(ρ, θ). In practice, there is a change in specific heat with temperature but
the change is small. So, usually, in engineering problems the specific heat is assumed to
be constant.
Energy balance:
We now derive the energy balance in terms of specific heat, pressure, temperature, viscous
stress, heat flux and heat generation. Consider the first law of thermodynamics (see Eq.
(10) in Lecture-29)
De
ρ = τ : D − ∇ · q + ρQh . (24)
Dt
Substitution of the decomposition of Cauchy stress, i.e., τ = −pI + σ, in energy balance
yields
De
ρ = −pI : D + σ : D − ∇ · q + ρQh .
Dt
Since I : D = tr(D) = ∇ · v, we get
De
ρ + p∇ · v = σ : D − ∇ · q + ρQh . (25)
Dt
We note that the boldface letter v is velocity and lightface letter v is specific volume.
Since the internal energy e = ẽ(ρ, θ), the material time derivative can be written as
De ∂e ∂e
= θ̇ + ρ̇.
Dt ∂θ ∂ρ
Using Eq. (23) and mass conservation ρ̇ = −ρ∇ · v, we get
De ∂e
= cv θ̇ − ρ (∇ · v). (26)
Dt ∂ρ
As defined in Lecture-38, the thermodynamic pressure
!
2 ∂e ∂η
p=ρ −θ .
∂ρ ∂ρ
Substituting constitutive relations for classical viscous heat conducting fluid, i.e., σ =
λtr(D)I + 2µD and q = −k∇θ, we get
!
Dθ ∂p
ρcv = −θ (∇ · v) + λ(∇ · v)2 + 2µD : D + k∇2 θ + ρQh . (30)
Dt ∂θ v
Dρ
+ ρ∇ · v = 0,
Dt
or, alternatively,
∂ρ
+ ∇ · (ρv) = 0.
∂t
The Navier-Stokes equations:
!
∂v
ρ + (∇v)v = −∇p + (λ + µ)∇(∇ · v) + µ∇ · (∇v) + ρb.
∂t
Energy equation:
!
Dθ ∂p
ρcv = −θ (∇ · v) + λ(∇ · v)2 + 2µD : D + k∇2 θ + ρQh ,
Dt ∂θ v
Equation of state:
f (p, ρ, θ) = 0. (31)
We have six equations and six unknown fields, i.e., the velocity field v, pressure field p,
temperature field θ and density field ρ. So we can solve the problem with appropriate
boundary conditions. Furthermore, the following differential equation can be used for
evaluating entropy production.
Dη
ρθ = λ(∇ · v)2 + 2µD : D + k∇2 θ + ρQh . (32)
Dt
References
In this lecture, we discuss the governing equations for incompressible Newtonian fluid.
The density remains constant throughout the motion, i.e., ρ = ρ0 , for incompressible
fluid. In addition, if the density is same at every point then it is called homogeneous
incompressible fluid. In the following discussion, we assume that the fluid is homogeneous
incompressible.
Kinematic constraint:
The incompressibility is a kinematic constraint on a fluid which can be written as
where v is the velocity field and D is rate of deformation tensor. The hydrostatic pressure
p developed in the fluid as reaction to incompressible constraint. However, the hydrostatic
pressure cannot be determined by the constitutive law as density is invariant under the
stress. Furthermore, the pressure can only be determined by the boundary conditions and
equations of motion.
Constitutive relations:
Recall the following constitutive relations for Cauchy stress, heat flux, internal energy,
and entropy from Lecture-38.
τ = τ̃ (X, θ, g, D)
q = q̃(X, θ, g, D)
e = ẽ(X, θ, g, D)
η = η̃(X, θ, g, D), (2)
Since the density is constant for in compressible fluid, the density does not appear in
the argument of constitutive relations. Recall from Eqs. (8) and (9) in Lecture-38 that
the internal energy and the entropy for viscous heat-conducting fluid are independent of
temperature gradient. Now following same procedure that was discussed in Lecture-38
for incompressible fluid, we can also show that the constitutive relations for the internal
energy and the entropy are independent of temperature gradient, i.e.,
and
δ(θ, g, D) ≥ 0. (5)
tr(τ )
Let τ be Cauchy stress. Then the hydrostatic pressure p = − and the deviatoric
3
stress σ = τ + pI. Then the stress power
σ : D = (pI + τ ) : D
= −p (tr D) + τ : D
= τ :D (Since tr(D) = 0) . (6)
∇ · v = 0.
Navier-Stokes equation:
!
∂v
ρ0 + (∇v)v = −∇p + µ∇ · (∇v) + ρ0 b.
∂t
Energy equation:
Dθ
= 2µD : D + k∇2 θ + ρ0 Qh ,
ρ0 cv
Dt
The entropy production can be evaluated using
Dη
ρ0 θ = 2µD : D + k∇2 θ + ρ0 Qh . (8)
Dt
Clearly, the governing equations for incompressible fluids are simple when they compared
with the general governing equations that are presented in Lecture-39. Since there are five
unknowns (i.e., velocity, pressure and temperature) and five governing equations (i.e., one
continuity equation, three Navier-Stokes equations, and one energy balance), the problem
is solvable with appropriate boundary conditions. In general, the shear viscosity µ is
assumed to be independent of temperature so that the energy equation is decoupled from
continuity and Navier-Stokes equations.
Bernoulli’s theorem:
The Bernoulli’s theorem (or the Bernoulli’s principle) is a special case of conservation
of energy. This principle can also be derived from momentum equation for incompress-
ible fluids. However, we derive the Bernoulli’s equation from energy equation so that it
holds for both compressible and incompressible fluids. We need to have the following
assumptions to derive the result.
!
∂α
• The flow is steady state, i.e, any flow quantity α is independent of time =0 .
∂t
• Fluid is inviscid, i.e., viscous stress σ = O.
• Body forces are conservative, i.e., there exist a scalar function φ such that the body
force b = ∇φ.
Let us consider the conservation of energy (see Eq. (10) in Lecture-29), i.e.,
De
ρ = τ : D − ∇ · q + ρQh .
Dt
Since q = 0 and Qh = 0, above equation reduces to
De
ρ = τ : D. (11)
Dt
Using the relation between total time derivative and spatial time derivative (see Lecture-
De ∂e ∂e
20), we have = + v · ∇e. Since the flow is steady state, i.e., = 0, and τ = −pI,
Dt ∂t ∂t
Eq. (11) can be written as
ρv · ∇e = −p(∇ · v).
Subtracting v · ∇p on both sides of equation, we get
ρv · ∇e − v · ∇p = −p(∇ · v) − v · ∇p
!
1
= −pρv · ∇ − v · ∇p (using Eq. (9))
ρ
!
p
= −ρv · ∇
ρ
!
v·v p
=⇒ ρv · ∇e + ρv · ∇ − ρv · ∇φ = −ρv · ∇ (using Eq. (10))
2 ρ
We note that the velocity vector is a tangent to the streamline at any point in fluid. Let
s be the arc length of streamline. Then the above equation can be written as
!
p v·v
0 = ρv · ∇ + −φ+e
ρ 2
!
vi ∂ p v·v
= ρ|v| + −φ+e
|v| ∂xi ρ 2
!
dxi ∂ p v·v
= ρ|v| + −φ+e
ds ∂xi ρ 2
!
d p v·v
= ρ|v| + −φ+e .
ds ρ 2
This implies
p v·v
+ − φ + e = constant along streamline.
ρ 2
We have not made any assumption about the compressibility of fluid and hence it is valid
for both compressible and incompressible fluids. Under gravitational body force field,
φ = −gz, above equation reduces to the famous Bernoulli’s equation.
p v·v
+ + gz + e = constant along streamline. (13)
ρ 2
h = e + pv, (14)
1
where v is specific volume. Recall from Lecture-39 that the specific volume, v = .
ρ
Substituting the enthalpy in Eq. (13), we get the following form of Bernoulli’s equation
for compressible fluids.
v·v
h+ + gz = constant along streamline. (15)
2
Applying the Bernoulli’s equation for two points on a streamline, we get
v1 · v1 v2 · v2
h1 + + gz1 = h2 + + gz2 . (16)
2 2
Bernoulli’s equation for incompressible fluids:
If the fluid is incompressible then we have
Dρ ∂ρ
= 0 =⇒ + v · ∇ρ = 0.
Dt ∂t
In addition to incompressible if the flow is steady state then we get v · ∇ρ = 0. Thus, the
density is constant along streamline.
We have the kinematic constraint ∇ · v = 0 (see Eq. (1)) if the fluid is incompressible.
The conservation of energy (with all assumptions stated in Bernoulli’s theorem) shown in
Eq. (11) can be written as
De
= −p∇ · v (Since τ = −pI). (17)
Dt
De
The incompressible condition (∇ · v = 0) implies = 0. The relation between total
Dt
De ∂e
time derivative and spatial time derivative = + v · ∇e along steady state condition
Dt ∂t
∂e
= 0, we get v · ∇e = 0. This relation shows that the internal energy is constant along
∂t
each streamline. If the internal energy is constant then Eq. (15) reduces to the following
Bernoulli’s equation for incompressible fluid.
p v·v
+ + gz = constant along streamline. (18)
ρ 2
Applying Eq. (18) at two points on a streamline for incompressible fluid, we get the
following form of Bernoulli’s equation.
p1 v1 · v1 p2 v2 · v2
+ + gz1 = + + gz2 . (19)
ρ 2 ρ 2
This equation is usually referred as Bernoulli’s equation.
Reference