Professional Documents
Culture Documents
Understanding that the dynamics used by Boltzmann In the following we analyze the Maxwell derivation of
in deriving his equation is stochastic, and not deter- the distribution of velocities that bears his name and
ministic, we may say, retrospectively, that the paradox the Boltzmann derivation of his equation as well as his
of irreversibility is resolved, or rather, that it becomes theorem of irreversibility. After that, we show that the
immaterial. The stochastic aspect of kinetic theory that probabilistic and stochastic reasoning they used can be
we advocate here is generally overlooked by authors employed to derive a Kolmogorov equation associated
analyzing the early development of kinetic theory. They with a stochastic dynamics [21, 22]. These results allow
tend to emphasize the deterministic Newtonian motion, us to say that the kinetic theory can be understood as
supposedly adopted by Maxwell and Boltzmann, as well stochastic thermodynamics [28–32] avant la lettre.
as the paradox of irreversibility to the detriment of the
stochastic aspect. 2. Maxwell Distribution
The texts of Maxwell and Boltzmann sometimes
appear to be exempt of probabilistic reasonings. This The probabilistic approach to kinetic theory was explic-
happens because in these cases the texts were written itly stated by Maxwell in his comment concerning the
in a language in which the term probability is not collision of two molecules [33]. When dealing with an
explicit mentioned. For instance, a typical proposition enormous number of collisions occurring in a small inter-
of Maxwell [10, 11] is: “let the number of particles for
val of time the problem becomes insurmountable and,
which x lies between x and x + dx, be N f (x)dx”. This
says Maxwell [33]: “We are therefore obliged to abandon
should be understood as: “let f (x)dx be the probability
the strictly kinetic method and to adopt the statistical
of the position of a particle liesbetween x and x + dx”.
method”. The statistic method, which is founded on
This peculiar language which is still used today does
probability theory, was used in several sciences including
not mean that the authors did not wish to base kinetic
physical sciences, one application of which was the
theory on probability grounds. On the contrary. They
description of errors. But it was Maxwell who used it as a
employed this type of discourse simply because it was
description of the physical processes themselves [23, 34].
the probabilistic language of that time.
In this sense, Maxwell is the one who introduced the
The key to recognize that the content of the proposi-
probabilistic approach in theoretical physics [23, 34].
tions of kinetic theory is probabilistic lies in the distinc-
Maxwell [10, 11] introduced his distribution of veloc-
tion we must make between probability and frequency of
ities by stating that the number of particle whose
an event [23]. The first term refers to an abstract concept
Cartesian coordinates lies between v1 and v1 + dv1 , v2
which can be understood as a nonnegative number
and v2 + dv2 , and v3 and v3 + dv3 is N f (v)dv1 dv2 dv3
associated with a space of events. Its main property
where
is that the sum of the probabilities of the elementary
events that make up the whole space of events equals 2 2 2
f (v) = ce−(v1 +v2 +v3 )/α ,
2
(1)
the unity. The space of events are usually identified as a
set of real numbers in which case a variable that take any which is understood as the probability density distribu-
√
value on this set is called a random variable. When this tion of the velocity v = (v1 , v2 , v3 ), and c = 1/α3 π 3 .
variable depend on time, one uses the term stochastic The probability distribution h(v) of the absolute
variable. The second term, frequency, refers to the real value of the velocity is related to f (v) by h(v)dv =
counterpart of probability, or its interpretation, or yet f (v)dv1 dv2 dv3 from which follows
how it is measured. Some probabilistic propositions, such
as that of Maxwell mentioned above, were written using h(v) = 4πcv 2 e−v
2
/α2
. (2)
words akin to frequency of an event.
The probabilistic character of kinetic theory was To determine the probability distribution g(ε) of the
pointed out by Jeans in his book of kinetic theory kinetic energy ε = mv 2 /2 we use the relation g(ε)dε =
published in 1904 who considered it to be a conse- h(v)dv from which we obtain
quence of molecular chaos [5]. The stochastic assumption
√
implicit in the Boltzmann equation was called the g(ε) = c′ ε e−βε , (3)
collision number hypothesis (Stosszahlansatz) by Paul √
and Tatiana Ehrenfest [24, 25]. In a paper of 1907 [26], β = 2/mα2 and c′ = 2β 3/2 / π.
they expressed their views on the irreversibility of the Maxwell made two attempts to derive his distribu-
Boltzmann equation, contained in the H-theorem, using tion of velocities. The first attempt, contained in his
an analogy with the urn model that they introduced and paper of 1860 [10, 11], is based on the assumption of
which is clearly an example of a stochastic dynamics. isotropy which means that the probability distribution
It is worth pointing out yet that the Gibbs statistical f (v) cannot depend on the choice of directions of the
mechanics, which can be understood as kinetic theory of Cartesian axes. Thus f (v) should be a function of
system in thermodynamic equilibrium, has a probabilis- v = |v|, which is invariant by any change of the Cartesian
tic character which is explicitly mentioned in the title of axes. From the isotropy, Maxwell concludes that the
his book on the subject published in 1902 [27]. independent Cartesian components are also statistically
Revista Brasileira de Ensino de Física, vol. 46, e20230322, 2024 DOI: https://doi.org/10.1590/1806-9126-RBEF-2023-0322
Oliveira e20230322-3
independent leading him to the result that f (v) should when the number of pairs of molecules that changes
be a product of the probabilities of each one of the from (v, u) → (v′ , u′ ) is equal to the number of the
Cartesian components. Using the factorization of f (v), reverse process (v′ , u′ ) → (v, u). This is equivalent to
he was able to derive the distribution (1). the condition
Although the assumption of isotropy leads us to
conclude that the velocity distribution is a function of f1 (v)f2 (u)F = f1 (v′ )f2 (u′ )F ′ , (5)
the absolute value of the velocity, it does not necessarily
implies that the independent Cartesian components of where here F stands for the transition probability rate
the velocity are statistically independent. Therefore one F (v − u, v′ − u′ ) of the direct process and F ′ stands
cannot necessarily conclude from isotropy that f (v) for the transition probability rate F (v′ − u′ , v − u)
factorizes into the distributions of the Cartesian com- of the reverse transition. The equality between (5) is
ponents, and this demonstration of Maxwell cannot be identified as the reversibility condition usually called
considered to be a valid one [35]. If this reasoning were detailed balance condition.
correct then any isotropic system of particles, whatever Next, Maxwell makes the probabilistic assumption
may be its dynamics, would obey the Maxwell distribu- that the transition probability rate of the direct process
tion [35]. Despite its incorrectness this demonstration is equal that of the reverse process, that is, F = F ′ , from
still repeated in some text books of physics dealing with which follows the relation
kinetic theory.
f1 (v)f2 (u) = f1 (v′ )f2 (u′ ). (6)
In the second attempt, contained in his paper of
1867 [12], Maxwell used probabilistic concepts that we Using this relation, Maxwell derives his distribution of
recognize as the fundamental concepts of a stochastic velocities using the conservation of the kinetic energy
process, and which we will examine in the following. before and after collision, given by (72). Comparing this
One of the stochastic concepts used by Maxwell is the expression with the logarithm of (6), one concludes that
concept of transition rate probability which is implicit
in the following proposition concerning the encounters ln f1 (v) = c1 − βm1 v 2 /2,
ln f2 (u) = c2 − βm2 u2 /2,
of two molecules [12]: (7)
which may be written in the form
The number of such encounters in unit time
will be n1 n2 F de where n1 and n2 are the 2 2
f1 (v) = c′1 e−βm1 v /2
, f2 (u) = c′2 e−βm2 u /2
, (8)
number of molecules of each kind under
consideration, and F is a function of the which is the Maxwell distribution of velocities.
relative velocity and of the angle θ, and de If we denote by v1 , v2 , v3 the Cartesian components
depends on the limits of variation within of v, then the Maxwell probability density distribution
which we class encounters of the same kind. is written as
According to this statement, F is a function of |v − 2 2 2
f (v1 , v2 , v3 ) = ce−βm(v1 +v2 +v3 )/2 , (9)
u| = |v′ − u′ | and of the angle θ which is half of the
angle between the relative velocities v − u and v′ − u′ . where c = (βm/2π)3/2 . From this distribution we may
These two dependences is equivalent to saying that F obtain the probability distribution of the absolute value
is a function of v − u and v′ − u′ . Maxwell then writes v = (v12 + v22 + v32 )1/2 of v, which is
n1 = f1 (v, t)dV and n2 = f2 (u, t)dV from which follows
that the number of encounters is proportional to
βm
3/2
2
h(v) = 4πc v 2 e−βmv /2
. (10)
′ ′
f1 (v, t)f2 (u, t)F (v − u, v − u )τ, (4) 2π
where τ is a small interval of time. This expression is It is worth writing the Maxwell distribution in terms of
interpreted as the probability density that at a given the kinetic energy x = mv 2 /2, which is
instant of time t the velocities of the molecules are √
g(x) = c′ x e−βx , (11)
(v, u) and that at an earlier time t − τ they are (v′ , u′ ).
The product f1 (v, t)f2 (u, t) is the probability density √
where c′ = 2β 3/2 / π.
that at time t the first particle has velocity v and Let us write the reversibility condition (5) as
the second has velocity u. Finally, the quantity F is
interpreted as the probability density that the particles F′ f1 (v)f2 (u)
have velocities (v, u) at time t, given that that at an = . (12)
F f1 (v′ )f2 (u′ )
earlier time t − τ they had velocities (v′ , u′ ). In other
words F is the transition probability rate of the process We have seen above that Maxwell assumed F = F ′ from
(v, u) → (v′ , u′ ), which he obtained the relation (6) and from this relation
In thermodynamic equilibrium, the state of a gas he reached the distribution of velocities. We could of
becomes unchanged. Maxwell tell us that this happens course assume the Maxwell distribution of velocities and
DOI: https://doi.org/10.1590/1806-9126-RBEF-2023-0322 Revista Brasileira de Ensino de Física, vol. 46, e20230322, 2024
e20230322-4 Kinetic theory of gases: stochastic thermodynamics avant la lettre
then conclude that F = F ′ , provided that the kinetic The change in the energy of the molecules is due
energy is conserved. to the elastic collision between two molecules. These
Let us suppose that instead of the velocities we use the collisions change the kinetic energies of the molecules
kinetic energy to describing the state of the molecules. while keeping their sum constant. The determination of
In this case, the reversibility condition is written as dA is thus reduced to the determination of the number of
collisions occurring in the interval τ such that after the
G′ g1 (x)g2 (y)
= , (13) collision one of the molecules have energy between x and
G g1 (x′ )g2 (y ′ ) x + dx. Similarly, dB is determined by considering that
where x and y are the kinetic energies of the molecules before the collision the energy of one of the molecules
before a collision and x′ and y ′ after the collision, and the have energy between x and x + dx.
g1 and g2 are the distribution of energies given by (11), Let x and y be the kinetic energies of two colliding
and G and G′ are the transition rates of the direct and molecule and x′ the kinetic energy of one of the molecule
reverse processes. Using (13) and bearing in mind the after the collision. The kinetic energy y ′ of the other
conservation of energy, x + y = x′ + y ′ , we find molecule becomes determined by the conservation of the
√ kinetic energy x′ + y ′ = x + y. Let us denote by dn the
G′ xy
= √ ′ ′, (14) number of collisions that takes place during an interval
G xy τ such that the kinetic energies of one of the molecules
which is the ratio between the transition rates when the before the collision lies between x and x + dx and the
states of the molecules is written in term of the kinetic other is between y and y + dy. Boltzmann argues that
energy instead of the velocity. this number is proportional to g(x, t)dx, to g(y, t)dy, and
to a factor ψ(x, y; x′ , y ′ )dx′ which depends on the nature
3. Boltzmann equation of the collision. Therefore,
dn = τ g(x, t)dxg(y, t)dyψ(x, y; x′ , y ′ )dx′ , (16)
In the very first paragraph of his paper of 1872 [8, 13],
Boltzmann declares the probabilistic character of kinetic where y ′ = x+y−x′ . If we integrate in all possible values
theory. He says that “the problems of the mechanical of y and x′ , determined by the restriction 0 ≤ x′ ≤ x+y,
theory of heat is a problem of probability theory”. we find the number of collisions in unit volume occurring
He then continues stating that a result of the theory in the interval τ such that the kinetic energy is between
of probability, “like the result of any other theory, is a x and x + dx, which is the quantity dB, that is
necessary consequence of definite premises,” and that the Z ∞ Z x+y
result is confirmed on account of the enormous number of dB = τ dx g(x, t)g(y, t)ψ(x, y; x′ , y ′ )dydx′ .
molecules involved. In other words, Boltzmann says that 0 0
(17)
the theory of probability is an essential part of kinetic
To write an expression for dA, we have to consider
theory and not just a method for obtained incomplete
the the number of collisions that takes place during an
or approximate laws.
interval τ such that the kinetic energy of one of the
In his paper of 1872 [8, 13], Boltzmann retakes the
molecules after the collision lies between x and x + dx,
Maxwell problem of finding the final distribution of
and the energies of the molecules before the collision lies
velocities of a gas. In this process he reaches the equation
between ξ and ξ + dξ, and between ξ ′ and ξ ′ + dξ ′ , which
that bears his name. Instead of using the velocity v to
we denote by dν. Boltzmann argues that this quantity
define the dynamic state of a molecule, as did Maxwell,
is given by
Boltzmann chose the kinetic energy x = mv 2 /2. In a
second part of the paper, however he used the velocity dν = τ g(x′ , t)dx′ g(y ′ , t)dy ′ ψ(x′ , y ′ ; x, y)dx, (18)
as the dynamic state of a molecule. In the following we ′ ′
where y = x + y − x, and that
derive his equation using the space of kinetic energies
Z ∞ Z x+y
and after that, the space of velocities.
Let g(x, t)dx be the number of molecules in unit dA = τ dx g(x′ , t)g(y ′ , t)ψ(x′ , y ′ ; x, y)dx′ dy,
0 0
volume such that at time t the molecules have kinetic (19)
energy between x and x + dx. The change in the number where y ′ = x + y − x′ .
of molecules per unit volume during an interval of time Replacing the expressions for dA and dB in equation
τ is g(x, t + τ )dx − g(x, t)dx. To find this variation we (15), dividing by τ and taking the limit τ → 0, we find
determine the number of molecules per unit volume that, Z
∂
in the interval τ , acquires the energy between x and g(x, t) = {g(x′ , t)g(y ′ , t)ψ(x′ , y ′ ; x, y)
x + dx, which we denote by dA, and the number of ∂t
molecules per unit volume that, in the same interval τ , − g(x, t)g(y, t)ψ(x, y; x′ , y ′ )}dx′ dy, (20)
loses energies between x and x + dx, which we denote by ′ ′
where y should be replaced by x+x −y and the integral
dB. We may thus write
carried out in the region y ≥ 0 and x′ ≤ x + y, which is
g(x, t + τ )dx − g(x, t)dx = dA − dB. (15) the Boltzmann equation.
Revista Brasileira de Ensino de Física, vol. 46, e20230322, 2024 DOI: https://doi.org/10.1590/1806-9126-RBEF-2023-0322
Oliveira e20230322-5
Let N (t) be the integral of g(x, t) in the variable x. If which Boltzmann calls the fundamental equation for the
we take the time derivative of N , and replace ∂f /∂t by time variation of g(x, t). We recall that in this equation
the right-hand side of (20) we find dN/dt = 0. It means one should make the replacement y ′ = x + x′ − y.
that N is a constant in time, and we may choose its value The detailed balance condition is represented by the
to be the unity. We may write then equality of the expressions (22) and (23) or equivalently
Z ∞ by the vanishing of the integrand of (26), which is
g(x, t)dx = 1, (21) g(x′ ) g(y ′ ) g(x) g(y)
0 √ √ ′ = √ √ . (27)
x ′ y x y
and that g(x, t) can indeed be understand as a probabil-
ity density distribution. To find f from this equation, we should keep in mind
The crucial quantity in the derivation of the Boltz- that this equation should be consistent with the con-
mann equation is ψ(x, y; x′ , y ′ ), understood as the tran- servation of kinetic energy x′ + y ′ = x + y. Comparing
sition rate probability from from (x, y) to (x, y ′ ), that this equation with the logarithm of the equation above
is, proportional to the conditional probability density we find
that the molecules have energies x′ and y ′ at a certain 1
ln g(x) − ln x = c − αx. (28)
time t, given that they had energies x and y at an 2
earlier time t − τ . This quantity cannot be derived from or
pure mechanics reasoning alone, but should be consistent √
with the conservation of mechanical energy, which, in g(x) = C xe−αx . (29)
the present case of hard spheres is the kinetic energy, Recalling that x = mv 2 /2 is the kinetic energy of a
x + y = x′ + y ′ . molecule, we see that this is no other than the Maxwell
It is worth mentioning that dn equals τ dxdydx′ distribution of velocities written in terms of the kinetic
multiplied by energy.
It is worth writing the Boltzmann equation (26) as
g(x, t)g(y, t)ψ(x, y; x′ , y ′ ), (22)
g(x′ , t) g(y ′ , t)
Z
∂
which is the first term of the integrand in (20), and that g(x, t) = √ √ ′
∂t x′ y
dν equals τ dxdydx′ multiplied by
g(x, t) g(y, t)
g(x′ , t)g(y ′ , t)ψ(x′ , y ′ ; x, y), (23) − √ √ ϕ(x, y; x′ , y ′ )dydx′ dy ′ ,
x y
which is the second term of the integrand in (20). (30)
The first term above is proportional to the probability where now we are including y ′ as the third variable of
density that the molecules have energies x′ and y ′ at integration, with the understanding that the integral is
a certain time t and that they have energies x and carried out over the surface y + y ′ − x′ = x of the space
y at an earlier time t − τ . A similar interpretation spanned by the variables x′ , y, and y ′ , such that y ≥ 0,
is obtained for the second term above by exchanging x′ ≥ 0, and y ′ ≥ 0.
(x′ , y ′ ) and (x, y). The equalities of these two quantities We now consider the space of velocities. Let f (v)dv
is the condition of detailed balance used by Maxwell be the probability of finding a molecule with velocity
to reach his distribution of velocities, and also used by v withe Cartesian coordinates v1 , v2 , and v3 inside the
Boltzmann as we shall see below. element dv = dv1 dv2 dv3 . If (v, u) are the velocities of
Boltzmann adopts a form for ψ(x, y; x′ , y ′ ) which is two molecules before a collision and (v′ , u′ ) the velocities
equivalent to after the collision, we assume the probability transition
ϕ(x, y; x′ , y ′ ) rate is that given by Maxwell which we denoted by F (v−
ψ(x, y; x′ , y ′ ) = √ , (24) u, v′ − u′ ) and we recall that |v − u| = |v′ − u′ |. In other
xy
words, if this condition is not fulfilled, F vanishes. We
where ϕ(x, y; x′ , y ′ ) = ϕ(x′ , y ′ ; x, y), from which we get remember that this condition is a consequence of the
√ ′ ′ conservation of the kinetic energy, which in the case of
ψ(x, y; x′ , y ′ ) xy molecules with the same mass is
= √ , (25)
ψ(x′ , y ′ ; x, y) xy
v 2 + u2 = v ′2 + u′2 . (31)
and we see that ψ coincides with G given by equation
Using the same reasonings employed before in the case
(14). Replacing the form (24) in (20) gives
of the space of kinetic energy, we arrive at the following
∂
Z ∞ Z x+y
g(x′ , t) g(y ′ , t) equation for the time evolution of f (v, t,
g(x, t) = √ √ ′ Z
∂t x′ y ∂
0 0
f (v, t) = {f (v′ , t)f (u′ , t) − f (v, t)f (u, t)}
g(x, t) g(y, t)
∂t
− √ √ ϕ(x, y; x′ , y ′ )dydx′ , (26)
x y × F (v − u, v′ − u′ )dudv′ du′ , (32)
DOI: https://doi.org/10.1590/1806-9126-RBEF-2023-0322 Revista Brasileira de Ensino de Física, vol. 46, e20230322, 2024
e20230322-6 Kinetic theory of gases: stochastic thermodynamics avant la lettre
where the integral is performed in the surface defined If we now we now perform the changes x ↔ x′ and
by (31). y ↔ y ′ , we find an expression which added which added
The detailed balance condition is now represented by with the previous expression gives
the equation Z
dE 1 g2 (x, y)
f (v′ )f (u′ ) = f (v)f (u), (33) dt
=
4
ln
g2 (x′ , y ′ )
which is the same as the Maxwell condition given by × {g2 (x′ , y ′ ) − g2 (x, y)}ϕ(x, y; x′ , y ′ )dxdx′ dydy ′ .
equation (6). Therefore, we obtain from this condition (39)
the Maxwell distribution of velocities as we did above
from equation (6). It should be remarked that all transformations above
leaves ϕ(x, y; x′ , y ′ ) invariant.
4. Theorem of Irreversibility Let us write the integrand as
g2 ′
As we have seen, Boltzmann derived the Maxwell dis- I = ln (g − g2 )ϕ. (40)
tribution (29) by considering the stationary solution g2′ 2
of equation (26) which is given by the condition of If g2′ ≥ g2 than ln(g2 /g2′ ) ≤ 0 and I ≤ 0 because ϕ ≥ 0.
irreversibility or detailed balance (27). He then asked If g2′ ≤ g2 than ln(g2′ /g2 ) ≥ 0 and and again I ≤ 0.
whether (29) would be the asymptotic solution of (26) Therefore we conclude that the dE/dt ≤ 0, that is, E can
for any initial condition. To answer this question he never decrease, and the the Boltzmann irreversibility
introduced the quantity theorem becomes demonstrated.
Z ∞
g(x, t) We now consider the theorem of irreversibility written
E= g(x, t) ln √ − 1 dx, (34)
0 x in terms of the probability defined in the space of
velocities. In this case the quantity E is defined by
and demonstrated that this quantity can never decrease,
dE/dt ≥ 0. We call this proposition the irreversibility Z
theorem but it is usually called Boltzmann H-theorem. E = f (v) ln f (v)dv. (41)
Boltzmann used the letter E in his original paper of
1872, as we did above, but some years later someone Deriving this expression with respect to time,
denoted it by the letter H, and Boltzmann himself ended Z
up using the letter H in his book on kinetic theory. dE ∂f (v)
= ln f (v)dv. (42)
Boltzmann remarks that one could use the expression dt ∂t
Z ∞
g(x, t) Replacing ∂f /∂t given by the Boltzmann equation (32),
E1 = g(x, t) ln √ dx, (35)
0 x we get
which differs from E by a constant as the integral of f dE
Z
equals the unity. Deriving E with respect to time, we = ln f (v){f (v′ , t)f (u′ , t) − f (v, t)f (u, t)}
dt
find
dE
Z ∞
∂g(x, t) g(x, t) × F (v − u, v′ − u′ )dudv′ du′ dv. (43)
= ln √ dx. (36)
dt ∂t x
0
Performing the exchange of variables v ↔ u and v′ ↔
To reach this result we should bear in mind that the u′ , and defining f2 (v, u, t) = f (v, t)f (u, t), we obtain
integral of g(x, t) in x equals the unity. the expression
Replacing (26) in (36), we may write Z
Z dE 1
dE g(x, t) = ln f2 (v, u)
= ln √ dt 2
dt x
× {f2 (v′ , u′ ) − f2 (v, u)}F dudv′ du′ dv. (44)
× {g2 (x′ , y ′ ) − g2 (x, y)}ϕ(x, y; x′ , y ′ )dxdx′ dydy ′ ,
(37) If now we perform the exchange of variables v ↔ v′ and
√ u ↔ u′ , we reach the expression
where g2 (x, y, t) = g(x, t)g(y, t)/ xy and we have
dropped the explicit time dependence of g2 . If we Z
dE 1 f2 (v, u)
perform the changes of variables x ↔ y and x′ ↔ y ′ , we = ln
dt 2 f2 (v′ , u′ )
find an expression which added which added with (37)
gives × {f2 (v′ , u′ ) − f2 (v, u)}F dudv′ du′ .dv (45)
Z
dE 1
= ln g2 (x, y) We remark that the exchanges of variables just employed
dt 2
leaves F (v − u, v′ − u′ ). Again, using the reasoning used
× {g2 (x′ , y ′ ) − g2 (x, y)}ϕ(x, y; x′ , y ′ )dxdx′ dydy ′ . above, we conclude that the integrand is negative from
(38) which follows that dE/dt ≤ 0 because F ≥ 0.
Revista Brasileira de Ensino de Física, vol. 46, e20230322, 2024 DOI: https://doi.org/10.1590/1806-9126-RBEF-2023-0322
Oliveira e20230322-7
DOI: https://doi.org/10.1590/1806-9126-RBEF-2023-0322 Revista Brasileira de Ensino de Física, vol. 46, e20230322, 2024
e20230322-8 Kinetic theory of gases: stochastic thermodynamics avant la lettre
which is a closed equation for the one-particle probabil- where Ψ is the entropy flux from the outside to the
ity density and can be solved. In this approximation, the system, given by
master equation is recognized as the Boltzmann equa-
W(ξ ′ |ξ)
Z
tion. It should be remarked that the original Boltzmann
Ψ=− W(ξ ′ |ξ)P(ξ) ln dξdξ ′ , (65)
equation, given by (32), is written in terms of velocities W(ξ|ξ ′ )
only. This happens because the transition probability
rate considered by Boltzmann, as well as by Maxwell, and Π is the production of entropy, given by
depends only on velocities of the molecules and not on
W(ξ ′ |ξ)P(ξ)
Z
their positions. Π= W(ξ ′ |ξ)P(ξ) ln dξdξ ′ . (66)
Considering that the transition rate w depends on the W(ξ|ξ ′ )P(ξ ′ )
velocities, we may integrate (57) in all positions to find
Z We remark that the entropy is not a conserved quantity
d like energy. The time variation of the entropy is not
P(v1 ) = {w(v1 , v2 |v1′ , v2′ )P(v1′ )P(v2′ )
dt in general equal to the flux of entropy Ψ. Entropy
− w(v1′ , v2′ |v1 , v2 )P(v1 )P(v2 )}dv1′ dv2′ dv2 . is also created by virtue of the irreversible processes
(58) occurring inside the system. This contribution is the
entropy production rate Π and is always nonnegative.
Using the property To demonstrate that Π is a nonnegative quantity we
write it as
w(v1′ , v2′ |v1 , v2 ) = w(v1 , v2 |v1′ , v2′ ), (59)
Z
1
it can be written as Π= {W(ξ ′ |ξ)P(ξ) − W(ξ|ξ ′ )P(ξ ′ )}
Z 2
d
P(v1 ) = {P(v1′ )P(v2′ ) − P(v1 )P(v2 )} W(ξ ′ |ξ)P(ξ)
dt × ln dξdξ ′ , (67)
W(ξ|ξ ′ )P(ξ ′ )
× w(v1 , v2 |v1′ , v2′ )dv1′ dv2′ dv2 , (60)
which is obtained performing the exchange of variables
which is identical to the Boltzmann equation (32). ξ ↔ ξ ′ in the integral of equation (66). The integrand
The above result allow us to conclude that the Boltz- in (67) has the form (a − b) ln(a/b) which is a non-
mann equation can be understood a master equation in negative expression, a result that allows us to conclude
the approximation of one-particle, which is an approx- that Π ≥ 0.
imation equivalent to say that the dynamic variables Let us consider the transition probability rate
of distinct molecules are statistically independent. This employed by Maxwell and Boltzmann. The transition
independence assumption was called molecular disorder rate they employed is invariant under the exchange of the
(molekularen Unordnung) by the Ehrenfests [24, 25]. velocities of the molecules before and after the collisions.
This is equivalent to say that
6. Entropy Production
W(ξ ′ |ξ) = W(ξ|ξ ′ ). (68)
One of the fundamental concepts of stochastic thermo-
dynamics is that of entropy production, to be defined in Replacing this property in the expression (65) shows
the following. The entropy of the system described by that the flux of entropy vanishes identically. Thus the
the probability density P(ξ, t) is defined by Kolmogorov equation associated with the Boltzmann
Z kinetic theory describes a closed system. There is no
S = − P(ξ) ln P(ξ)dξ, (61) exchange of entropy nor of energy. As to the entropy
production rate, wee see that it coincides with the
where we are dropping the explicit dependence on t, Its variation of entropy, that is Π = dS/dt. Since Π ≥ 0,
time derivative is it follows that
Z
dS ∂P(ξ) dS
=− ln P(ξ)dξ. (62) ≥ 0, (69)
dt ∂t dt
Replacing ∂P/∂t given by (52), we find
which can be understood as the Boltzmann theorem of
P(ξ) irreversibility. To reach this conclusion, it suffices to see
Z
dS
= W(ξ ′ |ξ)P(ξ) ln dξdξ ′ . (63) that the entropy S is the Boltzmann quantity E with
dt P(ξ ′ )
the reversed sign.
The right-hand side of the equation above is split in To verify that equation (45) derived by Boltzmann
two parts is indeed identical to (63), it suffices to use (68), and
dS write the probability distribution P(ξ) as a product of
= Π + Ψ, (64) one-particle probability distributions.
dt
Revista Brasileira de Ensino de Física, vol. 46, e20230322, 2024 DOI: https://doi.org/10.1590/1806-9126-RBEF-2023-0322
Oliveira e20230322-9
7. Collision of two particles Subtracting the equations (73) and (74), we get
Let v and u be the velocities of two molecules before the v′ − u′ = |v − u|e, (78)
collision and v′ and u′ after the collision. If the kinetic
from which follows the result (70) because |e| = 1.
energy and momentum are conserved before and after
In the one-dimensional case,
collision then the absolute value of the relative velocities
are the same before and after the collision, v ′ − u′ = |v − u|e (79)
′ ′
|v − u| = |v − u |. (70) where e takes the values +1 if v < u and −1 if v > u.
Usually this result is demonstrated assuming that the In any case v ′ − u′ = u − v and v ′ and u′ are uniquely
particles obeys the Newton equations of motion. How- given by
ever, this is not necessary. It is a consequence of any type µ (m1 − m2 )v + 2m2 u
of motion, be it Newtonian or not, as long as it leads to v′ = c + (u − v) = , (80)
m1 m1 + m2
the conservation of energy and momentum.
Our premises concerns the total momentum of the µ (m2 − m1 )u + 2m1 v
u′ = c − (u − v) = . (81)
two molecules and the total kinetic energy. The total m2 m1 + m2
momentum is the same before and after the collision, The transition rates that we consider here are asso-
′ ′
m1 v + m2 u = m1 v + m2 u, (71) ciated with processes in which the velocities of two
molecules changes while their momentum and their
as well as the kinetic energy, which we write as kinetic energy remains unchanged. The parametric for-
mulation that we defined above becomes appropriate to
m1 v ′2 + m2 u′2 = m1 v 2 + m2 u2 , (72) describe such processes. The transition rate can then
be calculate by considering that the unit vector e is an
where m1 and m2 are the masses of the molecules, and
independent random variable with a given probability
we are using a boldface letter to represent a vector and
density distribution which we denote by P (e). The
an ordinary letter to represent its absolute value.
conditional probability density of (v′ , u′ ) given (v, u) is
The two equations above can be obtained from a
obtained by
parametric formulation as follows
µ P (v′ , u′ |v, u)dv′ du′ . = P (e)de (82)
v′ = c + |v − u|e, (73)
m1
In one dimension the velocities v ′ and u′ are uniquely
determined by v and u. Therefore in one dimension
µ
u′ = c − |v − u|e, (74) the motion determined by the conservation of kinetic
m2 energy and momentum cannot be stochastic and remains
where µ = m1 m2 /(m1 + m2 ), e is a unit vector, |e| = 1, deterministic. In fact, in this case the conservation of
and c is the velocity of the center of mass, defined by kinetic energy and momentum leads to the conclusion
that the motion follows the Newton equations of motion.
m1 v + m2 u Another result of the one dimensional case is as
c= . (75)
m1 + m2 follows. If all particles have the same mass than from
That is, given v and u, then v′ and u′ given para- the equations (80) and (81), the particles exchange their
metrically by the equations (73) and (74) obey the two velocities in a collision. Therefore the probability density
conservation laws above for an arbitrary unitary vector distribution does not change and remains that of the
e. initial condition. There is no relaxation to the Maxwell
Multiplying (73) by m1 and summing with (74) distribution, unless the initial condition is already a
multiplied by m2 , we find the result Maxwell distribution. In other words, both sides of the
Boltzmamm equation vanish identically.
m1 v′ + m2 u′ = (m1 + m2 )c, (76)
8. Conclusion
which is equivalent to the conservation of momentum
(71) if we consider the definition of c.
We have shown that the kinetic theory of gases devel-
Taking the square of (73) and of (74), summing the
oped by Clausius, Maxwell, and Boltzmann can be
resulting equations after multiplying by m1 and m2 ,
understood as stochastic thermodynamics. This result
respectively, we find
was accomplished by showing that the dynamics used by
m1 v ′2 + m2 u′2 = (m1 + m2 )c2 + µ|v − u|2 . (77) Boltzmann to reach his equation is a stochastic dynamics
and not properly a deterministic Newton equation of
Replacing the definition of c in this equation we are led motion. Using the probability transition rate devised
to the conservation of energy (72). by Maxwell and used by Boltzmann we constructed the
DOI: https://doi.org/10.1590/1806-9126-RBEF-2023-0322 Revista Brasileira de Ensino de Física, vol. 46, e20230322, 2024
e20230322-10 Kinetic theory of gases: stochastic thermodynamics avant la lettre
Kolmogorov equation, that gives the time evolution of [13] L. Boltzmann, Sitzungsberichte der Kaiserlichen
the probability density distribution associated with the Akademie der Wissenchaften 66, 275 (1872).
whole state of the system. [14] K. Huang, Statistical Mechanics (Wiley, New York,
The Kolmogorov equation is the basic equation of 1963).
the stochastic thermodynamics from which we deter- [15] Y.B. Rumer and M.S. Ryvkin, Thermodynamics, Statis-
mine the time evolution of the entropy as well as the tical Physics and Kinetics (Mir, Moscow, 1980).
flux of entropy and the entropy production rate. The [16] L.E. Reichl, A Modern Course in Statistical Mechanics
Boltzmann equation is reached using an approximation (University of Texas Press, Austin, 1980).
which corresponds to consider the velocities of dis- [17] C. Cercignani, The Boltzmann Equation and its Appli-
cations (Springer, New York, 1988).
tinct molecules as independent statistically. Within this
[18] S.R.A. Salinas, Introduction to Statistical Physics
approximation we have also shown that the entropy pro-
(Springer, New York, 2001).
duction rate is associate with the Boltzmann H-theorem.
[19] G.M. Kremer, Uma Introdução à Equação de Boltzmann
The Markov assumption is the fundamental hypothe-
(EDUSP, São Paulo, 2005).
sis of the stochastic dynamics developed here. Another
[20] G.M. Kremer, An Introduction to the Boltzmann Equa-
assumption is the one related to statistical independence tion and Transport Processes in Gases (Springer, Berlin,
of the molecules, which we have employed as an approx- 2010).
imation to reach the Boltzmann equation from the [21] N.G. van Kampen, Stochastic Processes in Physics and
Kolmogorov equation. These two distinct assumption are Chemistry (North-Holland, Amsterdam, 1981).
implicit in the paper of Boltzmann of 1872 but they are [22] T. Tomé and M.J. de Oliveira, Stochastic Dynamics and
not clearly distinguished in the historical texts on the Irreversibility (Springer, Cham, 2015).
kinetic theory. Boltzmann himself did not distinguish [23] M.J. de Oliveira, Rev. Bras. Ens. Fis. 44, e20220087
them and considered them consequences of the gas being (2022).
molecularly disordered (molekular Ungeordnet) [3]. The [24] P. Ehrenfest and T. Ehrenfest in Encyklopädie der math-
two assumptions were distinguished by Jeans but he ematischen Wissenschaften (Teubner, Leipzig, 1907–
considered them consequences of molecular chaos [5]. 1914); volume 4, issue 4, article 32.
The Ehrenfests on the other hand not only distinguish [25] P. Ehrenfest and T. Ehrenfest, The Conceptual Founda-
them but gave them different names. The fundamental tions of the Statistical Approach in Mechanics (Dover,
hypothesis concerning the stochastic dynamics they New York, 1990).
called the collision number hypothesis (Stosszahlansatz) [26] P. Ehrenfest and T. Ehrenfest, Physikalische Zeitschrift
8, 311 (1907).
[24, 25] and the one related to the statistical inde-
[27] J.W. Gibbs, Elementary Principles in Statistical Mech-
pendence of velocities they called molecular disorder
anis (Scribner, New York, 1902).
(molekularen Unordnung) [24, 25].
[28] T. Tomé and M.J. de Oliveira, Phys. Rev. E 82, 021120
(2010).
[29] T. Tomé and M.J. de Oliveira, Phys. Rev. E 91, 042140
References (2015).
[30] T. Tomé and M.J. de Oliveira, J. Chem. Phys. 148,
[1] H.W. Watson, A Treatise on the Kinetic Theory of Gases 224104 (2018).
(Clarendon Press, Oxford, 1876). [31] M.J. de Oliveira, Phys. Rev. E 99, 052138 (2019).
[2] O.E. Meyer, Die kinetische Theorie der Gase (Mar- [32] M.J. de Oliveira, Rev. Bras. Ens. Fis. 42, e20200210
uschke und Berendt, Breslau, 1877). (2020).
[3] L. Boltzmann, Vorlesungen über Gastheorie (Barth, [33] E. Garber, S.G. Brush and C.W.F. Everitt, Maxwell on
Leipzig, 1896–1898), v. 1–2. Molecules and Gases (MIT Press, Cambridge, 1986).
[4] S.H. Burbury, A Treatise on the Kinetic Theory of Gases [34] M.J. de Oliveira, Rev. Bras. Ens. Fis. 41, e20180307
(Cambridge University Press, Cambridge, 1899). (2019).
[5] J.H. Jeans, The Dynamical Theory of Gases (Cambridge [35] A. Walstad, Am. J. Phys. 81, 555 (2013).
Universiyt Press, Cambridge, 1904).
[6] L. Boltzmann, Lectures on Gas Theory (University of
California Press, Berkeley, 1964).
[7] S.G. Brush, Kinetic Theory (Pergamon Press, Oxford,
1965), v. 1.
[8] S.G. Brush, Kinetic Theory (Pergamon Press, Oxford,
1966), v. 2.
[9] R. Clausius, Annalen der Physik und Chemie 100, 353
(1857).
[10] J.C. Maxwell, Philosophical Magazine 19, 19 (1860).
[11] J.C. Maxwell, Philosophical Magazine 20, 21 (1860).
[12] J.C. Maxwell, Philosophical Transactions of the Royal
Society of London 157, 49 (1867).
Revista Brasileira de Ensino de Física, vol. 46, e20230322, 2024 DOI: https://doi.org/10.1590/1806-9126-RBEF-2023-0322