Professional Documents
Culture Documents
By
ALFRI~D RENYI (Budapest), corresponding member of the Academy
Introduction
1 The Dirac &"function" and the uniform distribution on the whole real axis (or the
whole space etc.) arise in the same way in quantum mechanics and they are in a certain
sense dual to each other. As a matter of fact, it is easy to verify the following fact connected
with Heisenberg's uncertainty relation (for the sake of simplicity we restrict ourselves to
the one dimensional case): if the wave function ~,(x)of the position of a particle degener-
ates into a Dirac &"function,, the wave function ~o(p) of the imPUlse of the particle degen-
erates into a function for which [9(pll2=~const. is the "density" of a "uniform proba-
bility distribution in the whole phase space". This can be shown as follows: as it is known,
denoting by 9(p) the wave function of the position and by 9~(p) the wave function of the
+ c~ ip:c
1 ~f g,(x)e
impulse of a particle, we have 5o(p)~- [/2-~h 7~ dx. This formula shows that if
-GO
ipx o
~(x) degenerates into the Dirac function ~(X--Xo) , we have ~(p) -- _ 1_ e i~. for
which [~ (p) [o~ const. V2~-h
288 A. ~ N w
exposed at the first time as comments to my lecture [9], are published in his.
paper [13] in this volume and settle the question under what conditions
can the conditional probability, introduced in the present paper as a set:
function of two set variables, be expressed in quotient form by means
of (one or more) set functions of one variable. D. VAN DANTZIQ called m y
attention to the work of KEYNES, COPELAND and KOOPMAN. JU. V. LINNIK
called my attention t o a paper of J. NEYlVIAN [14], where the foundations of
the theory of probability are sketched in a form which has some points of:
contact with the point of view of the present paper. In Berlin A. N. KOLMOGOROV
called my attention to those conditional probability spaces which I call "Cavalieri
spaces" (w 2). I hope to return to the thorough investigation of such spaces in a.
forthcoming paper. I owe to a discussion with E. MARCZEWSKITheorem 14 of w 2..
The resuit of 3.5 has been suggested to the author by a remark of K. SARKADI..
A. CsAszAI~ and J. CzIPSZER kindly read the manuscript of the present paper
and made some valuable remarks which I have utilized in preparing t h e
final form of this paper.
l am thankful to all those mentioned for their remarks and suggestions..
that 0 C ~ and S ~ ~.] Let us suppose further that a non empty subset &,
ON A NEW AXIOMATIC THEORY OF PI~OBABILITY 28~
2 It will be seen that our axioms imply that O (~ N, but it is possible that oAR con-
tains all elements of c~ except 9 on the other hand, it is possible that N contains
only one set. The theory is somewhat less general, but considerably simpler if it is sup-
posed that N is an additive class of sets, i. e. if from B~EN and B2 ~ N it follows
B~ -~- B 2 ~ N. Concerning the consequences of this supposition see [13].
3 See 1.7 where an equivalent axiom (Axiom I l l ' ) i s discussed, and I. 8 where a
stronger form of this axiom is mentioned.
290 A, RI~NY1
are connected with each other by Axiom III. This connection is such that it
is in conformity with the usual definition of conditional probability. Namely,
:if we put Pc(A)--P(AIC) for A E c~ with CE 33 fixed, and define the con-
ditional probability P*(AIB) for a B E o% for which P c ( B ) > 0 , as usual in
P (AB)
the theory of KOLMOGOROV, by P*(A[B)== Pc(B) ' we have by Axiom III
THEOREM 1. P(AIB)=P(ABiB ).
PROOF. If in Axiom III C=B, we have P(A]B)P(BIB)--P(ABtB ).
Taking Axiom I into account Theorem 1 follows.
REMARK. It follows from Theorem 1 that P(SIB ) = 1; namely, by Theo-
rem 1 P(SIB)-~P(SBIB)--P(BIB) and thus, by Axiom I, P ( S [ B ) = 1.
THEOREM 2. P(AtB ) ~ 1.
PROOF. According to Axiom II we have P (A B]B) + P (ABIB) = P (BIB).
As by Axiom I P ( B ] B ) : 1 and P(A-BIB) ~ 0, it follows P(AB]B) <= 1 and
thus by Theorem 1 we obtain P(A[B) ~ 1.
THEOREM 3. P ( 9
PROOF. According to Axiom II P( 9 P( 9169169 ) and
thus P ( O I B ) ~ 0 .
REMARK 1. It follows from Theorem 3 that 9 ~g3, because if O would
belong to 33, we should have P( 9169 by Axiom I and P ( O ] O ) : 0
by Theorem 3; thus the assumption 9 C 33 leads to a contradiction.
REMARK 2. It follows from Theorem 1 and 3 that if A B : O , then
P(AIB):O.
ON A N E W AXIOMATIC T H E O R Y O F PROBABILITY 291
2P(AIB,~C)P(B,:IC)= 2P(ABk]C)=P(ABIC)=P(ABC]C)--
k=l 1~=1
:P(AC{C):P(AIC).
REMARK, We mention the following consequences of Theorem 7: Let
co
Q(A)---~ ~'Q(Ak). This follows simply from the remark that if A~B,~, we
k:l
the same B, may be used, and therefore the countable additivity of Q(A)
follows from that of P(A]B) for fixed B r e~g (Axiom ll).
Thus the first part of Theorem 8 is proved. Now we suppose that the
sequence B,, has besides a)--c) also lhe properly d). By a well-known the-
orem (see p.e. [15], w 13, Theorem A) the definition of Q(A) can be extended
to the smallest o-ring C-[*+ containing c'I* in such a manner that Q(A)
Co
A S* ~ ~.T A B ....
~t ---(I
1.6. t?andom variables on a conditional probability space. Let [S, ~%, ~`3,
P(AIB)] be a conditional probability space. If _~=~(a) denotes a real-valued
function defined for a ~ S which is measurable with respect to a , i.e. if Ax
denotes the set of those a ~ S for which _~(a)<x, we have A~ ~ for all
real x, we shall call ~ a random variable on [S, ~, g), P(AIB)]. Vector-valued
random variables are defined similarly. The (ordinary) conditional probability
distribution function of a random variable .~ with respect to an event B ~ 0'3
is defined by F(x]B) : P(A,,t B) ; if F(xlB ) is absolutely continuous, F'(x[B) ~-
: f ( x [ B ) is called the (ordinary) conditional probability density function of~
with respect to B. The conditional mean value M(~]B) of ~ with respect to
an event B ~ N is defined as the abstract Lebesgue integral
M(~! B) --: .t~(a)dP(AI B)
N
ON A NEW AXIOMATIC T H E O R Y O F P R O B A B I L I T Y 295
.['f(.)g.
(7b) P(A~] A~) ~z - -
ff(u)du
The generalized distribution function resp. density function of a random vector
is defined similarly.
1.7. An alternative form of Axiom llI. We have ah'eady pointed out
that our system of axioms can be characterised in the following manner: the
set S, the a-algebra c~ of subsets of S, the subset 03 of cr and the set func-
tion of two set variables P(A]B) defined for A r cr and B r 03 form a con-
ditional probability space if ~ ' n = [ S , cr P(A]B)] is an ordinary probability
space for every fixed B E 03 and if the probability spaces $~, and ~'~c, are
connected by Axiom llI if C r 03 and BC ~ 03. Thus different probability fields
can be combined to f o r m a conditional probability field only if they are
"compatible" in that sense that they satisfy Axiom III which c a n b e consid-
ered as the condition of compatibility.
Now it is easy to prove
THEOREM 9. Axiom III can be brought to the following equivalent form:
AxIoM IIl'. If B~03, C~03, B ~ C and P ( B I C ) > 0 , we have for any
P(ABIC)
P(A]B) = P(B[C)
and that clearly P ( B ~ I C ' ) = P ( B ' ] C ' ) > 0 and thus C' has the properties e),
#) and 7) and therefore B~C'E ~.
If B ' C ' = & and C'----B1 where B'~c;I, we have to verify
P(AiB'C')P(B'iC')=P(AB']C'). As P(AB'IC')=P(AB'C']C'), this rela-
tion is trivially equivalent to P(AIB,)=P(ABIIBO, which is satisfied by
force of (9).
It is easy to see that the definition of P(AIB 0 does not depend on the
choice of B,2; as a matter of fact, if both B., and B.a have properties cO, ,~),
and 7), it follows by Theorem 6 thai
P(AB, IB,2) P(AB~ IB:~)
P(B~IB~) P(BII&)
It is also clear that P(AB~iB,a) can not be defined otherwise as by (9) be-
cause if B~ is included into ~-3, (9) must hold by force of Axiom III.
Another possibility for including new sets into ~ is yielded bypassing
to the limit; this procedure is described by the following
THEOREM 1t. Let us suppose that B,~ ~ ~, B,~ ~ B,,+~, further
O3
P ( B ,~ !B~i~l)
i > 0 (n = O, 1,.. .) and that [-IP(B,,[B,~+I) converges. If
l~ = 0
Co
Bo~ = . ~ B,~ does not belong to aB, the definition of P(A B ) c a n be extended
for B ~ B ~ , by putting
(12) P ( A I B ~ ) = l i m P(A[B;,) for any A ~ ~,
Jt --~- c o
provided that the following condition is satisfied: if B ~ oq3 and P(BiBx ) > (~
for some N, we have BBx E ~.
PROOF. For an arbitrary A~e% we put A~~ AOO AB~B~.:_~
( k : 1, 2, ...). As AoO~B,~ for n ~ k, the sequence P(A(~OlB,,) is by Theorem
5, Remark 1 monotonically decreasing for n : / c , k@ 1 , . . . and thus
lira P (A(Z")!B,~~- - P (A(J':)i B~)
exists.
Putting
co
and thus
co
Cab
k=_O
(A,, I ~) = P (A,,~Bco),
it follows from (16) and (17) that
co
,~=0
,(21) p (A 18~,,) = ~2 e (ac~ tS:,0 = ~ P (A~'~1B,-) = co
1,::0 1;:0
IIP(B,,I B,,+~)
It follows from (21) that
BB~v E a-3 could be omitted. In what follows we do not suppose the validity
of Axiom III*; it should be mentioned, however, that Axiom Ill* is contained
as a special case in Axiom IV2 introduced by A. CSASZAR [13].
If a=[S,~'~,aZ, P(AIB)] and a ' = [ S ' , ~ ' , ~ ' , P ' ( A I B ) ] are two suck
conditional probability spaces t h a t S ~ S ' , c~Gc'{', ~3~a3' and P'(AIB)~-:
= P ( A I B ) if A ~ ~ and BE ~ , we shall say that the conditional probability
space ~ is imbedded into the conditional probability space ~'. In the forego-
ing section we considered only such special imbeddings of a conditional
probability space ~' into a conditional probability space ~' for which S ' = S,
~ ' ~ c r and ~ - ~ ' . These imbeddings o f ~ were obtained by extension of
,& in several manners. I f ~ is imbedded into ~', we shall write ~ ~ ~".
1.9. Continuity propertie s of conditional probability. It follows from
Axiom II that P(AIB) is for fixed B E N a bounded measure in A, and thus,
of course, continuous in A, i.e. if A , ~ A and A,~A,,+~ (or A,~@A,+ 0 for
a---~ 1, 2, . . . , we have for B E a3
Iim P(A, {B) - - P ( lira A,~[B).
and extend the definition of P(AIB) in the usual way for any fixed B ff g3
t o all sets A belonging to the least o-algebra cq, containing cqi. In this way
w e obtain a conditional probability space P = [ S , ~ , ~, P(A]B)] which will
b e called the Cartesian product of the conditional probability spaces N~:~
and denoted by 87=- g o ) , . . . , g ( ~ , ) , . . . . To prove that g' is really a condi-
tional probability space, we have only to verify the validity of Axiom III,
304 A. R~NYt
Putting
N
and
O3
and thus
N
where :,is uniquely determined by the condition B E ~%~,. Then [S, GL, ~3, P(AIB)]
is a conditional probability space.
PROOF. Axioms I and II are clearly satisfied. Instead of Axiom Ill we
verify the validity of Axiom III' which has been shown to be equivalent to
Axiom lII. Clearly Axiom III' is satisfied, because if B E ~ and C E ~%v and
B ~ C , only three cases are possible: ~ : 7 , ~ < 7 or 3 > 7. The case # : 7 is
evident. If # < , :" P(B]C)--:~(BC)
,
_ ~v(B) and as &~(B)< ~ and # < 7,
we laave by supposition , % ( B ) = 0 and therefore P ( B I C ) ~ 0 ; and thus there
is nothing to prove.
Finally 7" < ,~ is impossible, because from C E ~ 7 it follows !%(C)< @
and thus , u v ( C ) : 0 , what implies & ~ ( B ) = O because of B ~ C ; but this
contradicts B E s Thus Theorem 14 is proved.
The conditional probability spaces described by Theorem 14 will be
called "conditional probability spaces of alternative quotient-type". I f B E ~v,
we may call 7 the "dimension" of B, and say that the measures ,% are
"dimensionally ordered".
We mention two special cases:
a) A special case of a conditional probability space of alternative
quotient-type is obtained as follows : let f(x~, x~ . . . . . x,,) denote a non-negative
Borel measurable function in S--=En; let cq; denote the set of all measurable
subsets of E,,, and ~ the set of those measurable sets B, for which either
m(B)=.ff(x,,...,x,O dx~dx2.., dx,~ is finite and positive, o r B ' i s contained in a
k-dimensional subspace E !:~'~..... %~ of E,~ ( k = 1,2, n~l) defined by
x,~ : c ~ , x,.~ = c,,, ..., x~,,_7~ : c , : , , _ z . , and the integral
(27) m~:;,"":._;,
~''.....
..., ~"-~,_~= j'f(&, x2, . . . . x,~) dx,I. d x A 999 dxy,~
B
is finite and positive; where j ~ , j 2 , . . . , j~ denote those of the indices 1,2, ..., n
which are different from il, i.,, ..., i,~ ~; and in (26) x:~ = c~, x~-.~ c~,..,, x~,,_~= c,~_~,.
are substituted; we suppose that N is not empty; in the first case we
put
P (A IB) - - ~ (A B) .
re(B) '
ON A N E W AXIOMATIC T H E O R Y O F PROBABILITY 307"
el c 2
if B ~ ~ and B is contained in E~I,'~, " ' ~,-k
...~ ,,_~, we put
a<=t~b
f P(AlPl)aF(t)
b
fdF(t)
In this case we Shall say that ~ is regular with respect to the family
{R~;a~--t<fl}. If F(t) is absolutely continuous, we shall say that ~' is
absolutely regular.
Clearly if ~' is regular with respect to the family {R,}, it is also a
Cavalieri space with respect to {R,}. 9 As a matter of fact, C 1, C 2 and C 3
are identical with R 1, R2 and R3, and C 4 follows from R 4 and R5.
It is easy to prove by using FUBINfS theorem, that if 0 < f(xl, x2,..., x,,) < M,
the conditional probability space defined in 2.2, a) is regular with respect to
the family {R,} defined as follows: R, is the set of points (x~, x2 . . . . x~,) for
which a~=x~<& for i=~k, i = 1 , 2 . . . . . n, and x k = t ( a ~ t < f l ) .
It is Clear that if ~' is regular with respect to the family {R,; a ~ t < ~},
we have
(29) lim
h "> 0, h--~0
P(AIRFh)=P(AIRO
for almost every t in (cr fl) with respect to the measure generated by F(I)
on (a, ~') for any fixed A.
It is also evident that if g' is regular with respect to the family
{R,;e<~t<fl}, we have P(R~[R~)=0 for a~a<=t<b<~fl.
Let ~--:~(a) (a ( S ) be a random variable on a conditional probability
space 3=[S, c~, 23, P(AIB)], further let us denote by R~ the set of those
a ( S for Which g ( a ) = t ( - - ~ ~ t < -F ~ ) . Let us suppose that ~ is regular
with respect to the family {R,}, i.e. we have for A 6
b
j P (AIR,) dF(t)
(30) P (A IR:) ~' b
.idF(t)
a
for - - ~ < a < b < + o~. Evidently the conditional distribution generated by
on the real axis is the distribution
p b d F(b)---F(a)
P(a<~<b!c<~_<d) = (R~]R~.)= F(d)--F(c) for c<~a<b~d,
f f sin ~ d ~ d q ~
A B b, d
P ( A B a,b'~) ~176
b d
if a<b,
t d
"B 5
/a( t,
(o --<_c < d ,0,
2~ x
and let us suppose that P(Bo,5 t B t ) = F ( x , t ) is a continuous function of x
and t, strictly increasing as a function of x. Clearly [S, ~, ~5, P(A[B)] = 8
is a conditional probability space by any choice of the measures (J,(A); we
ON A NEW AXIOMATIC THEORY OF PROBABILITY 311
shall prove that it is a Cavalieri space with respect to the family {B~; 0 ~ t < 2:~}
if and only if
1 9~
(31) = sin ,9d,9.
ABt,
0 --<_a < b < 2~, i.e. ~- sin ~cl~dt-~-)~(b--a). By taking the derivative
a 0
of both sides, wi!h respect tob, by virtue ofthe continuity of x(t, ~), we obtain
sin,gd~)~ for every t (0 =< t < 2~). Thus, it follows easily that
0
(31) holds.
Our result can be stated as follows: the uniform probability distribution
on the surface of the sphere combined with arbitrary distributions on the
meridians q~--t gives always a conditional probability space, but this space
is a Cavalieri space with respect to the set of meridians (under suitable con-
tinuity restrictions) if and only if the density of the distribution is on every
1
meridian the same, and equal t o ~ sin,9. In this case the conditional prob-
ability space is clearly also regular with respect to the set of meridians, as
we have
b
" ~P(A]BOdt
p ( A ' B~~ ) - - b :1a f ( f l s i n ~gdoq]j dt -- "
e,
9 L;
9.
t HL.(t;) d#
j=l
k
i~__1~2.<.z, O ~ t j < T
P(A~IB~) . . . . - k
N o w let us suppose that 0 =< z~ < z.~ < T; in this case A,,C_A,..,~Br and thus
U ~ (tj) dt:,-
j-1
k
P(A~,.:]BT)-- ~ 9<~':
k T
,-
and
P (A~ A ~ ]Br) _ P (A~, [BT)
P(A~, ]AO - - P ( & , IA~.BT)
and thus
f k
.J j=l
k
f~: tj.. : zI
(33) p(& jA~o)-- j=l
f _Hfj(#)dti
k
i=l
ON A N E W AXIOMATIC T H E O R Y OF PROBABILITY 313
Xgdt)dt
(34) P(A~, [A~) - - ~,~
Jgk(tldt
0
j'fj(t)dt= + oo, but ~fe-~fj(t)dt= q~j(s) exists for some s > 0 ( j - - 1, 2, :.., k)
0 0
co
f(x--y) g(y)dy
lim -.4
+A
-- --h(x)
for j,/c~-~0, + 1 , +__2, .; thus if @ is the set of all integers, A and B are
finite subsets of 0 and B is not empty, then we have
where n(B) denotes the number of elements of B. The result can be stated
as follows: the conditional probability distribution generated by ~ [ / ~ ] on
tends to the uniform conditional probability distribution on ~ if /~ ~ + ~ .
3.2. The number of prime divisors. An interesting application of the
result in 3. 1 to number theory is as follows: let U(n) denote the number
of different prime divisors of the integer n; let zek(N) denote the number of
those natural numbers n ~ N for which U(n) ~ k. By a theorem of P. ERDOS
[16] w e have
~(N) ~_ (log log N ) 7~e_loglogN.(1 _1_o(1))
(36) N k!
;e--~ dx
('Z
o"
P(a <=~ < blc <=~ < d) -
e
ie- d
(7"
and thus
b--a
lim P(a ~ ~o < b[c < ~,~ < d ) - -
O->-+'(3O = d--c '
+co
As by supposition icf(t)] ~:~:1 for t::~:0 and lim [q~(t)l=0 , it follows easily
t~__+co
put B~ - - ~ H A
~,, and let us suppose that A~ E 23 and B,~ E 23 (k, n 0, 1, 2,...).
k=0
Let us suppose that ~ = t , ( a ) is an other random variable which generates
a regular uniform conditional probability distribution on the positive real
axis N+; let us denote by Ct the set of those a E S for which t~(a)= t and
put D x = .~.~ Cu; let us suppose that C,, E 23, D.~:E 23, further that B,,D~ E 23
O~y~x
(0 < x < + ~ , n = 0, 1, 2,...). We suppose
t~e-t
(40) P(Akl C,) k! (k=0, I,...),
i.e. that ~ has a Poisson distribution with mean value t under the hypothe-
sis that ~ = t. W e shall prove that in this case the conditional probability
distribution generated by ~ on the set of non-negative integers g+ is the
uniform distribution, i. e .
1
p(Aj~[B~)-=P(~-kI~<n) - for k = 0 , 1 .... , n ; n - - O , 1,....
' -- n-t-1
This can be shown as follows" We have by Theorem 12
P(A~ID~)
(41) P(A~[B,,) =~_.~limP(Ak[B,~D~,) = x+~lim p(B,~iD~ ) ,
j ' P ( A kiCk)dr
o
(43) P (A~:IB~,) ~-- GO
/r ~'
o
As by (40)
f P (A;,.i C0 dt = . )(.~tke-t
y - dt ,
o o
P(AeiB'~) - - n-~- 1
which was to be proved.
A realisation of what has been said can be given as follows: let S
denote the strip 0 _--<x < -[- ~ , 0 _--<y < 1 of the (x, y) plane, ~ the set of all
measurable subsets of S, ~ the set of all Borel-measurable subsets of S with
lI\ \
Fig. 1
positive and finite measure, and of all sets of positive linear measure lying on
some line x-~t (0 ~ t< -k ~), and let us put P(AIB ) - m2(AB)
m2(B) ' if m2(B)> O,
where m2(B) and m2(AB) denote the plane Lebesgue measure of B and AB,
respectively, and put P(AIB ) - mi(AB)
m~(B) ' if B is lying on a line x ~ t, where
318 A. R~NY[
where h(x,y)is positive and integrable on any finite rectangle of the (x,y)
plane.
]~d, y
Let us suppose further that.,~,v E~ if c < d and R ~ ' ~ if 7 < d ,
further that we have
+1
f P (A !R:~,)h (x, y) dx
(45) P(A ~ 'J) = '~ ,+ ( c < d)
.t'h(x, y)dx
and
d
.iP (A i (x, y) ay
(46) PtA
\ t *Ip+'x~
~% :c] - "
" d (7 < d).
y)dv
ON A N E W AXIOMATIC T H E O R Y O F P R O B A B I L I T Y 319
In this case clearly for any fixed y, h(x, y) is, as a function of x, the condi-
tional density function of ~ under hypothesis ~I=Y and, for any fixed
x, h(x, y) is, as a function of y, the conditional density function of ~/ under
the hypothesis ~ = x . As a matter o f fact, choosing 7 = - d ~---y we have for
c~a<b~d from (45)
b
~ h(x, y)dx
f~
(47): P(a~<b]c~<d,,2=y)-- a
f h (x, y) ax
c
It should be mentioned that the set /?~ defined by _~=x is not sup-
posed to belong to g3, and thus h(x,y) is not an ordinary conditional den-
sity function, but only a generalized conditional density function, in the sense
defined in 1.6. By other words, h(x,y) is the generalized density function
of the random variable ~ on the subspace (R.~, a~, g3~, P~) of (S, A, B, P)
where 6[.~ is the set of all sets of the form AB~ with A ~ c~, ~ the. set of
.all sets B 6 g3 which are subsets of /?x and P~(AIB ) is defined by
P~:(A IB) = P (A [B)
for A ~ and B~-3~. If RxE~, clearly h(x,y) is up to a constant factor
the ordinary conditional density function of ~ under the condition /?~ and
:in this case f(x)={h(x,y)dy is finite.
q-co
~y(x) ax
P(a<=~_<blc~<d)~lim p(a<~<b[c<~<d, 7 ~ B < d ) - - - ~a
ff(x) ax
c
Putting
h (x, y)
.(49) g(y Ix) - - f(x)
besides h(x,y), g(ylx) is the normed (i. e. ordinary)condilional density func-
320 A. RZNVl
-(10
for all h, i , j , k : O , +__1,Z2, ..., we shall say that the Markov chain is con-
ditionally ergodic. If the chain is ergodic in the usual sense (see [18]), i.e. if
1300
(53) lim r~k == PJ, > 0 (i, k = O, §__ 1, _ 2. . . . ),
~t --->- co
+co
especially, this is the case if (52) holds and ~ P~, = -~-~. If a Markov
Its- CO
ON A N E W A X I O M A T I C T H E O R Y O F P R O B A B I L I T Y 32[
as we have supposed that all states are recurrent, the series ,~P;,k ( k ~ 0 , + 1,...}
00
,~=~
diverges, and thus, it follows from (52) that
J~
x;~ p(,9
(55) Vt~: lim r=l __ Pk
r=l
But it has been proved by C. De,MAN [19] that if all states of an irreducible
chain are recurrent null states, the only solution P ~ = V~: of the system of
equations (54) with Vo= 1 is given by the limits in (55) which always exist..
The existence of the limits (52) has been proved by P. E~DOS a n d
K. L. CHUNQ [20] in the special case of additive Markov chains. Let us
consider an additive Markov chain, i.e. put _~n= ~o§ ~J~-f-~2 q - " " -k ~,~ where
the dj~'s are independent equidistributed random variables which take only
integer values, further suppose that the following conditions are satisfied"
Let us pnt P(dk=r)~---W,. ( r = 0 , • + 2 . . . . ). We suppose that the
g r e a t e s t c o m m o n divisor of the differences r--s where r and s are such
integers, for which IV,->0 and W~>0, is equal to 1, further ~hat
+co
D ~:- ~_, r2W, of the variables dj~ exists, this follows easily by LAPLACE'S.
F:- CO
322 A. R~NYI
-~-(:D
i[22] that if m = XxdF(x) (the mean value of the time interval between con-
o
secutive events)exists, and F(x) is not a lattice distribution function, the
distribution of "~(T) tends for T--~ or to the limiting distribution
It has been pointed out by L. TAKACS [23] that the fact that the mean value
ON A NEW AXIOMATIC THEORY OF PROBABILITY 323
x O' (x)
(62) z-+colim G(x)-~---0
is satisfied, there exists the limit of the conditional distribution of 6(T), and
we have
(63) lira P ( d ( T ) < xld(T) < y ) = G(x) for 0 < x ~ y < - [ - ~ .
~-~~ G(y)
3.9. Deduction of Maxwell's law of velocity distribution. Let us
consider an idea1 gas consisting of N partictes with equal masses m, let
~,, ~Ik, ~ denote the x, y, z-component of the velocity of the k-th particle;
we suppose that _~k,~k, ~k (k ~ 1, 2 , . . . , N ) are random variables defined on
a conditional probability space IS, N, ~3, P(AJB)]. Let us suppose that the
conditional probability distribution generated by the variables -~k,~k,~
( # = 1 , 2 , . . . , N) in the 3N-dimensional Euclidean space is uniform, further
that the space is regular with respect to the sets defined by _~=-xk, ~ . = y k ,
~,~zk ( - - ~ < X , ~ < + ~ , - - ~ < y k < q - ~ , - - ~ < Z k < + ~ ; k ~ l , 2 .... ,N),
further that the sets .~ Rq, .. ~3x belong to ~, where B~, is
(tl, , , , , t3N)~ 3I "' " " "' t3N
(x >=0),
~TJ
3N
and therefore, as any constant factor is irrelevant, we may take x 2 (x > 0)
for the density function of e.
Similarly, the conditional density function of , under condition st~ ~ t
3N-5
vx
( e - x ) ~- Vx
f(xlE) I'2 3N-5 E 3N-5
j" ( e - x ) 2 ~ d x
0
0
3N
If N is a large number, we have, putting fl-- 2E ' approximately
3N-5
f(xlE ) ~ 9~/-'~~-~. e + .
1
where a = V ~ " As it is known that EN - - 3_2 k T where T denotes the absol-
It can also be shown by the same way that the conditional density function
of each of the variables _~I,.,~:, ~1~ under the condition ~ - E is exactly
approximately normally distributed. This is the reason why the velocities have
a Maxwell distribution, as the Maxwell distribution can be defined as the
distribution of a variable V . ~ ~ g~ where ~, ,~, ~ are independent and nor-
mally distributed random variables with mean value 0 and variance a s.
7*
326 A. RENW
put ~ - ~_, ~,~ and A,~= M(~,) = M~ + M2-F ... + M,~, and suppose that the
I~I
c o o
D~
b) ~ ] - - _ - - < @ ~ .
1~=~ A~
]t follows that
_.,~~7~ converges with probability 1, provided that the series ~D~(~Tk) con-
1 , . = J.
ON A N E W AXIOMATIC T H E O R Y O F P R O B A B I L I T Y 327
(69) P[sup G - - 1
g' ~ A~ -I- ~=,,+1 A~ ]
As a) and b) clearly imply that
lira J~=~ O,
(70) .... A,2
it follows from (69) that
M
(73) lira !"=~q~ ~ M
,J1--->-03 ,~
. a ~ p/,:
328 a. RENYI
exists ,"
(74) 2 p~(D~-7(1--p~)M;~)
Tllen we have
--1.
X 1
\ ~kE g I
.k
On the other hand, let us apply Lemma 1 to the sequence of random vari-
ables ~ on C. As
M(e,~[C)--p,: and D2(~,~lC)=p,~(l--p,),
it follows that
(77) P lira/,=:J 1 C) =- 1.
' k=l
Combining (76), (77) and condition b) of the theorem, and taking into
account that
l~ ';t
case, when M~, M > 0 and D,~=D > 0 do not depend on n, the conditions
co
f i ( A [ B ) = ~-<~-<"
~_~ 1 ~ 1
is the conditional relative frequency of the event A with respect to the event
B in course of the first n observations. ~ The statement of Theorem 15 gives
for this special case
(78) P(lim fi~(A [B) : P(AIB)]C) = l ,
if P(B[ C) > 0, i.e. the conditional relative frequency of the event A with res-
pect to the event B converges with the conditional probability 1 (with respect
to C) to the conditional probability of A with respect to B.
4. 3. Generalization of the theorem of Borel on normal decimals. The
theorem of BOREL [25] states that if fn(k;x) denotes the relative frequency
of the number k between the first n digits of the decimal expansion of the
real number x (0 ~ x < 1), we have
1
(79) lim f"(k ; x ) - 10 ( k = 0 , 1, 2 , . . . , 9)
for almost every x; similarly all possible digits occur in the limit with equal
frequency in the expansion, in the number system with any basis q of almost
all real numbers.
We consider a more general representation of real numbers; namely
expansion of real numbers into Cantor's series. Let q~ ( n = l , 2 , . . . ) d e n o t e
an arbitrary sequence of integers, q~ ~ 2. It is easy to prove (see [24]) that
~e This interpretation suggests itself in the following special case: let us form the
Cartesian product of a denumerable sequence of isomorphic conditional probability spaces
1 2 n
[S, ~,~3, P] and denote by A,~ resp. B,, the sets defined by S'* S * . . . * A * S . . . . resp.
S*S*...*B*S,... (see section 1. tl).
330 A. R~NV*
~=1 qlq2""q~ ,
where e.(x) may take on the values O, 1, . . . , q ~ - - l ; the d!gits ~,~(x)are
uniquely determined except for those rational numbers x which can be writ-
ten in the form x ~ ~=,~'~}/1 q2--:-q,~ (s~v(x) > 0), where besides fhis finite re-
presentation the alternative infinite representation
N-1 8n(X) 8N(X)--I 2 q,--1
x~=l q,q~...q~ -k q~q2...qN q-,~=~v+, qiq~...q,,
is also possible. We shall prove that in the case q,~-+-k ~ , when all non-
negative integers are possible "digits", all these digits occur in the.limit with
co 1
the same (conditional) relative frequency, provided that ~ . X - diverges. This
is contained in the following theorem which is a consequence of Theorem 15.
THEOREM 16. Let the numbers q~ satisfy the following conditions-
a) q,~ is an !ntege r, q,~ >=2 for n = 1, 2 , . . . ,
b) q,~>=N for n =>no,
~%1
Let us consider the expansion of real numbers x (0 <= x < 1) into Cantor's
series with quotients q,, i.e. the expansion
CO
(80) x= Y _40_
~=1 qiq2. . . q~
where s~(x) is one of the numbers O, 1 , . . . , q,~--l.
Let F~(k; x) denote how many of the "digits" ~(x), ~.;(x),..., ~,~(x) are
equal to k. Then for almost every x in (0, 1) we have
(j; x)
(81) ,~+c~limF,~(k~))--l, for O<=j~k<~N--l:
(83) lim zF,x(k;x) l-l-1 1 for k ~ 0 , 1. . . . ' l and l-~- 1,2, .... N - - I .
j~O
This is an other way of expressing that the frequencies of all digits
0, 1 , . . . , N - - 1 are in the limit equal to another.
If q, ~ ~ , we have
(84) lim F~.u(k;x) _ 0 for k~0,1,
2"l/-.--~C0 m " " "
As (86) holds for all k = 0, 1 , . . . , N--1 and ~ _~,7~= F:~,(k ; x) the assertion
'0:=1
of Theorem 16 follows.
The results of this section (as well as those of section 3.2) show that
there exist sequences xl, x~, ..., x,~, consisting of the numbers 0, 1, 2 . . . . . in
w h i c h the relative frequency _F~(k)
ll
(where F~,,(k) denotes how many of the
332 n. RENYI
,i=o ,i = o
if2
~p
where P~ > 0 for k = 0 , 1, ... and ~ , r Such sequences may be
1;~0
considered as mathematical models of sequences of observations on a random
variable .~, defined on a conditional probability field, and having the conditional
distribution
P,,
# (h=0,1 .... ,k; k--1,2 . . . . ).
j~U
Bibliography
[16] P. E~D6S, On the integers having exactly k prime factors, Annals of Math., 49 (1948),
pp. 53--66.
[17] B. V. GnBD~nao, 13ber ein lokales Grenzwerttheorem iiir gleichverteilte unabh~ingige
Summanden, Wiss. Zeitschrift tier Humboldt-Univers#fft Berlin, 3 (1954"1, pp.
287 --293.
I18]. J. L. Dooa, Stochastic processes (New-York~ 1953).
[19] C. DERMAN, A solution to a set of fundamental equations in Markov chains, Proc. Amer.
Math. Soc., 5 (1954), pp. 332--334.
120] ERD6S P. and K. L. Cm~yo, Probability liniit theorems assuming 0nly the first molnent. 1,
Memoirs of the Amer. Math. Soc., 6 (1952).
[21] K. L. C,~u.~, Contributions to the theory of Markov chains; Trans. Amer. Mdth. Soc.,
76 (1954), pp. 397--419. See also K. L. Cau~G, An ergodic theorem for station-
ary Markov chains with a countable number of states, Proc. Internat. Congress
of Math. Cambridge(USA), 1959,(Amer. Math. Soc., 1952), I, p. 568.
122] W. FELLE~, On the integral equation of renewal theory, Ann. Math. Stat., 1~ (1941),
pp. 243--267. See also S.T.~eKLtND, Elementare Behandlnng vom Erneuerungs-
problem, Skandinavislr Alclttarietidskrift, (1944), pp. 1--15, and Fourier-analy-
tische Behandlung vom Erneuerungsproblem, Skandinavisk Aktuarietidskrift,
(1945), pp. 101--105; and J. L. DooB, Renewal theory from the point of view
of the theory of probability, Trans. Amer. Math. Soc., 63 (1948), pp. 422--438.
[23] L. TAKACS,Egy fij m6dszer rekurrens sztochasztikus folyamatok t~.rgyal~is~n~l, MTA Alk.
Mat. lnt. K6zlem~nyei, 2 (1953). pp. 135--163.
[24] O. PEr~R0r~, Irrationatzahlen (Berlin, 1921), pp. 111--116.
I25] E. BO~L, Les probabilit6s d6nombrables et leurs applications arithmetiques, Rend. Circ.
Math. Palermo, 27 (1909), pp. 247--27I.
[26] H. HAHN and A. ROSENTnAL, Set functions (New Mexico, 1948).
[27] M. LoirE, Probabilitv theory (New-York, 1955).
(Pez~oMe)
nagbmaTb CO6bITH~Mrl; nycTb o~ eCTb HeKOTOpOe He nycToe no~xlno~ecTBO OT (,~ (~-~ eCTD
MHO~eCTBO Bcex ~oIIyCTnMbIX ycnoBun), a P(A [B) qbyHu~nu MIJo~ecTB o r aByx nepeMen-
nmx, onpe~enenna~, ecru A E ~ H B E ~ ; qncao P(A IB) Mu 6y~eM naamBaTb yC.riOBUO~I
nepo~TnOCTb~O CO6b~Tn~ A OTHOCnTe~bH0 ycaoBn~ B. Hpe~noao~nM, qTO BbmO3H~OTC~
cae~ymu~ne 3 a~cnoM~,~:
A~cnoMa I. P ( A ] B ) ~ 0 , ecnaAE~ n BE~, n P(BIB)=I, ecnn B ~ .
AKCuOMa il. Ecaa B ~d~ qbnecnpoBau, P(A]B) eCTb ~epa Ha a-aare6pe ~ .
AecHoMa Ill. Ecnn A E ~ : ~ , B ~ , C E ~ n BC~, To
P(A [BC) P ( B I C)-- P(AB] C).
Ecan a~cnoMm I--lll ~monnennm, TO COBOKyl]HOCTb Mno~ecT~a S, a-aare6p~t C~
CnCTeMbl MHO)I~eCTB~ I~1 ~0yn~Inn MHO~eCTB P ( A ] B ) HasoBe,~ npocTpaHCTBOM yCflOBttO~
Bepo~TaOCTn n 0603naqn~ qepe3 [S, ~ , ~ , P].
OqeBn~no, UTO ecan B qbn~cnposanno, TO S , ~ n P ( A I B ) 06paaymw no Teopm~
A. H. I / o a ,~ 0 r o p o B a npocTPaHCT~O ~epo~TaOCTm HOOTOMy npocTpancT~o yC,~0SH0~
Bepo~tTaOCTn ~Bh~eTC~ Hn qeM 14HSIM~iza~ COBOI~yIIHOCTblOO6hI~noBet~HbIXHpocTpaHcTB Bepo~T-
nOCT~, Cs~3aHHb~X a~cnoMofi IlL
B ~aqecTne Henocpe~cTsennoro cne3cTBn~ a~cno~ nony~aeTc~ paneHCTSO P(A}B)=::
P(AB]B), a no3xo~y P ( A [ B ) ~ I. Pa~aeJI 1.4 zana~aeTc~ aansnefimu~u npocTm~n cnea-
cksnman m<cno~. B pa3~eae 1.5 nccne~lyeTc~ npn i~a~nx yCnOBn~X ycnoBna~ Bepo~TBOCTr,
sepOaTHOCTS P (A IB) MO~eT 6S~Tb npe~cTas.aena B sn~e P (A IB) - - Q3Qr B) , r~le Mepa Q~