Beyond Finite Sample Space II: Presidency University

Beyond finite sample space II
Presidency University
November, 2021
Example: Extension to uncountable sample space
I Let us consider the random experiment where we choose a real

number at random from the interval (0, 1].

I Here, every number in this interval is a possible outcome so

that the sample space is uncountable.


I If we follow the same idea of assigning numbers p(ω) to each

outcome, we run into difficulties.



I Fix any number ω in (0, 1].




I Fix any number ω in (0, 1].
I Since we are selecting a point at random the chances of it

falling in (0, 12 ] and the chances of it falling in ( 21 , 1] must be
the same and hence each must be equal to 12 .
I This means p(ω) ≤ 12 .
I Continuing this arguemnt we find that the probability of the

point falling in (0, 14 ] and ( 14 , 12 ] are same.



I If we continue this way, we shall find p(ω) = 0.


I If we continue this way, we shall find p(ω) = 0.
I Thus if you try to associate real numbers to outcomes you

observe that each outcome must have number zero.
I A similar problem occurs in the concept of length from which
we can take clue: points have length zero, but intervals have
positive length.
positive length.
I The idea is we should not try to associate real number to

different outcomes, rather we can try to associate at one
stroke a probability to each event.
positive length.

I But then what are events?

positive length.

I Imitation being the first choice (because we do not have any

guiding principle at this stage), let us say that every possible
subset of Ω = (0, 1] is an event.
positive length.

I Imitation being the first choice (because we do not have any

guiding principle at this stage), let us say that every possible
subset of Ω = (0, 1] is an event.
I This means that for every subset A of Ω, we want to associate

a probability P(A).
I Let us recall our demands for any number to be accepted as
probability.
I Let us recall our demands for any number to be accepted as
probability.
I For every subset A of Ω = (0, 1], we want to associate a

number P(A) which satisfies
I P(∅) = 0
I P(Ω) = 1 P
I For disjoint events A1 , A2 , ...., P(∪Ai ) = P(Ai ).
I These conditions were dictated by our intuitive notion of what
a probability ought to be.
I Now there is another condition that reflects the fact that we

want to model the specific experiment of picking a point at
random.

random.
I Picking a point at random means thatthe probability

i that the
selected point belongs to any interval 2n , 2n should be 21n .
k k+1

random.
I Picking a point at random means thatthe probability

i that the
selected point belongs to any interval 2n , 2n should be 21n .
k k+1
I If we can make an assignment of a value P(A) for each event

A so that these conditions are satisfied we can say that we
have succesfully created a probability model for this random
experiment.
I Unfortunately this is not possible due to the following result by
S.M.Ulam.
S.M.Ulam.
I There is no function P defined for all subsets A of the unit

interval (0, 1] such
i that it is non negative, countably additive,
and P 2n , 2n = 21n .
k k+1
S.M.Ulam.

and P 2n , 2n = 21n .
k k+1
I We may ask ourselves: Can we find a particular set A for

which we fail to associate probability?
S.M.Ulam.

and P 2n , 2n = 21n .
k k+1
I We may ask ourselves: Can we find a particular set A for

which we fail to associate probability?
I The answer is No. We cannot do that.

I The condition of countable additivity is a collective condition,
not an individual condition applying to each event A. This
causes problems.
I The condition of countable additivity is a collective condition,
not an individual condition applying to each event A. This
causes problems.
I For example, consider a simple analogy: Suppose we attempt

to share 11 items among 6 persons.Then certainly we cannot
give two items to each of 6 persons. But can we name one
single person to whom we fail to give? No, simply because any
person we name we can surely start giving two items to him.
I So where did we go wrong?
I Which of our demands is so tough that it can not be met?

I Or we may ask in which of the demands we can make a

compromise?
I Or we may ask in which of the demands we can make a

compromise?
I The fact is, while revisiting the demands, each of them seem
reasonable and neccessary (for our idea to be an extension of
classical model).
I The only demand on which we have to compromise is “every
subset of Ω is an event”. Is it really necessary?
I What exactly is the purpose of our building a probability

model?

model?
I We want to answer simple questions concerning the random

experiment. So it is enough if we can assign probabilities to all
the subsets that we are likely to come across in practice.

model?

I In other words, rather than defining probabilities for all subsets,

it is enough to define probabilities to events of “our interset”.

model?

I In other words, rather than defining probabilities for all subsets,

it is enough to define probabilities to events of “our interset”.
I Thus the compromise we make here is we abandon the

assumption that every subset be an event.
I Then the question is : which subsets should be events?
I Then the question is : which subsets should be events?
I For our example, clearly each interval should be an event

because we need to answer what is the probability that the
selected point belong to some interval.
I In general, we can have some demands for events:
I ∅ and Ω must be events, otherwise we cannot answer P(∅) = 0
or P(Ω) = 1.
or P(Ω) = 1.
I If A is an event then its complement Ac should also be an
event. If you can answer ‘what are the chances of something
happening’, then we should be able to answer ‘what are the
chances of that something not happening’.
or P(Ω) = 1.
I If A is an event then its complement Ac should also be an
event. If you can answer ‘what are the chances of something
happening’, then we should be able to answer ‘what are the
chances of that something not happening’.
I If we have countably many events An then their union ∪An

should also be an event. If some one asks you ‘what are the
chances of this happening’ ‘what are the chances of that
happening’ etc then we should also answer ‘what are the
chances of one of the things happening’).
I Let B be the smallest class of subsets of Ω = (0, 1] satisfying
these conditions.
I Let B be the smallest class of subsets of Ω = (0, 1] satisfying
these conditions.
I We agree to treat sets that belong to this class as events. Sets

which are not in this class are not events.
I Two points are to be noted at this stage:
I Now every subset of Ω is not an event as we have defined it

earlier. Events are only those subsets of Ω = (0, 1] which are
in B.
I Now every subset of Ω is not an event as we have defined it

earlier. Events are only those subsets of Ω = (0, 1] which are
in B.
I We have constructed the class B by imposing certain

restrictions not by picking subsets of Ω. Except ∅ and Ω, we
are not picking any other subsets explictly, rather imposing
some restrcitions like: “if some set A is in B, then some other
particular set B must also be in B”. This is justified because
recall that the class B was meant for our convenience,
meaning the choice is subjective: I may be interested in some
event and you might be interested in some completely different
event. So we both can choose our B as per our convenience.
I With this change of attitude, to build a probability model we
need to assign probability P(A) to all events A– that is, to sets
belonging to the class B satisfying the same demands as
earlier.
earlier.
I In fact this is possible, we can actually build a probability

model.
earlier.
I In fact this is possible, we can actually build a probability

model.
I There is a function P defined for all setsin B such

i that it is
non negative, countably additive, and P 2n , 2n = 21n .
k k+1
I This example tells us some new aspects of probability:
I It is not always possible to define probability for all subsets of
the sample space Ω. Of course in certain instances, this is
possible as we have seen in case of finite Ω.
I This example tells us some new aspects of probability:
I It is not always possible to define probability for all subsets of
the sample space Ω. Of course in certain instances, this is
possible as we have seen in case of finite Ω.
I Thus we need to modify our definition of events: we cannot
say that any subset of Ω is event.
I Infact we have to redefine a class of subsets of Ω, which we
can call events.
can call events.
I It is possible to construct such a class by imposing restrictions

(rather than explicitly picking up subsets from Ω), where the
choice of restrictions arise from usual necessity to answer
different questions in probability.
can call events.
I It is possible to construct such a class by imposing restrictions

(rather than explicitly picking up subsets from Ω), where the
choice of restrictions arise from usual necessity to answer
different questions in probability.
I Once we have such a class of events, we can define probability

at one stroke to all sets in that class.

Beyond Finite Sample Space II: Presidency University

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Beyond Finite Sample Space II: Presidency University

Uploaded by

Copyright:

Available Formats

Beyond finite sample space II

I Let us consider the random experiment where we choose a real

I Let us consider the random experiment where we choose a real

I Here, every number in this interval is a possible outcome so

I Let us consider the random experiment where we choose a real

I Here, every number in this interval is a possible outcome so

I If we follow the same idea of assigning numbers p(ω) to each

I Let us consider the random experiment where we choose a real

I Here, every number in this interval is a possible outcome so

I If we follow the same idea of assigning numbers p(ω) to each

I Fix any number ω in (0, 1].

I Let us consider the random experiment where we choose a real

I Here, every number in this interval is a possible outcome so

I If we follow the same idea of assigning numbers p(ω) to each

I Fix any number ω in (0, 1].

I Since we are selecting a point at random the chances of it

I Continuing this arguemnt we find that the probability of the

I Continuing this arguemnt we find that the probability of the

I This means p(ω) ≤ 14 .

I Continuing this arguemnt we find that the probability of the

I This means p(ω) ≤ 14 .

I If we continue this way, we shall find p(ω) = 0.

I Continuing this arguemnt we find that the probability of the

I This means p(ω) ≤ 14 .

I If we continue this way, we shall find p(ω) = 0.

I Thus if you try to associate real numbers to outcomes you

I The idea is we should not try to associate real number to

I The idea is we should not try to associate real number to

I But then what are events?

I The idea is we should not try to associate real number to

I But then what are events?

I Imitation being the first choice (because we do not have any

I The idea is we should not try to associate real number to

I But then what are events?

I Imitation being the first choice (because we do not have any

I This means that for every subset A of Ω, we want to associate

I For every subset A of Ω = (0, 1], we want to associate a

I Now there is another condition that reflects the fact that we

I Now there is another condition that reflects the fact that we

I Picking a point at random means thatthe probability

I Now there is another condition that reflects the fact that we

I Picking a point at random means thatthe probability

I If we can make an assignment of a value P(A) for each event

I There is no function P defined for all subsets A of the unit

I There is no function P defined for all subsets A of the unit

I We may ask ourselves: Can we find a particular set A for

I There is no function P defined for all subsets A of the unit

I We may ask ourselves: Can we find a particular set A for

I The answer is No. We cannot do that.

I For example, consider a simple analogy: Suppose we attempt

I Which of our demands is so tough that it can not be met?

I Which of our demands is so tough that it can not be met?

I Or we may ask in which of the demands we can make a

I Which of our demands is so tough that it can not be met?

I Or we may ask in which of the demands we can make a

I What exactly is the purpose of our building a probability

I What exactly is the purpose of our building a probability

I We want to answer simple questions concerning the random

I What exactly is the purpose of our building a probability

I We want to answer simple questions concerning the random

I In other words, rather than defining probabilities for all subsets,

I What exactly is the purpose of our building a probability

I Picking a point at random means thatthe probability

I Picking a point at random means thatthe probability

I There is a function P defined for all setsin B such