Professional Documents
Culture Documents
Sampling
Distributions
Chapter 7
Stat 101 2nd Semester AY 2020-2021
Concept of a
Random Sample
It is denoted by (𝑋!, 𝑋", … , 𝑋# ).
Random Sample
• Using a particular probability sampling method
and collecting 𝑛 observations, we have the
random sample 𝑿𝟏 , 𝑿𝟐 , … , 𝑿𝒏 , where
infinite
finite population versus
population
Random Sample of size 𝑛
from a Finite Population
• Suppose we select 𝑛 distinct elements from a population
consisting of 𝑁 elements, using a particular probability
sampling method. Let
X1 = measure taken from the 1st element in the sample
X2 = measure taken from the 2nd element in the sample
⋮
Xn = measure taken from the nth element in the sample
i ndependent &
i dentically
d istributed
EXAMPLE
• We have mentioned that a sample selected using simple random
sampling WITH replacement (SRSWR) qualifies as a random
sample from an infinite population.
• In the first example, we can see that even though our physical
population is finite, we can still have a viewpoint of sampling from
an infinite population. SRSWR seems to create that illusion!
• In the second example, we can still see that even though our
physical population is finite, we can have the viewpoint of
sampling from an infinite population as long as they satisfy the IID
requirement.
Remarks
• In summary, random sampling from a finite or an
infinite population DOES NOT depend on the physical
population from where we take our sample, nor in the
possible values of 𝑋$ (just like how we classify discrete
vs continuous).
This is still a r.s. from an infinite pop’n This is a r.s. from an infinite pop’n
EVEN IF each 𝑿𝒊 is discrete. This is NOT because each 𝑿𝒊 is normally
because each 𝑿𝒊 satisfies the IID distributed and can be any value from
requirement. − ∞ to ∞, BUT BECAUSE each 𝑿𝒊
satisfies the IID requirement.
Basic Concepts
Statistic, Sampling Distribution, Standard Error
Statistic
• Suppose (𝑋6 , 𝑋7 , … , 𝑋8 ) is a random sample.
% %
1 1
𝑋9 = ; 𝑋$ and 𝑆" = ; 𝑋$ − 𝑋9 "
𝑛 𝑛−1
$&! $&!
There are two candidates vying for the mayor position, say Jojo and
Junjun. What we do not know is that Voters A, B, C, and D have
already decided to elect Jojo, while voters E and F will elect Junjun.
B! CB" CB# EFG HI#J G$KK LFMN OFP QFRF $# MEN SIHTKN
&
𝑋= =
D MFMIK #UHVNP FO LFMNPS $# MEN SIHTKN
As we know, this proportion can be viewed as a probability of Jojo winning the election.
That’s why we are interested with the sampling distribution of 𝑋$ and that is how we
come up with inferences about the results of the election.
EXAMPLE (cont.)
> , where we let
Construct the sampling distribution of 𝑿
1 4 2 12 3 4
𝑃 𝑋9 = = , 𝑃 𝑋9 = = , 𝑃 𝑋9 = =
3 20 3 20 3 20
Simplifying, we have
x9 1/3 2/3 1
p(9x)=P(X9 =9x) 1 3 1
5 5 5
Remarks
• If we have a random sample from a finite population, then the
statistic of interest will have a CDF or a PMF.
• This is also the reason why we always ask the question “What is
> ?”
the distribution of 𝑿
Standard Error
• The standard deviation of a statistic is called its
standard error.
1
unknown mean is 2.4. Since it is unknown, we may want to
2 3
guess its value by taking a sample, say of size 𝑛 = 5, then use
the sample mean 𝑋$ to come up with a guess. Take note that
2.4 5 we look at the 4 possible samples given below, and for each
$
one, we compute the value of 𝑋.
To illustrate,
x9 1/3 2/3 1
p(9x)=P(X9 =9x) 1 3 1
5 5 5
EXAMPLE (cont.)
x9 1/3 2/3 1
Sampling Distribution of 𝑋9
p(9x)=P(X9 =9x) 1 3 1
5 5 5
"
c. The standard error of 𝑋W is 𝑠. 𝑒. 𝑋W = W =
𝑉𝑎𝑟(𝑋) = 0.2108
)(
EXERCISE # 1
Suppose the sampling distribution of 𝑋! is as follows:
!
a. What is the mean of 𝑋?
!
b. What is the standard error of 𝑋?
Aside, 𝐸 𝑋W " = 5 " 0.1 + 5.5 " 0.2 + 6 " 0.25 + 6.5 " 0.3
" "
+ 7 0.1 + 7.5 0.05 = 37.9375
J$
( = 𝜇 and
mean 𝐸(𝑋) variance 𝑉𝑎𝑟 𝑋( =
8
approximate distribution of XW
population from
where the random
sample comes from
rule of thumb
ILLUSTRATION
Even if the population from where the sample came from looks like one of these…
given a sufficiently large sample size (n > 30, as our rule of thumb only)
Remarks
• From the result of the Central Limit Theorem (CLT) that
!"
𝑋! ≈ 𝑁𝑜𝑟𝑚𝑎𝑙 𝜇, as long as 𝑛 is sufficiently large, we
"
can standardize the random variable 𝑋! by subtracting its
!" !
mean 𝜇 and dividing by its standard deviation = .
" "
Thus, we know that
𝑋# − 𝜇
𝑍=𝜎 ≈ 𝑁(0,1)
' 𝑛
Remarks
• The normal approximation will hold even if the
distribution of the population from where the sample
came from is either discrete or continuous.
𝑋W − 𝜇 998 − 1000
𝑃 𝑋W > 998 = 1 – 𝑃 𝑋W ≤ 998 ≈ 1 – 𝑃 ≤
𝜎 "h 6.25
𝑛
= 1 – 𝑃(𝑍 ≤ −0.8)
= 1 – 0.2119 = 0.7881
EXAMPLE (explained)
A random sample of size 100 is taken from a large population with
> > 𝟗𝟗𝟖).
mean 𝝁 = 1,000 and variance 𝝈𝟐 = 625. Approximate 𝑷(𝑿
𝑋W − 𝜇 998 − 1000
𝑃 𝑋W > 998 = 1 – 𝑃 𝑋W ≤ 998 ≈ 1 – 𝑃 ≤
𝜎 "h 6.25
Here, we just standardize 𝑋/ because we 𝑛
know what it’s approximately normal. To
standardize is to subtract the mean and then This is the CDF of 𝑍
divide by the standard deviation. The mean of = 1 – 𝑃(𝑍 ≤ −0.8) evaluated at -0.8.
!"
𝑋/ is 𝜇 and its std. dev. is . Hence, to
"
! $
standardize is to let 𝑍 =
"#
. = 1 – 0.2119 = 0.7881
$%%
&
This is obtained from the 𝑧-table. Look at
Row -0.8 and Column 0.00.
EXAMPLE
Suppose the average time it takes a large group of students to
complete a certain exam is 46.2 minutes with a standard deviation of
15 minutes. Find the probability that a class of 50 students taking that
exam has a mean time of completing it in less than 40 minutes.
Let 𝑋. denote the length of time of the 𝑖 %& student to complete a certain exam.
Each 𝑋. is taken from a large group of students with mean time of completing
the exam 𝜇 = 46.2 and standard deviation 𝜎 = 15.
Also, 𝑋W is the sample mean time of 50 randomly selected students completing
the exam. By the Central Limit Theorem, 𝑋W will be approximately normally
! #
W = 𝜇 = 46.2 and 𝑉𝑎𝑟(𝑋)
distributed with 𝐸(𝑋) W = + = !( .
# (-
EXAMPLE
Suppose the average time it takes a large group of students to
complete a certain exam is 46.2 minutes with a standard deviation of
15 minutes. Find the probability that a class of 50 students taking that
exam has a mean time of completing it in less than 40 minutes.
𝑋W − 𝜇 40 − 46.2
𝑃 𝑋W < 40 ≈ 𝑃 <
𝜎 "h 15"h
𝑛 50
= 0.0018
EXERCISE # 2
An electrical firm manufactures electric light bulbs
that have a length of life with mean and standard
deviation equal to 500 and 50 hours, respectively.
Let 𝑋. denote the length of life of the 𝑖 %& light bulb. Each 𝑋. is taken from a
large manufacturing of bulbs with mean lifetime 𝜇 = 500 and standard
deviation 𝜎 = 50.
Also, 𝑋W is the sample mean length of life of 35 randomly selected light bulbs.
By the Central Limit Theorem, 𝑋W will be approximately normally distributed
+! (-#
W W
with 𝐸(𝑋) = 𝜇 = 500 and 𝑉𝑎𝑟(𝑋) = = .
# '(
EXERCISE # 2
An electrical firm manufactures electric light bulbs that have a length
of life with mean and standard deviation equal to 500 and 50 hours,
respectively. Find the probability that a random sample of 35 bulbs will
have an average life greater than 475 hours.
𝑋W − 𝜇 475 − 500
𝑃 𝑋W > 475 ≈ 𝑃 ≤
𝜎 "h 50"h
𝑛 35
= 1 – 0.0015 = 0.9985
Remarks
• Again, the theorem does NOT require that the
random sample comes from a normal distribution.
• In the previous examples, we don’t have any
distribution at all to start with (we just know the
mean and variance of a large or infinite population),
yet we can still have an approximate distribution for
" just invoke the Central Limit Theorem! We can do
𝑋;
this since the sample sizes we have in those
examples are large enough based on our rule of
thumb.
t-distribution &
𝟐
𝝌 -distribution
In addition to the normal distribution, these are two special
distributions that are also commonly used in Inferential
Statistics.
t-distribution
X ~ t(v)
bell-
shaped
as the
symmetric larger
parameter: degrees of
about 0
variance degrees of
freedom
freedom
tails increases
approach
the x-axis
how it becomes
similarity with the difference with the almost similar to
standard normal standard normal the standard
normal
VISUALIZATION
Reading the 𝒕-table
• The 𝑡 -table gives the values of 𝑡Q,R , or the
100 1 − 𝛼 ST percentiles of 𝑋~𝑡(𝑣), for values
of 𝛼 = 0.10, 0.05, 0.025,0.01, and 0.005 and 𝑣
degrees of freedom from 1 to 30. That is why it
is labeled as “100(1-𝛼)th Percentiles of the t-
distribution”
Reading the 𝒕-table
• 𝑡Q,R is just a number in the x-axis such that its
AREA TO THE RIGHT is 𝛼.
EXAMPLE
We see from the table that 𝑡U.UW,X = 2.132.
To see this, just get the intersection of Row 4 and Column 0.05.
a. 𝑡j.j"k,!j
b. 𝑡j.!j,!"
c. 𝑡j.jk,!k
EXERCISE # 3
Suppose 𝑋 ~ 𝑡(𝑣). Find the value of the following:
a. 𝑡j.j"k,!j = 2.228
b. 𝑡j.!j,!" = 1.356
c. 𝑡j.jk,!k = 1.753
𝟐
𝝌 -distribution
!
X ~ 𝜒 (v)
almost
parameter: positive for symmetric
degrees of skewed to
freedom ℝ2 only the right as df
increases
VISUALIZATION
Reading the 𝟐
𝝌 -table
• The 𝜒 7 -table gives the values of 𝜒Q,R7
, or the
100 1 − 𝛼 ST percentiles of 𝑋~𝜒 7 (𝑣) , for
values of 𝛼
= 0.10, 0.05, 0.025,0.01,0.005, 0.90, 0.95,
0.975, 0.99, and 0.995 and 𝑣 degrees of
freedom from 1 to 30. That is why it is labeled
as “100(1-𝛼 )th Percentiles of the Chi-Square
distribution”
Reading the 𝟐
𝝌 -table
7
• 𝜒Q,R is just a number in the x-axis such that its
AREA TO THE RIGHT is 𝛼.
EXAMPLE
7
We see from the table that 𝜒U.UW,X = 9.488.
To see this, just get the intersection of Row 4 and Column 0.05.
"
Thus, 𝜒j.!,k = 9.236.
EXAMPLE
7
Find the value of 𝜒U.ZZ,Y .
"
Thus, 𝜒j.ll,D = 0.115.
EXERCISE # 4
Suppose 𝑋 ~ 𝜒 "(𝑣). Find the value of the following:
"
a. 𝜒j.j"k,!j
"
b. 𝜒j.!j,!"
"
c. 𝜒j.jk,!k
EXERCISE # 4
Suppose 𝑋 ~ 𝜒 "(𝑣). Find the value of the following:
"
a. 𝜒j.j"k,!j = 20.483
"
b. 𝜒j.!j,!" = 18.549
"
c. 𝜒j.jk,!k = 24.996
Sampling from
Normal
Distribution
Now, we assume that the distribution of the population
from where we obtain the random sample is normally
distributed with mean 𝜇 and variance 𝜎 ".
Sampling from the Normal
Distribution
• Suppose (𝑋6 , 𝑋7 , … , 𝑋8 ) is a random sample
such that 𝑋[ ~𝑁𝑜𝑟𝑚𝑎𝑙(𝜇, 𝜎 7 ) for 𝑖 = 1,2, … , 𝑛.
J#
(
Then, 𝑋~𝑁𝑜𝑟𝑚𝑎𝑙 𝜇, .
8
VISUALIZATION
𝑋!~𝑁𝑜𝑟𝑚𝑎𝑙(𝜇, 𝜎 ")
+
𝑋"~𝑁𝑜𝑟𝑚𝑎𝑙(𝜇, 𝜎 ")
+
𝑋D~𝑁𝑜𝑟𝑚𝑎𝑙(𝜇, 𝜎 ")
+
⋮
+
𝑋# ~𝑁𝑜𝑟𝑚𝑎𝑙(𝜇, 𝜎 ")
_________________________________________________
𝑋! + 𝑋"+. . . +𝑋# 𝜎"
?
= 𝑋~𝑁𝑜𝑟𝑚𝑎𝑙 𝜇,
𝑛 𝑛
Remarks
• It’s tempting to say that this is just the result of the
Central Limit Theorem. However, there are differences!!
p" Dj
?
Thus, 𝑋~𝑁𝑜𝑟𝑚𝑎𝑙 𝜇= 72, # = ?
= 3 , or 𝑋~𝑁(72,3).
!j
EXAMPLE
IQ is normally distributed with mean 100 and standard
deviation 20. What is the probability of selecting a random
sample of size 100 people with mean IQ larger than 105?
𝑋? − 𝜇 105 − 100
𝑃(𝑋? > 105) = 𝑃 >
𝜎 "F 4
𝑛
= 𝑃 𝑍 > 2.50
= 1 − 𝑃 𝑍 ≤ 2.50
= 1 − 0.9938
= 0.0062
EXERCISE # 5
An electrical firm manufactures electric light bulbs that have a length of
life that is normally distributed with mean and standard deviation equal
to 500 and 50 hours, respectively. Find the probability that a random
sample of 16 bulbs will have an average life greater than 475 hours.
Let 𝑋, denote the length of life of the 𝑖 -. light bulb. We know that
𝑋! ~𝑁 𝜇 = 500, 𝜎 " = 50" , 𝑋" ~𝑁 𝜇 = 500, 𝜎 " = 50" , …,
𝑋!/ ~𝑁 𝜇 = 500, 𝜎 " = 50" .
NOTE: The example is quite similar to that of Exercise # 2, but now, we know that each 𝑋$ is
normally distributed and the sample size is smaller (𝑛 = 16). Although we cannot invoke the
Central Limit Theorem because of the small sample size, we still have a distribution for 𝑋/
' %
$
because of our result here in this section – that 𝑋~𝑁𝑜𝑟𝑚𝑎𝑙 𝜇, ( given (𝑋% , 𝑋& , … , 𝑋" ) a
&
random sample from 𝑁𝑜𝑟𝑚𝑎𝑙(𝜇, 𝜎 ) distribution.
EXERCISE # 5
An electrical firm manufactures electric light bulbs that have a length of
life that is normally distributed with mean and standard deviation equal
to 500 and 50 hours, respectively. Find the probability that a random
sample of 16 bulbs will have an average life greater than 475 hours.
𝑋9 − 𝜇 475 − 500
𝑃(𝑋9 > 475) = 𝑃 >
𝜎 "^ 50"^
𝑛 16
= 𝑃 𝑍 > −2
= 1 − 𝑃 𝑍 ≤ −2
= 1 − 0.0228
= 0.9772
Sampling Distribution of
Some Common Statistics
Sampling Distributions of Statistics
based on a random sample from a Normal Distribution