Professional Documents
Culture Documents
sample
vs
population
When analyzing data (or reading the literature), think about
whether you want to discuss the sample that you observed or want
to make statements that are more generally true.
Option 1: Discuss the 40 women. What was the mean T cell count?
What was its variation?
Sample
• Size = n
• Sample Mean 1 n
X Xj
n j 1
• Sample variance = 1 2
X j X
n
s2
n 1 j 1
Population
• Size = N (usually )
• Mean pjX j or
• Variance
p j X j
2 2
or
Summer Institutes 2020 Module 1, Session 5 4
Generalizing the sample to the population
Example:
X1, X2, X3 follow F-distributions.
X1+ X2+ X3 does not follow an F-distribution
(X1+ X2+ X3)/3 does not follow an F-distribution
If X1, X2, ..., Xn are independent and have the same distribution, and the
variance of that distribution is σ2, then if n is large,
Population of X’s
(mean = )
en
f s i ze n
sample of size n
en
si z
s i ze
iz
of
fs
le of
le
o
s amp l e o
mp
le
mp
s am p
sa
sa
X X X X X
.4
.3
.2
.1
-4 -2 0 2 4
X is not
normally
Fraction
distributed…
0
0 10 20 30
x
0
0 5 10 15
Means of size 5
Fraction
0
2 4 6
Means of size 10
8 …and the means of
X for n=10 and 30
.19
become closer and
closer to normally
distributed
Fraction
0
3 4 5 6 7
Means of size 30
Example:
X / n 2.8 / 49 0.4
X 5.4
X X 6 5.4
P X 6 P
X 0. 4
PZ 1.5 0.0668
= 0.00668
means that
• The true parameter θ has probability at least 0.95 of being between
L and U.
• In repeated samples, 95% of the resulting confidence intervals
would contain θ.”
Because
we know that
1
X QZ
2
, X QZ
2
n n
Summer Institutes 2020 Module 1, Session 5 20
Confidence Intervals Example
1.96
(277.35, 281.65)
e.g., https://stattrek.com/online-calculator/t-distribution.aspx