Professional Documents
Culture Documents
Point Estimation
Questions
What is a sampling distribution?
What is the standard error?
What is the principle of maximum
likelihood?
What is bias (in the statistical sense)?
What is a confidence interval?
What is the central limit theorem?
Why is the number 1.96 a big deal?
Population
Population & Sample Space
Population vs. sample
Population parameter, sample statistic
Key things to keep in mind
Population- what we want to talk about
Sample- what we have with our data
Sampling distribution- the means by
which we will go from our sample to
the population
Parameter Estimation
We use statistics to estimate parameters,
e.g.,
effectiveness of pilot training,
effectiveness of psychotherapy.
X SD
A Sampling Distribution
From Vogt:
47%
43% Your Sample Mean
51%
A Sampling Distribution
Lets create a sampling distribution of means
Take a sample of size 1,500 from the US. Record the mean
income. Our census said the mean is $30K.
$30K
A Sampling Distribution
Lets create a sampling distribution of means
Take another sample of size 1,500 from the US. Record the mean
income. Our census said the mean is $30K.
$30K
A Sampling Distribution
Lets create a sampling distribution of means
Take another sample of size 1,500 from the US. Record the mean
income. Our census said the mean is $30K.
$30K
A Sampling Distribution
Lets create a sampling distribution of means
Take another sample of size 1,500 from the US. Record the mean
income. Our census said the mean is $30K.
$30K
A Sampling Distribution
Lets create a sampling distribution of means
Take another sample of size 1,500 from the US. Record the mean
income. Our census said the mean is $30K.
$30K
A Sampling Distribution
Lets create a sampling distribution of means
Take another sample of size 1,500 from the US. Record the mean
income. Our census said the mean is $30K.
$30K
A Sampling Distribution
Lets create a sampling distribution of means
Lets repeat sampling of sizes 1,500 from the US. Record the mean
incomes. Our census said the mean is $30K.
$30K
A Sampling Distribution
Lets create a sampling distribution of means
Lets repeat sampling of sizes 1,500 from the US. Record the mean
incomes. Our census said the mean is $30K.
$30K
A Sampling Distribution
Lets create a sampling distribution of means
Lets repeat sampling of sizes 1,500 from the US. Record the mean
incomes. Our census said the mean is $30K.
$30K
A Sampling Distribution
Lets create a sampling distribution of means
Lets repeat sampling of sizes 1,500 from the US. Record the mean
incomes. Our census said the mean is $30K.
$30K
A Sampling Distribution
Say that the standard deviation of this distribution is $10K.
Think back to the empirical rule. What are the odds you would get
a sample mean that is more than $20K off.
$30K
-3z -2z -1z 0z 1z 2z 3z
A Sampling Distribution
Say that the standard deviation of this distribution is $10K.
Think back to the empirical rule. What are the odds you would get
a sample mean that is more than $20K off.
$30K
-3z -2z -1z 0z 1z 2z 3z
Suppose
Population has 6 elements: 1, 2, 3, 4, 5,
6 (like numbers on dice)
We want to find the sampling
distribution of the mean for N=2
If we sample with replacement, what
can happen?
1st 2nd M 1st 2nd M 1st 2nd M
1 1 3 1 5 1
1 2 3
1 2 3 2 5 2
1.5 2.5 3.5
1 3 3 3 5 3
2 3 4
1 4 3 4 5 4
2.5 3.5 4.5
1 5 3 5 5 5
3 4 5
1 6 3 6 5 6
3.5 4.5 5.5
2 1 1.5 4 1 2.5 6 1 3.5
2 2 4 2 6 2
2 3 4
2 3 4 3 6 3
2.5 3.5 4.5
2 4 4 4 6 4
3 4 5
2 5 4 5 6 5
3.5 4.5 5.5
2 6 4 6 6 6
4 5 6
Possible Outcomes
Possible Outcomes
Histogram
Sampling
distribution for
mean of 2 dice.
1+2+3+4+5+6 = 21.
21/6 = 3.5
There is only 1
way to get a
mean of 1, but 6
ways to get a
mean of 3.5.
Sampling Distribution (2)
The sampling distribution shows the relation
between the probability of a statistic and the
statistics value for all possible samples of
size N drawn from a population.
Hypothetical Distribution of Sample Means
f(M)
Mean Value
Sampling Distribution Mean
and SD
The Mean of the sampling distribution is
defined the same way as any other
distribution (expected value).
The SD of the sampling distribution is the
Standard Error. Important and useful.
Variance of sampling distribution is the
expected value of the squared difference a
mean square.
Review
G2 E (G G ) 2
Review
Theta (p value)
Maximum Likelihood (4)
In example, best (max like) estimate
would be 9/15 = .60.
There is a general class called
maximum likelihood estimators that
find values of theta that maximizes the
likelihood of a sample result.
ML is one principle of goodness of an
estimator
More Goodness (1)
Bias.
If E(statistic)=parameter, the estimator
is unbiased.
If its unbiased, the mean of the
sampling distribution equals the
parameter.
The sample mean has this property:
. E( X )
Sample variance is biased.
More Goodness (2)
Efficiency size of the sampling variance.
Relative Efficiency. Relative efficiency is the
ratio of two sampling variances.
2 H
efficiency of G relative to H
2
G
Standard Error of the Mean: M
N
Law of large numbers:
Large samples produce sample estimates
very close to the parameter.
Unbiased Estimate of
Variance
It can be shown that: E (S 2
) 2 2 N 1 2
N N
N 1 N 1