Professional Documents
Culture Documents
Statistics For Life and Social Sciences: Chapter 5: Introduction To Inference
Statistics For Life and Social Sciences: Chapter 5: Introduction To Inference
February 1, 2021
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 1 / 30
Content
1 Introduction
3 Tests of significance
Stating hypotheses
Test statistics
P-values
Statistical significance
Tests for a population mean
Tests for a population proportion
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 2 / 30
Introduction
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 3 / 30
Introduction
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 4 / 30
Confidence intervals
Confidence interval
A level C (0.90, 0.95, 0.99) confidence interval for a parameter is an interval
computed from sample data by a method that has probability C of producing an
interval containing the true value of the parameter.
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 5 / 30
Confidence interval for a population mean
Here z ∗ is the value on the standard Normal curve with area C between the
critical points −z ∗ and z ∗ . The level C confidence interval for µ is
x±m (2)
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 6 / 30
Estimating with confidence
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 7 / 30
Estimating with confidence
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 8 / 30
Estimating with confidence
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 9 / 30
Estimating with confidence
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 10 / 30
Estimating with confidence
Fuel efficiency
Computers in some vehicles calculate various quantities related to performance.
One of these is the fuel efficiency, or gas mileage, usually expressed as miles per
gallon (mpg). For one vehicle equipped in this way, the mpg were recorded each
time the gas tank was filled, and the computer was then reset. Here are the mpg
values for a random sample of 20 of these records:
41.5 50.7 36.6 37.3 34.2 45.0 48.0 43.2 47.7 42.2
43.2 44.6 48.4 46.4 46.8 39.2 37.3 43.5 44.3 43.3
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 11 / 30
Estimating with confidence
Standard error
When the standard deviation of a statistic is estimated from the data, the result is
called the standard error of the statistic. The standard error of the sample mean
is
s
SEx = √ (3)
n
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 12 / 30
Estimating with confidence
The t distributions
Suppose that an SRS of size n is drawn from an N (µ, σ) population. Then the
one-sample t statistic
x−µ
t= √ (4)
s/ n
has the t distribution with n − 1 degrees of freedom.
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 13 / 30
Estimating with confidence
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 14 / 30
Confidence interval for a population mean
where t∗ is the value for the t(n − 1) density curve with area C between −t∗ and
t∗ . The quantity
s
t∗ √ (6)
n
is the margin of error. This interval is exact when the population distribution is
Normal and is approximately correct for large n in other cases.
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 15 / 30
Estimating with confidence
Apartment rents
You randomly choose 15 unfurnished one-bedroom apartments from a large
number of advertisements in your local newspaper. You calculate that their mean
monthly rent is $570 and their standard deviation is $105.
1 What is the standard error of the mean?
2 What are the degrees of freedom for a one-sample t statistic?
What critical value t∗ from Table D should be used to construct
1 a 95% confidence interval when n = 12?
2 a 99% confidence interval when n = 24?
3 a 90% confidence interval when n = 200?
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 16 / 30
Estimating with confidence
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 17 / 30
Choosing the sample size
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 18 / 30
Confidence interval for a population proportion
m = z ∗ SEp̂ (10)
where z ∗ is the value for the standard Normal density curve with area C between
−z ∗ and z ∗ .
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 19 / 30
Confidence interval for a population proportion
p̂ ± m (11)
Use this interval for 90%, 95%, or 99% confidence when the number of successes
and the number of failures are both at least 15.
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 20 / 30
Stating hypotheses
Null hypothesis
The statement being tested in a test of significance is called the null hypothesis.
The test of significance is designed to assess the strength of the evidence against
the null hypothesis. Usually the null hypothesis is a statement of “no effect” or
“no difference.”
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 21 / 30
Test statistics
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 22 / 30
P-values
P-value
The probability, assuming H0 is true, that the test statistic would take a value as
extreme or more extreme than that actually observed is called the P-value of the
test. The smaller the P-value, the stronger the evidence against H0 provided by
the data.
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 23 / 30
Statistical significance
Statistical significance
If the P-value is as small or smaller than α, we say that the data are statistically
significant at level α.
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 24 / 30
Tests for a population mean
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 25 / 30
Tests for a population mean
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 26 / 30
Tests for a population mean
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 27 / 30
Tests for a population mean
Doctor Visits
A report by the Gallup Poll stated that on average a woman visits her physician 5.8
times a year. A researcher randomly selects 20 women and obtained these data.
3 2 1 3 7 2 9 4 6 6
8 0 5 6 4 2 1 3 4 1
At α = 0.05 can it be concluded that the average is still 5.8 visits per year?
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 28 / 30
Tests for a population proportion
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 29 / 30
Tests for a population proportion
Thach Thanh Tien (Ton Duc Thang University) STATISTICS FOR LIFE AND SOCIAL SCIENCES February 1, 2021 30 / 30