Contents
Student's t Distribution
How to use SPSS
Basic concepts
Sample
A subset of a population (hopefully
representative)
Statistic
A characteristic of the sample
Often denoted with English letters (x-bar, s, p, r)
Examples:
The mean of the sample. (x-bar)
The standard deviation of the sample. (s)
Populations and Samples
Two steps
Descriptive Statistics
Describe the sample
Inference
Make inferences about the population
using what is observed in the sample
Primarily performed in two ways:
Estimation
Hypothesis testing
Basic concepts
Estimation
Point Estimation
Interval Estimation
Confidence Interval
Estimation Process
Interval Estimation:
A technique that provides a range of reasonable
values intended to contain the population
parameter with a certain degree of confidence.
This range of values is called a confidence interval.
Confidence Interval Estimate
Level of confidence
Confidence that the interval contains
the unknown population parameter
Precision (range)
Closeness to the unknown parameter
Cost
Cost required to obtain a sample of size n
Determining Sample Size (Cost)
Confidence Intervals
σ Known
σ Unknown
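Two-sided Confidence Interval
$$\bar{X} = \frac{1}{n}\sum_{i=1}^{n} X_i, \qquad \bar{X} \sim N\left(\mu, \frac{\sigma^2}{n}\right)$$
The central limit theorem states that
$$Z = \frac{\bar{X} - \mu}{\sigma/\sqrt{n}} \sim N(0, 1)$$
has a standard normal distribution.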
Two-sided Confidence Interval
$$P\left(-1.96 < \frac{\bar{X} - \mu}{\sigma/\sqrt{n}} < 1.96\right) = 0.95$$
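The 0.95 in this probability statement can be checked numerically. The sketch below uses Python's standard library only; the population values (mu = 50, sigma = 10), the sample size, the number of trials and the seed are illustrative assumptions, not values from the slides.

```python
import random
from statistics import NormalDist, fmean

# Exact probability that a standard normal Z falls in (-1.96, 1.96).
std_normal = NormalDist(0, 1)
prob = std_normal.cdf(1.96) - std_normal.cdf(-1.96)

def coverage(mu, sigma, n, trials, seed=0):
    """Fraction of simulated samples whose standardized mean lies in (-1.96, 1.96)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # Draw one sample of size n and standardize its mean.
        xbar = fmean(rng.gauss(mu, sigma) for _ in range(n))
        z = (xbar - mu) / (sigma / n ** 0.5)
        if -1.96 < z < 1.96:
            hits += 1
    return hits / trials

print(round(prob, 4))
print(coverage(mu=50, sigma=10, n=25, trials=2000))  # hypothetical population
```

The simulated fraction should land close to 0.95, varying slightly with the seed and the number of trials.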
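Interval Estimate of μ
$$\bar{X} \pm z_{\alpha/2}\,\frac{\sigma}{\sqrt{n}}, \qquad \bar{X} = \frac{1}{n}\sum_{i=1}^{n} X_i$$
For 95% confidence:
$$\left(\bar{x} - 1.96\,\frac{\sigma}{\sqrt{n}},\ \bar{x} + 1.96\,\frac{\sigma}{\sqrt{n}}\right)$$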
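A minimal sketch of the σ-known interval in Python, assuming the standard library's `statistics.NormalDist` for the critical value. The function name `z_interval` and the inputs in the usage line are hypothetical, chosen only to illustrate the formula.

```python
from math import sqrt
from statistics import NormalDist

def z_interval(xbar, sigma, n, confidence=0.95):
    """Two-sided interval for the mean when sigma is known."""
    alpha = 1 - confidence
    z = NormalDist().inv_cdf(1 - alpha / 2)  # z_{alpha/2}, about 1.96 for 95%
    margin = z * sigma / sqrt(n)
    return xbar - margin, xbar + margin

# Hypothetical inputs, not taken from the slides.
low, high = z_interval(xbar=100.0, sigma=15.0, n=36)
print(round(low, 2), round(high, 2))
```

Passing a different `confidence` changes only the critical value; the rest of the calculation is unchanged.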
Margin of Error
$$E = z_{\alpha/2}\,\frac{\sigma}{\sqrt{n}}$$
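The margin of error also links precision to sample size: solving E = z σ/√n for n gives n = (z σ / E)², rounded up. The slides name sample-size determination but do not show this step, so the sketch below is an assumption about what is meant; the function names and the inputs (σ = 10, E = 2) are hypothetical.

```python
from math import ceil, sqrt
from statistics import NormalDist

def margin_of_error(sigma, n, confidence=0.95):
    """E = z_{alpha/2} * sigma / sqrt(n)."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return z * sigma / sqrt(n)

def required_sample_size(sigma, target_margin, confidence=0.95):
    """Smallest n whose margin of error does not exceed target_margin."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return ceil((z * sigma / target_margin) ** 2)

# Hypothetical inputs: sigma = 10, desired margin of error E = 2.
n_needed = required_sample_size(sigma=10, target_margin=2)
print(n_needed, round(margin_of_error(10, n_needed), 3))
```

Halving the target margin roughly quadruples the required n, which is where the cost consideration enters.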
29 20 26 21 25 24 16 27 20 33 28 16 16 19 27 24 28
24 26 24 29 20 23 27 25 16 25 22 30 24 25 27 23 30
24 19 23 20 30 24 28 30 17 21 22 28 27 32 22
$$P\left(\bar{X} - 1.645\,\frac{\sigma}{\sqrt{n}} < \mu\right) = 0.95$$
$$\bar{X} - 1.645\,\frac{\sigma}{\sqrt{n}} < \mu$$
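A one-sided bound puts the whole 5% in one tail, which is why 1.645 replaces 1.96. A small sketch, assuming σ is known; the function name and the inputs in the usage line are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def lower_confidence_bound(xbar, sigma, n, confidence=0.95):
    """One-sided lower bound for the mean when sigma is known."""
    z = NormalDist().inv_cdf(confidence)  # z_{alpha}, about 1.645 for 95%
    return xbar - z * sigma / sqrt(n)

# Hypothetical inputs, not taken from the slides.
bound = lower_confidence_bound(xbar=24.2, sigma=4.0, n=49)
print(round(bound, 2))
```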
Example
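t distribution
[Figure: density curves of the standard normal distribution and of t distributions with df = 20 and df = 10, plotted against z, t and centered at 0. The curves are bell-shaped and symmetric; the t distributions have 'fatter' tails.]
t Distribution
[Figure: t distribution compared with the standard normal distribution; horizontal axis: z values.]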
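The 'fatter tails' can be made concrete by comparing the probability beyond 1.96 under each curve. The sketch below integrates the t density numerically with Simpson's rule, using only the standard library; the helper names `t_pdf` and `t_upper_tail` are hypothetical.

```python
import math
from statistics import NormalDist

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)

def t_upper_tail(x, df, steps=1000):
    """P(T > x) for x >= 0, via Simpson's rule on [0, x] and symmetry about 0."""
    h = x / steps
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

normal_tail = 1 - NormalDist().cdf(1.96)
for df in (10, 20):
    print("t, df =", df, round(t_upper_tail(1.96, df), 4))
print("standard normal", round(normal_tail, 4))
```

The tail area shrinks toward the normal value as the degrees of freedom increase.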
Interval Estimation of a Population Mean:
σ Unknown: Two-sided
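Interval Estimate
$$\bar{X} \pm t_{\alpha/2,\,n-1}\,\frac{s}{\sqrt{n}}$$
$$\bar{X} = \frac{1}{n}\sum_{i=1}^{n} X_i, \qquad s = \sqrt{\frac{\sum_{i=1}^{n} (X_i - \bar{X})^2}{n-1}}$$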
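The sample mean and sample standard deviation can be computed directly from their definitions. The sketch below does so and cross-checks against `statistics.stdev`, which also divides by n - 1; the function name is hypothetical, and the short list is just the first eight values of the data set listed earlier.

```python
from math import sqrt
from statistics import stdev

def sample_mean_and_sd(values):
    """Return (x-bar, s) with s using the n - 1 divisor."""
    n = len(values)
    xbar = sum(values) / n
    s = sqrt(sum((x - xbar) ** 2 for x in values) / (n - 1))
    return xbar, s

sample = [29, 20, 26, 21, 25, 24, 16, 27]  # first eight values of the data set
xbar, s = sample_mean_and_sd(sample)
print(xbar, round(s, 3), round(stdev(sample), 3))
```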
Example:
Consider a random sample of 16 children selected
from the population of infants receiving antacids that
contain aluminum. These antacids are often used to
treat peptic or digestive disorders. The distribution of
plasma aluminum levels is known to be approximately
normal; however, its mean μ and standard deviation σ
are not known. The mean aluminum level for the
sample of 16 infants is 37.2 μg/l and the sample
standard deviation is s= 7.13 μg/l.
Interval Estimate of a Population
Mean: σ Unknown Two-sided
$$\bar{X} \pm t_{0.025,\,15}\,\frac{s}{\sqrt{n}} = 37.2 \pm 2.131 \cdot \frac{7.13}{\sqrt{16}} = 37.2 \pm 3.799$$
CI (33.401 , 40.999)
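The arithmetic of this example can be recomputed from the stated inputs (n = 16, mean 37.2, s = 7.13, critical value 2.131). The variable names below are hypothetical.

```python
from math import sqrt

n = 16
xbar = 37.2      # sample mean, in μg/l
s = 7.13         # sample standard deviation, in μg/l
t_crit = 2.131   # table value t_{0.025, 15} as given in the example

margin = t_crit * s / sqrt(n)
lower, upper = xbar - margin, xbar + margin
print(f"{xbar} ± {margin:.3f} -> ({lower:.3f}, {upper:.3f})")
```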
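Interval Estimate
$$\bar{X} = \frac{1}{n}\sum_{i=1}^{n} X_i, \qquad s = \sqrt{\frac{\sum_{i=1}^{n} (X_i - \bar{X})^2}{n-1}}$$
Upper:
$$\bar{X} + t_{\alpha,\,n-1}\,\frac{s}{\sqrt{n}}$$
Lower:
$$\bar{X} - t_{\alpha,\,n-1}\,\frac{s}{\sqrt{n}}$$
[Figure: t density curves marking the cutoffs tα, n-1 (upper bound) and -tα, n-1 (lower bound).]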
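As an illustration of the one-sided bounds, the sketch below applies them to the aluminum example (n = 16, mean 37.2, s = 7.13). The slides do not work this case; the critical value t_{0.05, 15} = 1.753 is a standard table value supplied here as an assumption, and the function name is hypothetical.

```python
from math import sqrt

def one_sided_t_bounds(xbar, s, n, t_crit):
    """Return (lower bound, upper bound); each is a separate one-sided bound."""
    half = t_crit * s / sqrt(n)
    return xbar - half, xbar + half

# t_{0.05, 15} = 1.753 from a t table (assumption, not given in the slides).
lower_bound, upper_bound = one_sided_t_bounds(xbar=37.2, s=7.13, n=16, t_crit=1.753)
print(round(lower_bound, 3), round(upper_bound, 3))
```

Only one of the two bounds is reported in a given analysis; together they would form a 90% two-sided interval, not a 95% one.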
Summary of Interval Estimation
Procedures for a Population Mean
Can the population standard deviation σ be assumed known?
Yes: σ Known
No: σ Unknown
$$70.16 \pm 1.96\,\frac{9.7}{\sqrt{109}}$$
Assumptions
Population standard deviation is unknown
Population is normally distributed
If population is not normal, use large
sample
Use Student's t distribution
Confidence interval estimate
$$\bar{X} - t_{\alpha/2,\,n-1}\,\frac{S}{\sqrt{n}} \le \mu \le \bar{X} + t_{\alpha/2,\,n-1}\,\frac{S}{\sqrt{n}}$$
How to use SPSS
Exercise
29 20 26 21 25 24 16 27 20 33 28 16 16 19 27 24 28
24 26 24 29 20 23 27 25 16 25 22 30 24 25 27 23 30
24 19 23 20 30 24 28 30 17 21 22 28 27 32 22
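One way to check an answer to the exercise outside SPSS is sketched below in plain Python. It treats σ as unknown and builds a 95% two-sided t interval; that choice, the helper names `t_cdf` and `t_quantile`, and the numerical method (Simpson's rule plus bisection instead of a t table) are assumptions, not part of the slides.

```python
import math
from statistics import mean, stdev

data = [29, 20, 26, 21, 25, 24, 16, 27, 20, 33, 28, 16, 16, 19, 27, 24, 28,
        24, 26, 24, 29, 20, 23, 27, 25, 16, 25, 22, 30, 24, 25, 27, 23, 30,
        24, 19, 23, 20, 30, 24, 28, 30, 17, 21, 22, 28, 27, 32, 22]

def t_cdf(x, df, steps=2000):
    """CDF of Student's t via Simpson's rule on [0, |x|] and symmetry about 0."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    pdf = lambda u: c * (1 + u * u / df) ** (-(df + 1) / 2)
    a = abs(x)
    h = a / steps
    total = pdf(0.0) + pdf(a)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * pdf(i * h)
    area = total * h / 3
    return 0.5 + area if x >= 0 else 0.5 - area

def t_quantile(p, df):
    """Quantile by bisection; assumes 0.5 <= p < 1."""
    lo, hi = 0.0, 50.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if t_cdf(mid, df) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

n = len(data)
xbar = mean(data)
s = stdev(data)                      # n - 1 divisor
t_crit = t_quantile(0.975, n - 1)    # t_{0.025, n-1}
margin = t_crit * s / math.sqrt(n)
lower, upper = xbar - margin, xbar + margin
print(n, round(xbar, 3), round(s, 3), round(t_crit, 3))
print(round(lower, 3), round(upper, 3))
```

With 48 degrees of freedom the critical value is only slightly larger than 1.96, so the t and z intervals nearly coincide for this sample size.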