Professional Documents
Culture Documents
Lecture 1: Course
Introduction, Review and
Paired-Samples t-test
Outline
• Course introduction
• Paired-samples t-test
Course Introduction
Business and Economics Statistics
1
9/3/2021
Course Introduction:
Objectives
This course aims to:
• Introduce a number of statistical techniques (Inference about
variances, Chi-squared tests, ANOVA, Non-parametric techniques,
etc.)
• Discuss the applications of these techniques in practical situations.
• Enable students to work on a small-scale case study to gain
valuable experience on data analysis.
• Provide practical sessions where students can practice with the R
statistical software
Course Introduction:
Assessment
• Attendance: 10%
• Case Study Analysis and Presentation: 20% (Wk 6 – Intro -
tentative)
• Midterm Test: 20% (Wk 8 – In lecture - tentative)
• Final Exam: 50%
Read your course outline carefully. It contains important
information regarding the course schedule, assessment, etc.
Course Introduction:
Study Materials
• Study materials:
• Lecture notes
• Required readings (in textbook and reference book; note that you
should print only the chapters specified in the required readings that
will be posted on Google class)
• Tutorial/lab exercises, etc.
2
9/3/2021
Course Introduction:
Recommended Study Strategies
• Discuss lecture notes with your peers. Review your lecture
notes weekly
• Attempt all the tutorial/lab exercises before class
• Participate actively in class: present solutions, ask questions,
contribute ideas, take notes, etc.
• Read the assigned readings in textbook and reference book
• Work actively on your case study and the labs
• Do further reading if time permits
7
Notation
Mean
Proportion ̅
Variance
Standard Deviation
Size
3
9/3/2021
10
Mean Median
1. Most common measure 1. Middle value in ordered
2. Affected by extreme values sequence
• If Odd n, middle value of sequence
(‘Outliers’)
• If Even n, average of 2 middle values
3. Formula for sample mean 2. Not affected by extreme values
∑
X= 3. Position of median (for sample)
4. Example Positioning point =
10.3 4.9 8.9 11.7 6.3 7.7 4. Example
X =? 10.3 4.9 8.9 11.7 6.3 7.7
Median = ?
11
Measures of Variation:
Variance & Standard Deviation
Characteristics Formula
1. Most common measures ∑
=
2. Involve all values in sample 1
(or population) n - 1 in denominator! (Use N if Population
3. Show variation about mean ( Variance)
or µ) =
4. Affected by outliers Note the notation and formula of population
5. Advantage of the SD over the variance and standard deviation.
variance? Example 5, 7, 10, 15, 18
Mean = ? Standard deviation = ?
12
4
9/3/2021
Statistical Inference
13
Interval Estimation
1. Provides range of values
2. Gives information about closeness to unknown population
parameter
• Stated in terms of probability
• Knowing exact closeness requires knowing unknown population parameter
14
Sample statistic
(point estimate)
Confidence interval
5
9/3/2021
Interval Estimation:
Confidence Level α
1. Probability that the interval contains unknown
population parameter
2. Denoted 1 - α
• α Is probability that interval does not contain parameter
16
Hypothesis Testing:
Definition & Procedure
A hypothesis test allows us to draw conclusions or make decisions regarding
population data from sample data. There are 6 steps:
1. Hypotheses (Ho, Ha) -> Right-tailed, Left-tailed, Two-tailed tests
2. Test statistic
3. Level of significance α
4. Decision rule
• Reject Ho when ...
5. Value of the test statistic
6. Conclusion
• Do not reject Ho: The test is not significant.
• Reject Ho: The test is significant.
17
Hypothesis Testing:
What Is a Hypothesis?
• Usually a statement about population parameters
18
6
9/3/2021
19
Hypothesis Testing:
Identifying Hypotheses Steps
1. Example Problem: Test that the population mean is not 3
2. Steps
• State the question statistically (µ ≠ 3)
• State the opposite statistically (µ = 3)
Must be mutually exclusive & exhaustive
Hypothesis Testing:
Definition & Procedure
A hypothesis test allows us to draw conclusions or make decisions regarding
population data from sample data. There are 6 steps:
1. Hypotheses (Ho, Ha) -> Right-tailed, Left-tailed, Two-tailed tests
2. Test statistic
3. Level of significance α
4. Decision rule
• Reject Ho when ...
5. Value of the test statistic
6. Conclusion
• Do not reject Ho: The test is not significant.
• Reject Ho: The test is significant.
21
7
9/3/2021
Hypothesis Testing:
p-Value
1. Probability of obtaining a Test Statistic more extreme (≥ or ≤
than actual sample statistic (computed from the sample data)
given H0 is true.
2. Used to make rejection decision
• If p-Value < α, Reject H0
• If p-Value ≥ α, Do not reject H0
22
Hypothesis Testing:
Possible Errors in Testing
• Probability of Type I error: alpha
• Significance level
• Probability of Type II error: beta
23
Hypothesis Testing:
Common Pitfalls
Choose or change hypotheses after looking at data
24
8
9/3/2021
Paired-Samples t Test
Paired Difference Experiments
1. Paired Samples
• Matched pairs
• Repeated measures (Before/After)
2. Eliminates Variation Among Subjects
26
9
9/3/2021
Problem:
Do older people exercise significantly less frequently
than younger people?
28
Paired-Samples t Test:
Hypotheses
29
Paired-Samples t Test:
Data Collection Table
Observation Group 1 Group 2 Differences
1 = -
2 = -
⋮ ⋮ ⋮ ⋮
i = −
⋮ ⋮ ⋮ ⋮
n = −
30
10
9/3/2021
Paired-Samples t Test:
Assumptions
Assumptions
• The population of differences is normal.
• The differences are randomly selected from the population
of differences.
31
Paired-Samples t Test:
Test Statistic
Test statistic:
!
= #$ = -1
"
Paired-Samples t Test:
Example
You work in Human Resources. You want to see if a training
program is effective. You collect the following test score data:
11
9/3/2021
Paired-Samples t Test:
Computation Table
Observation Before (1) After (2) Difference
Sam 85 94 -9
Tamika 94 87 7
Brian 78 79 -1
Mike 87 88 -1
Total -4
34
Paired-Samples t Test:
Null Hypothesis
1. Was the training effective?
2. Effective means ‘After’ > ‘Before’.
3. Statistically, this means µ2 > µ1.
4. Rearranging terms gives 0 > µ1 - µ2.
5. Defining µD = µ1 - µ2 & substituting into (4) gives 0 > µD or µD < 0.
6. The alternative hypothesis is Ha: µD < 0.
35
Paired-Samples t Test:
Solution
Test Statistic:
•Ho: µD = 0 (µD = µ1 - µ2)
•Ha: µD < 0 %& ' ( ' '!
= )& = +.-. = -.306
•α = .10 *& /
•df = 4 - 1 = 3
•Critical Value(s): Decision:
Do not reject Ho at α = .10
Reject
Conclusion:
.10
Test is not significant and there is not
enough evidence to conclude that the
-1.6377 0 t training was effective
Limitation: Sample size too small
12
9/3/2021
Conclusion
• Course Introduction
• Paired-samples t-test
37
13