Professional Documents
Culture Documents
1
Outline of the Training
Basic Concepts in Statistics
Introduction to the Windows of SPSS
Data Entry, Loading and Saving
Data Management
Descriptive Data Analysis
Correlation Analysis
Regression Analysis
◦ Linear regression
◦ Binary logistic regression
◦ Multinomial logistic regression
◦ Ordered logistic regression
Hypothesis Testing
Factor Analysis
Trainer: Hailegebriel Yirdaw 2
Trainer: Hailegebriel Yirdaw 3
Definition and Concept of Statistics
Definition
Statistics is the science of data analysis.
It deals with scientific methods of
◦ collecting,
◦ organizing,
◦ Presenting,
◦ analyzing data
so as to make valid conclusions and
reasonable decisions
Trainer: Hailegebriel
8 Yirdaw 8
Descriptive and Inferential Statistics
Descriptive statistics -statistics used to
summarize or describe a set of observations.
Inferential statistics consists of generalizing
from samples to populations, performing
hypothesis testing, determining
relationships among variables, and making
predictions.
Scale Categorical
Continuous
Discrete: Ordinal: Nominal:
Measurements
Counts/ integers obvious order no meaningful order
takes any value
Q2: Gender:
Male Female
Community groups:These
organizations need information about a
wide variety of subjects
Individuals: Everyone, from students to
pensioners, needs some form of information
at some time during their lives. The
information may be used to complete an
essay, a major project or simply to satisfy
one's curiosity.
Systematic sampling
Cluster sampling
Activity 4. (5 minutes)
Which method(s) of data collection is/are
appropriate to use in your project work? Why?
Statistical Estimation
Point estimate -- the single value of a statistic
calculated from a sample
Interval Estimate -- a range of values calculated
from a sample statistic(s) and standardized statistics,
such as the Z.
◦ Selection of the standardized statistic is determined
by the sampling distribution.
◦ Selection of critical values of the standardized
statistic is determined by the desired level of
confidence.
Point estimate X=
X
n
Interval σ
X ±Z
Estimate n
or
σ σ
X −Z ≤µ ≤ X +Z
n n
α α
2 2
1−α
µ X
Z
− Zα 0 Zα
2 2
α α
2 2
1−α 1−α
2 2
µ X
Z
− Zα 0 Zα
2 2
σ σ
Pr ob[ X − Z α ≤µ≤ X +Zα ] = 1−α
2 n 2 n
0.025 0.025
95%
0.4750 0.4750
µ X
Z
-1.96 0 1.96
σ σ
X −Z ≤µ≤ X +Z
n n
11. 11
.
4.26 − 196
. ≤ µ ≤ 4.26 + 196
.
60 60
4.26 − 0.28 ≤ µ ≤ 4.26 + 0.28
3.98 ≤ µ ≤ 4.54
Trainer: Hailegebriel Yirdaw 54
Confidence Interval to Estimate µ
when n is Large and σ is Unknown
S
X ± Zα2
n
or
S S
X − Zα ≤ µ ≤ X + Zα
2
n n 2
90% 1.645
95% 1.96
98% 2.33
99% 2.575
Z formula X −µ
Z=
σ
n
E = X −µ
Error of Estimation
(tolerable error)
ασ
2
ασ
2 2
Z Z
Estimated Sample Size n = 2 2 = 2
E E
σ
2
is the variance of an attribute in the population.
E is the desired level of precision
Trainer: Hailegebriel Yirdaw 76
Use of software in sample size determination
Depending on type of study and specific software
Some information will be required:
Population sample size, population standard
deviation, population sampling error, confidence
level, z –value, power of study etc …
80% power in a clinical trial means that the
study has a 80% chance of ending up with a p
value of less than 5% in a statistical test (i.e. a
statistically significant treatment effect) if there
really was an important difference (e.g. 10% versus
5% mortality) between treatments.
Ni
ni = n i =1, 2, 3.
N
The End