Professional Documents
Culture Documents
A Course Presentation in
STATISTICS
INTRODUCTION TO
STATISTICS AND
STATISTICAL
INFERENCE
Definition of Statistics
Session 1.3
TEACHING BASIC STATISTICS ….
Areas of Statistics
Session 1.4
TEACHING BASIC STATISTICS ….
Session 1.5
TEACHING BASIC STATISTICS ….
Based on the results, it was concluded that the new milk formulation is
effective in improving the psychomotor development of infants.
Session 1.6
TEACHING BASIC STATISTICS ….
Inferential Statistics
Larger Set
(N units/observations) Smaller Set
(n units/observations)
Inferences and
Generalizations
Session 1.7
TEACHING BASIC STATISTICS ….
Key Definitions
Session 1.8
TEACHING BASIC STATISTICS ….
Types of Variables
Session 1.9
TEACHING BASIC STATISTICS ….
Levels of Measurement
1. Nominal
Numbers or symbols used to classify units
into distinct categories
2. Ordinal scale
Accounts for order; no indication of distance
between positions
3. Interval scale
Equal intervals (fixed unit of measurement);
no absolute zero
4. Ratio scale
Has absolute zero
Session 1.10
TEACHING BASIC STATISTICS ….
Objective Method
Subjective Method
Session 1.11
TEACHING BASIC STATISTICS ….
Textual
Tabular
Graphical
Session 1.12
BASIC CONCEPTS IN
SAMPLING AND
SAMPLING TECHNIQUES
INFERENTIAL STATISTICS
Sample
Sampling Data
Process
Inferences/Generalization
(Subject to Uncertainty)
Universe
Session 3.14
TEACHING BASIC STATISTICS ….
Session 3.15
TEACHING BASIC STATISTICS ….
1. Probability sample
2. Non-probability sample
Session 3.16
TEACHING BASIC STATISTICS ….
Session 3.17
TEACHING BASIC STATISTICS ….
Session 3.18
TEACHING BASIC STATISTICS ….
Session 3.19
TEACHING BASIC STATISTICS ….
ILLUSTRATION
B B
C
D
Session 3.20
Determining Adequate
Sample Size
Sampling Formula
(Slovin’s)
N
n = -----------
1 + e 2N
Surgery 800 94
Medical 500 59
Obygyney 700 82
SYSTEMATIC SAMPLING
Adopts a skipping pattern in the selection
of sample units
Gives a better cross-section if the listing is
linear in trend but has high risk of bias if
there is periodicity in the listing of units in
the sampling frame
Allows the simultaneous listing and
selection of samples in one operation
Session 3.25
TEACHING BASIC STATISTICS ….
ILLUSTRATION
Systematic
Sample
CLUSTER SAMPLING
It considers a universe divided into N
mutually exclusive sub-groups called
clusters.
A random sample of n clusters is selected
and their elements are completely
Slide No. 3.19
enumerated.
It has simpler frame requirements.
It is administratively convenient to
implement. Slide No. 3.11
Session 3.27
TEACHING BASIC STATISTICS ….
ILLUSTRATION
Population
Cluster Sample
A PRIMARY SAMPLING
UNIT
Session 3.29
TEACHING BASIC STATISTICS ….
A SECONDARY
SAMPLING UNIT
SAMPLE
Session 3.30
Types Non - Probability Sampling
31
Purposive/judgemental sampling –sometimes called
deliberate sampling.The researcher relies on his
judgment as the criterion for the selection and does not
use the rules governing sampling techniques .
Example. To investigate the history of a certain place.
To investigate the effectiveness of a certain
shampoo.
32
Quota sampling – used for infinite population frames and
therefore, the researcher cannot get a random sample.
Like purposive sample, this is not a representative
sample.
Example: To investigate the DIP learning in SPUP. The
researcher will select say 50 participants in each
school/department as a quota.
33
TEACHING BASIC STATISTICS ….
Summary Measures
Maximum
Percentile
Quartile
Kurtosis
Decile Range
Minimum Coefficient of
Median
Variance Variation
Central Interquartile
Tendency Range
Standard Deviation
Mean Median Mode
Session 1.34
TEACHING BASIC STATISTICS ….
Session 1.35
TEACHING BASIC STATISTICS ….
Mean
X i
X1 X 2 X N
Population Mean: i 1
N N
n
x i
x1 x2 xn
Sample Mean: x i 1
n n
Session 1.36
Mean:
Fast-food meal A B C D E
85+44+27+24+1 181
Mean = ----------------------- = -------- = 3.62 (Strongly Agree)
50 50
Table of Interpretation
(5 pt. Likert Scale)
4.20 – 5.00 Very Strongly Agree
3.40 – 4.19 Strongly Agree
2.60 – 3.39 Agree
1.80 – 2.59 Disagree
1.00 – 1.79 Strongly Disagree
TEACHING BASIC STATISTICS ….
Session 1.46
TEACHING BASIC STATISTICS ….
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 12 14
Mean = 5
Mean = 6
Session 1.47
TEACHING BASIC STATISTICS ….
Median
Session 1.48
Median for Odd Sample
Odd???
The array for the data A is :
What is
even?
The following are samples scores
obtained from a 75 item summative test:
(n= 12) 48, 53, 63, 65, 45, 47, 52, 48,
63, 54, 63, 53
Array : 45, 47, 48, 48, 52, 53, 54, 55, 63, 63, 63, 65
• Since n = 12 (even).
• Median = [ 6 th + 7th /2]
s s
Median = 5
Mode
occurs most frequently
nominal average
may or may not exist
0 1 2 3 4 5 6
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
No Mode
Mode = 9
Session 1.54
A set of data is said to be …
Unimodal or monomodal
if it has only one mode.
Example: 33, 35, 35, 38,
40, 46
Its mode is 35.
A set of data is said to be …
Properties of a Mode
Session 1.58
TEACHING BASIC STATISTICS ….
Properties of a Mode
Session 1.59
TEACHING BASIC STATISTICS ….
Session 1.60
TEACHING BASIC STATISTICS ….
Session 1.61
TEACHING BASIC STATISTICS ….
Session 1.62
TEACHING BASIC STATISTICS ….
Measures of Location
Session 1.63
TEACHING BASIC STATISTICS ….
Session 1.64
TEACHING BASIC STATISTICS ….
Percentiles
Session 1.65
TEACHING BASIC STATISTICS ….
EXAMPLE
Session 1.66
TEACHING BASIC STATISTICS ….
Deciles
Session 1.67
TEACHING BASIC STATISTICS ….
Quartiles
Session 1.68
TEACHING BASIC STATISTICS ….
Measures of Variation
A measure of variation is a
single value that is used to
describe the spread of the
distribution
A measure of central tendency
alone does not uniquely
describe a distribution
Session 1.69
TEACHING BASIC STATISTICS ….
A look at dispersion…
Data A
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21
s = 3.338
Data B
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 s = .9258
Session 1.70
TEACHING BASIC STATISTICS ….
Session 1.71
TEACHING BASIC STATISTICS ….
Range (R)
The difference between the maximum and
minimum value in a data set, i.e.
R = MAX – MIN
Example: Pulse rates of 15 male residents of a
certain village
54 58 58 60 62 65 66 71
74 75 77 78 80 82 85
R = 85 - 54 = 31
Session 1.72
TEACHING BASIC STATISTICS ….
Session 1.73
TEACHING BASIC STATISTICS ….
54 58 58 60 62 65 66 71
74 75 77 78 80 82 85
IQR = 78 - 60 = 18
Session 1.74
TEACHING BASIC STATISTICS ….
Session 1.75
TEACHING BASIC STATISTICS ….
Variance
important measure of variation
shows variation about the mean
N
(X i )2
Population variance 2 i 1
N
s2 i 1
n 1
Session 1.76
TEACHING BASIC STATISTICS ….
(X i )2
Population SD i 1
N
(x x)i
2
Sample SD s i 1
n 1
Session 1.77
TEACHING BASIC STATISTICS ….
(Sample) Data: 10 12 14 15 17 18 24
(10 16) 2 (12 16) 2 (14 16) 2 (15 16) 2 (17 16) 2 (18 16) 2 ( 24 16) 2
s
7
4.309
Session 1.78
TEACHING BASIC STATISTICS ….
Session 1.79
TEACHING BASIC STATISTICS ….
Data A
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 s = 3.338
Data B
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 s = .9258
Data C
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21
s = 4.57
Session 1.80
TEACHING BASIC STATISTICS ….
Mean = 65
S =0
5”
65 “ 65 “ 65 “ 65 “ 65 “
Session 1.81
TEACHING BASIC STATISTICS ….
Mean = 65”
s = 4.0”
62 “ 67 “ 66 “ 70 “ 60 “
Session 1.82
TEACHING BASIC STATISTICS ….
Session 1.83
TEACHING BASIC STATISTICS ….
Chebyshev’s Rule
Chebyshev’s Rule
Session 1.85
TEACHING BASIC STATISTICS ….
Illustration
At least 75%
Session 1.86
TEACHING BASIC STATISTICS ….
Example
The midterm exam scores of 100 STAT 1 students
last semester had a mean of 65 and a standard
deviation of 8 points.
Applying the Chebyshev’s Rule, we can say that:
1. At least 75% of the students had scores
between 49 and 81.
2. At least 88.9% of the students had scores
between 41 and 89.
Session 1.87
TEACHING BASIC STATISTICS ….
Session 1.88
TEACHING BASIC STATISTICS ….
Comparing CVs
SD = P5
CV = 5%
Session 1.89
TEACHING BASIC STATISTICS ….
Measure of Skewness
3Mean Median
SK
SD
Session 1.90
TEACHING BASIC STATISTICS ….
What is Symmetry?
A distribution is said to be
symmetric about the mean,
if the distribution to the left
of mean is the “mirror
image” of the distribution to
the right of the mean.
Likewise, a symmetric
distribution has SK=0 since
its mean is equal to its
median and its mode.
Session 1.91
TEACHING BASIC STATISTICS ….
Measure of Skewness
SK > 0
positively
skewed
SK < 0
negatively skewed
Session 1.92
TEACHING BASIC STATISTICS ….
Measure of Kurtosis
Describes the extent of peakedness or
flatness of the distribution of the data.
Measured by coefficient of kurtosis (K)
computed as,
N
X
4
i
K i 1
4
3
N
Session 1.93
TEACHING BASIC STATISTICS ….
Measure of Kurtosis
K=0
mesokurtic
K>0 K<0
leptokurtic platykurtic
Session 1.94