Professional Documents
Culture Documents
• A road map
• Examining data through frequency
distributions
• Measures of central tendency
• Measures of variability
• The normal curve
• Standard scores and the standard normal
distribution
The Role of Description
• Description as a purpose of research
• Choosing the right statistical procedures
Raw Data: Overachievement Study
Frequency Distributions
MAJOR
Valid Cum
Value Label Value Frequency Percent Percent Percent
PHYSICS 1.00 5 12.5 12.5 12.5
CHEMISTRY 2.00 4 10.0 10.0 22.5
BIOLOGY 3.00 7 17.5 17.5 40.0
ENGINEERING 4.00 5 12.5 12.5 52.5
ANTHROPOLOGY 5.00 5 12.5 12.5 65.0
SOCIOLOGY 6.00 4 10.0 10.0 75.0
ENGLISH 7.00 7 17.5 17.5 92.5
DESIGN 8.00 3 7.5 7.5 100.0
------- ------- -------
Total 40 100.0 100.0
Valid cases 40 Missing cases 0
Frequency Distribution: Major Group
MAJORGRP
Valid Cum
Value Label Value Frequency Percent Percent
SCIENCE & ENGINEERIN 1.00 21 52.5 52.5 52.5
SOCIAL SCIENCE 2.00 9 22.5 22.5 75.0
HUMANITIES 3.00 10 25.0 25.0 100.0
------- ------- -------
Total 40 100.0 100.0
Frequency Distribution: SAT
SAT
Valid Cum
Value Frequency Percent Percent
1000.00 2 5.0 5.0 5.0
1025.00 1 2.5 2.5 7.5
1050.00 2 5.0 5.0 12.5
1060.00 1 2.5 2.5 15.0
1075.00 1 2.5 2.5 17.5
1080.00 1 2.5 2.5 20.0
1085.00 1 2.5 2.5 22.5
1090.00 2 5.0 5.0 27.5
1100.00 7 17.5 17.5 45.0
1120.00 2 5.0 5.0 50.0
1125.00 3 7.5 7.5 57.5
1130.00 1 2.5 2.5 60.0
1150.00 5 12.5 12.5 72.5
1160.00 2 5.0 5.0 77.5
1175.00 3 7.5 7.5 85.0
1185.00 1 2.5 2.5 87.5
1200.00 5 12.5 12.5 100.0
------- ------- -------
Total 40 100.0 100.0
Valid cases 40 Missing cases 0
Grouped Frequency Distribution:
SAT
Graphical Frequency
Distributions
• Bar Graphs
• Histograms
• Stem and Leaf
• Frequency Polygons
• Pie Chart
Graphical
Graphical Frequency
Frequency
Distributions:
Distributions:
Single-Variable
Single-Variable (“Univariate”)
(“Univariate”)
Bar Chart: Major
Bar Chart
F 8
r 7
e
q 6
u
e 5
n
c 4
y
3
1
0
PHYSICS BIOLOGY ANTHROPOLOGY ENGLISH
CHEMISTRY ENGINEERING SOCIOLOGY DESIGN
MAJOR
Histogram: SAT
(From Grouped Data)
Frequency Polygon Overlay: SAT
(From Grouped Data)
Frequency Polygon: SAT
(From Grouped Data)
Frequency Polygon: SAT Scores
(From Ungrouped Data)
SAT
Cumulative Frequency Polygon: SAT
Scores
C 50
u
m
u 40
l
a
t 30
i
v
e
20
F
r
e 10
q
u
e 0
n 1000.00 1050.00 1075.00 1085.00 1100.00 1125.00 1150.00 1175.00 1200.00
c 1025.00 1060.00 1080.00 1090.00 1120.00 1130.00 1160.00 1185.00
y
SAT
Stem and Leaf: SAT
Stem and Leaf: SAT
SAT Stem-and-Leaf Plot
3.00 10 . 002
8.00 10 . 55678899
13.00 11 . 0000000222223
11.00 11 . 55555667778
5.00 12 . 00000
MAJORGRP
10 SCIENCE & ENGINEERIN
SOCIAL SCIENCE
0 HUMANITIES
2.00 2.50 2.80 3.00 3.20 3.40 3.60
2.30 2.70 2.90 3.10 3.30 3.50
GPA
Relative Frequency Polygon: GPA
Comparison of Gender
P 30
e
r
c
e
n
t
20
10
SEX
MALE
0 FEMALE
2.00 2.30 2.50 2.70 2.80 2.90 3.00 3.10 3.20 3.30 3.40 3.50 3.60
GPA
What Can Be Seen in Frequency
Distributions
• Shape
• Central Tendency
• Variability
Shapes of Frequency Polygons
Shapes of Distributions
Very Peaked in the Center Peak Just Like Flat in the Center
Compared to the Compared to
Normal Distribution Normal Distribution Normal Distribution
• Range:
– Difference between highest and lowest score
• Inter-quartile Range:
– The spread of the middle 50% of the scores
– The difference between the top 25% (Upper Quartile-Q3) and the lower
25% (Lower Quartile-Q1)
• Standard Deviation:
– The average dispersion or deviation of scores around the mean (measured
in original score units)
• Variance:
– The average variability of scores (measured in squared units of the
original scores (square of the standard deviation)
Range, Interquartile Range, and Standard
Deviation: SAT Scores by Area
• Bell-shaped (unimodal)
• Symmetric about the mean
• Mode, median, and mean are equal (though
rarely occurs)
• Asymptotic (curve never touches the
abscissa)
Normal Curve
Areas Under the Curve
.3413 .3413
.1359 .1359
68%
.0214 .0214
95%
.0013 .0013
99%
-3s -2s -1s +1s +2s +3s
X
Definitions: Standard Scores
• Standard Scores: scores expressed as SD
away from the mean (z-scores)
• Obtained by finding how far a score is above
or below the mean and dividing that
difference by the SD
• Changes mean to 0 and SD to 1, but does not
change the shape (called Standard Normal
Distribution)
Uses of Standard Normal
Distribution
• What proportion of scores falls between the mean
and a given raw score
• What proportion of scores falls above or below a
given raw score
• What proportion of scores falls between two raw
scores
• What raw score fall above (or below) a certain
percentage of scores