Professional Documents
Culture Documents
LECTURE 2
DESCRIPTIVE STATISTICS II
OUTLINE
DESCRIPTIVE STATISTICS
Descriptive Statistics
Graphical Numerical
Methods Methods
2 categorical
variables
Crosstabulation
table Clustered Bar Stacked
Chart bar chart
CROSSTABULATION TABLE
Stocks 46 55 27 128
Bonds 32 44 19 95
Cash 15 20 33 68
ACTIVITY
2 numerical
variables
Scatter plot
SCATTER PLOT
EXAMPLE
Average SAT scores by
state: 1998
Verbal Math
Alabama 562 558
Alaska 521 520
Arizona 525 528
Arkansas 568 555
California 497 516
Colorado 537 542
Connecticu
t 510 509
Delaware 501 493
D.C. 488 476
Florida 500 501
Georgia 486 482
Hawaii 483 513
ANALYZING SCATTER PLOT
Look for:
Overall pattern
Form
Direction
Strength
possible clusters/groups
possible outliers
Q: What is an outlier?
DESCRIPTIVE STATISTICS
Descriptive Statistics
Graphical Numerical
Methods Methods
Relative standing
Central Tendency Variation
Percentile
Arithmetic Mean Range
Median Interquartile Range
Mode Variance
Standard Deviation
MEASURES OF CENTRAL
TENDENCY
ARITHMETIC MEAN
MEAN EXAMPLE
Raw Data:10.3 4.9 8.9 11.7 6.3 7.7
MEDIAN EXAMPLE
ODD-SIZED SAMPLE
Raw Data: 24.1 22.6 21.5 23.7 22.6
Ordered: 21.5 22.6 22.6 23.7 24.1
Position: 1 2 3 4 5
MEDIAN EXAMPLE
EVEN-SIZED SAMPLE
MODE EXAMPLE
No Mode
Raw Data: 10.3 4.9 8.9 11.7 6.3 7.7
One Mode
Raw Data: 6.3 4.9 8.9 6.3 4.9 4.9
More Than 1 Mode
Raw Data: 21 28 28 41 43 43
RANGE
7 8 9 10 7 8 9 10
VARIANCE &
STANDARD DEVIATION
X = 8.3
4 6 8 10 12
Variance
STANDARD DEVIATION
EXAMPLE
6 8 10 12 14 9 11 7 13 11
Calculate the sample variance and standard
deviation
VARIANCE AND SD
CHEBYSHEV’S RULE
EXAMPLE
EXAMPLE
EMPIRICAL RULE
EXAMPLE
Consider a very large number of students
taking a college entrance exam such as the
SAT.
Suppose that the distribution of SAT score is
bell-shaped, the mean score on the mathematics
section of the SAT is 550 with a standard
deviation of 50.
Measures of relative standing
PERCENTILE
QUARTILES
BOX PLOT
• How to construct
• How to represent outliers
• Use a boxplot to assess and compare the
shape, central tendency, and variability of
distributions and to look for potential
outliers.
• Sample size: n at least 20
Source: https://www.slideshare.net/mido02/chap-3gbu
CONCLUSION