Spring 2019
Section W1
Lecture 9: Statistical Descriptions
Data Visualization
Summary
Measuring the Dispersion of Data
Boxplot: the ends of the box are the quartiles; the median is marked; whiskers are added, and outliers are plotted individually
Outlier: usually, a value more than 1.5 × IQR above Q3 or below Q1
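Written out as a formula, the usual rule reads as follows. This is a sketch of the common convention; the 1.5 multiplier is the customary default, not a fixed law, and the symbol x for a data value is introduced here for illustration.

```latex
\[
\mathrm{IQR} = Q_3 - Q_1, \qquad
x \text{ is an outlier if } \;
x < Q_1 - 1.5\,\mathrm{IQR}
\;\text{ or }\;
x > Q_3 + 1.5\,\mathrm{IQR}.
\]
```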
Boxplot
Data is represented with a box
The ends of the box are at the first and third quartiles, i.e., the height of the box is the IQR
The median is marked by a line within the box
Whiskers: two lines outside the box, extended to the minimum and maximum; when outliers are plotted individually, the whiskers stop at the most extreme values that are not outliers
Outliers: points beyond a specified outlier threshold, plotted individually
Boxplot in Matlab
>> d = [30, 36, 47, 50, 52, 52, 56, 60, 63, 70, 70, 110];
>> boxplot(d);
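The numbers behind this plot can be checked by hand. The sketch below recomputes the box, the outlier thresholds, and the whisker ends for the same vector, assuming the Statistics and Machine Learning Toolbox (already needed for boxplot) and its default quantile interpolation. The variable names are chosen here for illustration.

```matlab
% Sample values from the slide, written on one line.
d = [30, 36, 47, 50, 52, 52, 56, 60, 63, 70, 70, 110];

% Box: first quartile, median, third quartile (default quantile method).
q = quantile(d, [0.25 0.50 0.75]);
q1 = q(1);
med = q(2);
q3 = q(3);
iqrVal = q3 - q1;                 % height of the box

% Outlier thresholds at 1.5 x IQR beyond the ends of the box.
lowerFence = q1 - 1.5 * iqrVal;
upperFence = q3 + 1.5 * iqrVal;
outlierMask = d < lowerFence | d > upperFence;

% Whiskers reach the most extreme values that are not outliers.
whiskerLow = min(d(~outlierMask));
whiskerHigh = max(d(~outlierMask));

fprintf('Q1 = %.1f, median = %.1f, Q3 = %.1f, IQR = %.1f\n', q1, med, q3, iqrVal);
fprintf('Thresholds: %.1f and %.1f\n', lowerFence, upperFence);
fprintf('Whiskers: %d to %d\n', whiskerLow, whiskerHigh);
fprintf('Outliers: %s\n', mat2str(d(outlierMask)));
```

With this interpolation rule the box runs from 48.5 to 66.5 with the median at 54, the thresholds are 21.5 and 93.5, and 110 is the only point plotted individually. Other quartile conventions can give slightly different box ends.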
Properties of Normal Distribution Curve
Graphic Displays of Basic Statistical Descriptions
Histogram Analysis
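Histogram: a graph display of tabulated frequencies, shown as bars
It shows what proportion of cases fall into each of several categories
The categories are usually specified as non-overlapping intervals of some variable. The categories (bars) must be adjacent
[Figure: example histogram; horizontal axis from 10000 to 90000 in steps of 20000, vertical axis from 0 to 35 in steps of 5]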
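A minimal Matlab sketch of these points is given below. The data vector is hypothetical (the values behind the slide's figure are not available), and the interval width of 20000 is an assumption chosen here for illustration. It counts the cases in each adjacent, non-overlapping interval, converts the counts to proportions, and draws the bars.

```matlab
% Hypothetical values, not taken from the slides.
x = [12000 18000 25000 31000 33000 38000 45000 47000 52000 55000 ...
     58000 61000 64000 72000 88000];

% Adjacent, non-overlapping intervals: [0,20000), [20000,40000), ...
% (the last interval also includes its right edge).
edges = 0:20000:100000;

% Number of cases per interval and proportion of cases per interval.
counts = histcounts(x, edges);
proportions = counts / numel(x);

% Print one row per interval.
for k = 1:numel(counts)
    fprintf('%6d to %6d: %d cases (%.2f)\n', ...
        edges(k), edges(k + 1), counts(k), proportions(k));
end

% Draw the bars; neighbouring intervals share an edge, so the bars touch.
histogram(x, edges);
xlabel('Value');
ylabel('Number of cases');
```

Because every value falls into exactly one interval, the proportions add up to 1.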
Histograms Often Tell More than Boxplots
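One way to see this is sketched below, using two small hypothetical samples that are not from the slides. They are built to share the same minimum, Q1, median, Q3, and maximum, so their boxplots look the same, while the counts per interval differ. It assumes the Statistics and Machine Learning Toolbox for quantile.

```matlab
% Two hypothetical samples, not taken from the slides.
a = [0 2 2 5 5 8 8 10];   % values at the low end, the middle, and the high end
b = [0 2 2 2 8 8 8 10];   % two clusters, nothing near the median

% Five-number summary: minimum, Q1, median, Q3, maximum.
fiveNum = @(v) [min(v), quantile(v, [0.25 0.50 0.75]), max(v)];
summaryA = fiveNum(a);
summaryB = fiveNum(b);
sameSummary = max(abs(summaryA - summaryB)) < 1e-12;

% Counts over the same adjacent intervals of width 2.
edges = 0:2:10;
countsA = histcounts(a, edges);
countsB = histcounts(b, edges);

fprintf('Same five-number summary: %d\n', sameSummary);
fprintf('Counts for a: %s\n', mat2str(countsA));
fprintf('Counts for b: %s\n', mat2str(countsB));

% Side-by-side histograms make the difference visible.
subplot(1, 2, 1); histogram(a, edges); title('Sample a');
subplot(1, 2, 2); histogram(b, edges); title('Sample b');
```

Both samples summarize to 0, 2, 5, 8, 10, yet sample b has no values in the middle interval where sample a has two.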