You are on page 1of 12

AP STATISTICS

CLUSTERS, GAPS , PEAKS, & 11b A.Munkhjargal

OUTLIERS
CLUSTER, GAPS, PEAKS, &
OUTLIERS
To understand or to use clusters, gaps, peaks, and outliers, we must first know what
dot plot is. As stated by Wikipedia, Dot plot is a representation of a distribution
consists of group of data plotted on a simple scale.
DOT PLOT
DOT PLOT USES
Dot plot’s ease of use makes it efficient for small or medium amounts of data.
It is primarily used for certain surveys including small groups of people.
It is also used for comparing small amounts of data.
HISTORY OF DOT PLOT
The dot plot was first used to depict distributions in 1884(pre-computer era).
However there are 2 types of dot plots. These include:
1) William S. Cleveland’s dot plot
2) The dot plot that dates before the computer era.
These 2 dot plots are very alike, but if you see closely, you will see their differences.
DOT PLOT COMPARISON
The main difference between Cleveland’s dot plot and the dot plot is that while
Cleveland’s dot plot is a alternative or a substitute for pie charts and other types of
charts, the dot plot is (as a representation of distribution) a graph showing group of
data plotted on a simple scale. The dot plot highlights the clusters, gaps, outliers, and
peaks. To show you the difference here are 2 good examples.
COMPARISON
Cleveland’s dot plot The dot plot
WILLIAM S. CLEVELAND II
He is a American computer scientist and Professor of Statistics and Professor of
Computer Science at Purdue University. He was born in 1943. His research interests
are in the fields of "data visualization, computer networking, machine learning, data
mining, time series, statistical modeling, visual perception, environmental science,
and seasonal adjustment.
He created the dot plot that replaces bar charts, pie plots and other plots. His dot plot
is used by the U.S. Federal Reserve to convey its benchmark Federal Fund interest
rate outlook at certain Federal Open Market Committee (FOMC) meetings.
SHORT DEFINITIONS
Cluster- forms when there are
many data points in a small
interval.
Gaps- large open space between
some data points.
Peaks- highest value in the
distribution
Outliers- minor data points that
As you can see, the 2nd from the left cluster is a peak and
from the values 6 to 10 are the gaps. exists far from the majority of
the points.
APPLICATIONS IN EVERYDAY
LIFE
•Due to its ease of use, it is primarily used for analyzing small amounts of data
and/or displaying certain data to the crowd.
•The gaps and clusters can also represent errors in the data collection
methodology.
•It is also used by many scientists to either display information or to organize
and analyze data.
APPLICATIONS IN EVERYDAY
LIFE,
REAL EXAMPLES
For example:
A telephone survey polls only certain area codes, such as low-income housing
complexes or high-end suburban residential areas, and not a broad cross-section of
the population, so the chances are there will be gaps and clusters in the data. Outliers
can skew the mean or average value of a data set.
HOW TO USE DOT PLOT
(CLUSTER, GAPS, PEAKS, &
OUTLIERS)
Using cluster, gaps, peaks, and outliers is easy. If you put it in order, it would be:
1) Analyzing and grouping the data
2) Drawing a single x-plane graph (just a line)
3) Evaluating the values and putting them in order. (stack the same values up)
4) Identifying the outliers and peaks.

You might also like