Professional Documents
Culture Documents
Introduction
to STATISTICS
STATISTICS
It is a branch of mathematics
that deals with the collection,
organization, presentation,
analysis and interpretation of
data.
• Collection refers to the process of gathering
data or information.
Collection
• Organization and Presentation refers to
summarizing data into tables, graphs, or
A pound of pears costs $1.30. Tony
charts.
bought a pound and a half of pears. If
he gave the clerk a $5.00 bill, how
• Analysis
much changerefers to describing the data using
will he get back?
sample data.
Application of Statistics
Sample
A subcollection of elements drawn from a population.
Census
A collection of data from every element in a population.
Classifications of Data
Data is the information we gather about the sample or the
population.
Qualitative Quantitative
Data Data
refers to the attributes or refers to the numerical information
characteristics of the gathered about the samples. It can
be subjected to the arithmetic
samples. operations.
2 Types Quantitative of Data
• Range
• Mean • Mean Deviation
• Median • Variance
Standard
• Mode •
Deviation
Choosing Appropriate Statistical
Measures
• For qualitative data, the mode is the only one
of the three that makes sense.
• For quantitative data, mean or median is
often preferred over the mode as a measure
of center because the value that occurs most
frequently may not be located near the center
of the data set.
Measures of Central Tendency
Mean
It is the most reliable and is commonly used
when the data is symmetrical, meaning, there is no
extreme high or low in the numbers, or when the
data do not contain outliers. Outliers are values that
don’t fit the rest of the data or values that are far
away from other data values. Therefore, use the
mean when the range is not great.
Measures of Central Tendency
Median
It is commonly used when the data is
asymmetrical, meaning, there is an extreme
high or low in numbers, or when the data
contains outliers. It is not affected by the values
of data but by the number of data. This works for
ordinal, interval, and ratio data. Therefore, use
the median when the range is great.
Measures of Central Tendency
Mode
It is commonly used when the data are not
numbers. It is referred to as the most frequent value
in a given set of data. This is the only appropriate
measure for nominal data, but still, it can also be
used for ordinal, interval and ratio data. Mode,
however, is unstable. Therefore, use the mode when
you want the most frequent item or number in a list
of data.
Measures of Variability