You are on page 1of 84

Lesson 2.1 Lesson 2.

3
Review on Graphs
Descriptive and
Statistics Charts

Lesson 2.4
Lesson 2.2
Review on
Inferential
Descriptive

Statistics in Excel
Statistics
STATISTICS
Mathematics in the Modern World: Data Management: MTH 101: Unit II
Review on Descriptive Statistics
STATISTICS

What is the importance of statistics in people’s

life?❑to be able to effectively conduct research

❑to be able to read and evaluate journal articles ❑to


further develop critical thinking and analytic skills
❑to be an informed consumer
❑to know when you need to hire outside statistical help
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:

STATISTICS
Statistics is basically a science that involves data collection, data
interpretation and finally, data validation.
Statistical Data Analysis is a procedure of performing various
statistical operations. It is a kind of quantitative research, which
seeks to quantify the data, and typically, applies some form of
statistical analysis.
Statistical Data Analysis generally involves some form of
statistical tools, which a layman cannot perform without having
any statistical knowledge.
Review on Descriptive Statistics
MTH 101: Unit II
Mathematics in the Modern World: Data Management:

Branches of Statistics

• focuses on collecting, organizing, summarizing,


and presenting the data.
• allow us to make conclusions
• help us compress data and about the population
summarize them to give us
more useful information
• help interprets and draws
conclusions from the data
Measures of Variation
ReviewMeasures of Central

population
Measures of Position or
Tendency Location

parameter
lin
g
tec
hni explorato
que ry data
s analysis
sample
statistics

inference
sa
mp probability
John Alexis B. Gemino MTH101 Mathematics in the Modern World

Measures of
Location

Measures of
Variation

DESCRIPTIVE
STATISTICS
Introduction

Measures of
Central Tendency
based on collecting, organizing and
reporting data without using the data
to draw any wide-ranging conclusions

a statistical technique that organized data can be presented


summarizes and describes important through bar graphs, pie charts, time
characteristics of a data set
Source: Slidesgo

Mathematics in the Modern World: Data Management: Review on Descriptive Statistics


series graphs, etc. MTH 101: Unit II

Review
Performance in a Test
How will you describe the test
scores of students in a particular
class?
Measures of Central
Tendency
scatterplot of test scores
Measures of Variability Measures of Location
MEASURES OF CENTRAL TENDENCY
Introduction
Measures of
Variation
Measures of
Central Tendency MEAN
Measures of
Location
midpoint of the

MEDIAN MODE
ҧ �ො distribution/data
set

��෤ The number that


occurs most
frequently in a
represents data set.

� average of the
data set

�� represents
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:

MEAN

Introduction

Measures of
Central Tendency

Measures of
Location

Measures of
Variation
obtained by summing the data
values and by dividing the
number of data values

given observed data points


��1, ��2, … , ����, the mean is given
by
��
1

��෍
��=1
represents “center of gravity” of
the data set Mathematics in the Modern World: Data Management: Review on
Descriptive Statistics
���� =��1 + ��2 + ⋯ + ���� ��

MTH 101: Unit II

POPULATION MEAN VS SAMPLE MEAN

POPULATION SAMPLE

Introduction
statistical study
Measures of
Measures of
Location
a part or subset of the
Central Tendency
collection of all elements population from which the
under consideration in a information is collected
POPULATION MEAN SAMPLE MEAN
Measures of Variation
��
�� ��ҧ=1��෍
���� �� = , �� = sample size 1��෍ ����, �� = population size
��=1
��=1

MTH 101: Unit II


Mathematics in the Modern World: Data Management:
Review on Descriptive Statistics
in the
Introduction

Measures of
Central Tendency

Measures of
Location

Measures of computation of mean,


Variation
values in the data set may have
WEIGHTED MEAN different degrees of importance,
quantified by weights
multiplied by their corresponding
obtained by summing the data weights and by dividing the sum
values which are already by the sum of the weights
Central Tendency

Measures of
Mathematics in the Modern World: Data Management: Location

WEIGHTED MEAN
Review on Descriptive Statistics
MTH 101: Unit II

Given observed data points


��1, ��2, … , ����, with
Introduction the corresponding weights
Measures of ��1, ��2, … , ���� the
mean is given by �� ��
����= 1��1 +

��
��2��2 + ⋯ +
��������
��������
σ��=1
Measures of Variation

σ��=1 ��1 + ��2 + ⋯ + ����


Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:


Average (GWA) given the grades on the
following subjects with varying number of
units:
Subject No. of Units Grade
Marleyan History 2 1.00

Introduction
Military Science 5 2.50
Marksmanship 3 1.50
Measures of
Central Tendency 3
��������
Measures of
Location GWA =σ��=1
WEIGHTED MEAN 3
����=��1��1 + ��2��2 + ��3��3
��1 + ��2 + ��3=2 1.00 + 5 2.50 +
Calculate Reiner Braun’s General Weighted 3(1.50)
Measures of Variation
σ��=1 4.50
2+5+3 =
10 19.00 = ��. ����
10
GWA =2.00 + 12.50 +
Review on Descriptive Statistics
Mathematics in the Modern World: Data Management: MTH 101: Unit II
Introduction

Measures of
Central Tendency

Measures of
Location

Measures of
Variation
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:


Introduction

Measures of
Central Tendency
Measures of
Location 50% of the population/sample is
greater than or equal to this
Measures of
Variation
value; 50% of the
population/sample is less than or
equal to this value

a robust statistic

represents “midpoint” of the


distribution/data set

Mathematics in the Modern World: Data Management:


Review on Descriptive Statistics
MTH 101: Unit II
MEDIAN

Given observed and ordered (lowest


Introduction to highest) data points ��(1), ��(2),
Measures of … , ��(��), the median is:
Central Tendency
if �� is odd:
if �� is even:
Measures of Location
middle point mean of two middle
numbers
Measures of
Variation �� ��+1 2 �� �� 2 + �� ��
2+1
2

Mathematics in the Modern World: Data Management: Review on Descriptive Statistics MTH 101: Unit II

Properties of the Median


Introduction

Measures of
Central Tendency

Measures of
Location
Measures of
Variation

Mathematics in the Modern World: Data Management:


Review on Descriptive Statistics
MTH 101: Unit II
Introduction

Measures of
Central Tendency
Measures of
Location Data may have:

Measures of
Variation
• 1 mode (unimodal)
• 2 modes (bimodal)
• > 2 modes
(multimodal) • no
mode at all

The number that occurs Mathematics in the Modern World: Data Management:
most frequently in a data Review on Descriptive Statistics
MTH 101: Unit II
set.
Introduction

Measures of
Central Tendency

Measures of
Location

Measures of
Variation
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:


watching television in a week. Their
5, 7, 3, 38,
responses are:
and 7 hours.
Introduction

Measures of a. Find the mean, median and


Central Tendency
mode.
Measures of
Location
b. If another person were to be
Measures of
asked the same question and
Variation he/she responded 200 hours, how
Five persons were asked on the would this affect the mean,
usual number of hours they spent median and mode?
Mathematics in the Modern World: Data Management:
Review on Descriptive Statistics
MTH 101: Unit II
Introduction
Measures of
Central Tendency

Measures of
Location

Measures of
Variation

Mathematics in the Modern World: Data Management:


Review on Descriptive Statistics
When do we typically use
them? MEAN MEDIAN MODE

Introduction
Scale of Values Skewness
Measures of Measurement
Central Tendency Does it exist as Affected by Outliers
Distribution or a data point?
Measures of Skewness of Data Continuous May not
Location
Interval/Ratio
Measures of
Presence of
Variation Outliers or ExtremeAffected by
Type of Data/
Ordinal* Categorical a data point
Interval/Ratio Nominal*
All others
Not affected by
Not affected by
Skewness
Skewness
Not affected by
Not affected by
Outliers
Outliers
May not
Exists as
exist
exist
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:


LOCATION

Introduction

Measures of
Central Tendency

Measures of
Location
����
Measures of
Variation

MEASURES OF
PERCENTILE DECILE

����
���� QUARTILE

Review on Descriptive Statistics


MTH 101: Unit II

Mathematics in the Modern World: Data Management:


MEASURES OF
LOCATION

Introduction

Measures of
Central Tendency Fractiles (or quantiles) extend
the idea of median by dividing the
Measures of
Location
ordered data set/distribution into
more than two equal parts.
Measures of
Variation
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:

Introduction

Measures of
Central Tendency

Measures of
Location
the idea of median by dividing the
Measures of
Variation ordered data set/distribution into
more than two equal parts.
MEASURES OF
LOCATION Quartiles (����) divide the data
set into four equal parts.

Fractiles (or quantiles) extend


Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:

Measures of
Location

Measures of
Variation

MEASURES OF
LOCATION
Introduction

Measures of
Central Tendency
set into four equal parts.

Deciles (����) divide the data set


into ten equal parts
Fractiles (or quantiles) extend Percentiles (����) divide the data
the idea of median by dividing the set into 100 equal parts.
ordered data set/distribution into
more than two equal parts.

Quartiles (����) divide the data


Review on Descriptive Statistics
MTH 101: Unit II
Mathematics in the Modern World: Data Management:
Let ��(1), ��(2), … , ��(��) be the
observed and ordered (lowest to highest)
data points and �� be the fractile/quantile
Introduction location.
Measures of
Central Tendency MEDIAN QUARTILE
MEASURES OF LOCATION ��෤ = �� ��+1 , �� = 1,2,3
Measures of Location
���� = �� �� ��+1 4
2

�� ��+1 , �� = 1,2,3, … , 99
Measures of Variation

DECILE PERCENTILE ���� = ��


, �� = 1,2,3, … , 9 10 ��+1 100
���� = �� ��
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:

MEASURES OF LOCATION

MEDIAN QUARTILE ��෤ = �� ��+1 , �� = 1,2,3


Introduction Measures of
���� = �� �� ��+1 4
2

Central Tendency Measures of


DECILE PERCENTILE ���� = ��
�� ��+1 , �� = 1,2,3, … , 99
Location
, �� = 1,2,3, … , 9 10 ���� = �� �� ��+1
Measures of 100

Variation ��70, ��8 = ��80, ��9 = ��90


Note: ��෤ = ��2 = ��5 = ��50, Mathematics in the Modern World: Data Management:
Review on Descriptive Statistics
��1 = ��25, ��3 = ��75 ��1 = MTH 101: Unit II
��10, ��2 = ��20, ��3 = ��30,
��4 = ��40, ��6 = ��60, ��7 =

MEASURES OF LOCATION

Using the raw data below, compute for ��3, ��5, and ��25 and
��෤.
44 524
Introduction
45
���� = �� 0.75×48 = ��
52
Measures of 36 = ����
Central Tendency
Arrange the data in order
Measures of from lowest to highest.
Location

Measures of
���� = �� �� ��+1
Variation 4
33 47�� �� 3 47+1
3=
36 48

38 49
41 50
42 51
43 51
Measures of
Variation

Mathematics in the Modern World: Data Management:


Review on Descriptive Statistics
MEASURES OF
MTH 101: Unit II
LOCATION

What if the value inside the


parenthesis is NOT a whole number?
Suppose we have another data
Introduction point, say 103.
33 47 52 62 73 83
Measures of
Central Tendency 36 48 54 63 75 84

Measures of
38 49 56 63 78 86
Location
41 50 58 66 79 4
42 51 58 67 80
��3 = �� 3 48+1
4
43 51 59 68 81
��3 = �� 0.75×49 = �� ����.����
44 52 60 69 81

45 52 60 72 82

Mathematics in the Modern World: Data Management:


Compute for ��3, ��5, and ��25 Review on Descriptive Statistics
MTH 101: Unit II
and ��෤.
���� = �� �� ��+1
In the example, ��3 = �� 36.75 .
To estimate the 36.75th number, we

perform linear interpolation. To do this,


Introduction

Measures of
take note that �� 36.75 is between �� 36
Central Tendency

and �� 37 .
Measures of
Location

Measures of
Variation

MEASURES OF LOCATION
(2) Assign weight of (1-0.75) to the first number.

�� 36.75 = �� 36 1 − 0.75 + �� 37 (0.75)


�� 36.75 = �� 36 0.25 + �� 37 (0.75)
�� 36.75 = 79 0.25 + 80 0.75 = 79.75

So, ��3 = �� 36.75 = ����. ����.

(1) Assign weight of 0.75 to the second number.

Mathematics in the Modern World: Data Management:


Review on Descriptive Statistics
MTH 101: Unit II
Introduction Measures of consistent with the formula for the
median when �� is even, even
when we use the formula for when
MEASURES OF �� is odd.
LOCATION Using the same example,

This method of linear interpolation is


Central Tendency → ��෤ = �� 48+1 = ��(0.5×49) = �� 24.5 .
��෤ = �� ��+1 2 2
Measures of Location 24 and �� 25 .

Measures of
(1) Assign weight of 0.5 to the second number.
Take note that �� 24.5 is between �� (2) Assign weight of (1-0.5) to the
Variation
first number. �� 24.5 = �� 24 1 − 0.5 + �� 25 (0.5)
�� 24.5 = �� 24 0.5 + �� 25 0.5 =�� 24 + �� 25

2=60 + 62
2= ���� = ��෤
Central Tendency
Mathematics in the Modern World: Data Management:
Review on Descriptive Statistics
MTH 101: Unit II
MEASURES OF
VARIATION OR
DISPERSION

Introduction
��(��) − ��(1) ��3
Measures of
− ��1

INTERQUARTILE

RANGE RANGE

����
��
2 ��ҧ
Measures of Location

2
��
Measures of

�� ��
��
Variation
STANDARD
DEVIATION VARIANCE
COEFFICIENT OF VARIATION
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:

The rates of returns for two stocks, namely, Stock


A and Stock B from 2005 to 2014
Introduction

Measures of
Central
Tendency

Measures of
Location
Measures of
Variation
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:


Introduction

Measures of
Central Tendency

Measures of
Location

Measures of
Variation
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:


Introduction

Measures of
Central Tendency

Measures of
Location
Dispersion refers to the spread
Measures of or variability of data
Variation

MEASURES OF VARIATION Do the observations tend to be


quite similar (homogeneous)?
OR DISPERSION
considerably (heterogeneous)?
Do the observations tend to vary
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:


Introduction

Measures of
Central Tendency
Measures of
Location Given observed and
ordered (lowest to highest)
Measures of
Variation data points ��(1), ��(2), … ,
��(��), the range is:

Range = �� �� − �� 1

The difference between the


greatest data point and least
data point.
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:


(IQR)

Introduction

Measures of
Central Tendency
The

Measures of
Location

Measures of
Variation
difference between the 3rd
INTERQUARTILE RANGE quartile and 1st quartile.
middle 50% of the data set.
IQR = ��3 − ��1
Higher IQR → Larger Dispersion
IQR represents the spread of the
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:

Measures of
Location

Measures of
Variation

VARIANCE AND
STANDARD DEVIATION
Introduction

Measures of
Central Tendency
are clustered around
the mean.

Higher Value → Larger Dispersion

They Standard deviation has the


both same unit of measurement as the
measure data set while the variance does
how not (squared units).
closely
the They are the most commonly
values used measures of variation.

Mathematics in the Modern World: Data Management:


Review on Descriptive Statistics
MTH 101: Unit II
Introduction
POPULATION VARIANCE
VARIANCE SAMPLE VARIANCE
Measures of ��
Central Tendency
1 2෍
�� = ��
�� ��2 =1
���� − �� 2
���� − ��ҧ2
Measures of Location
��=1
�� − 1෍ ��=1
2 2
= �� 1 − �� + ⋯ + �� �� = �� 1 − ��ҧ +⋯+
Measures of Variation
− ��2 �� ���� − ��ҧ2 �� − 1
POPULATION

Mathematics in the Modern World: Data Management:


Review on Descriptive Statistics
MTH 101: Unit II

SAMPLE

STANDARD
DEVIATION
Introduction
STANDARD
Measures of DEVIATION
Central Tendency ��
DEVIATION �� =
1
�� =1��෍
STANDARD
��
Measures of Location ��=1 ���� − ��ҧ2
���� − ��2 �� − 1෍ ��=1
2 2
= �� 1 − �� + ⋯ + �� �� = �� 1 − ��ҧ +⋯+
Measures of Variation
− ��2 �� ���� − ��ҧ2 �� − 1
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:


Introduction

Measures of
Central Tendency

Measures of
Location

Measures of
Variation
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:


Introduction

Measures of
Central Tendency

Measures of
Location

Measures of
Variation

Mathematics in the Modern World: Data Management: Review on Descriptive Statistics MTH 101: Unit II
VARIATION (CV)

Introduction
A unitless measure
Measures of that can be
Central Tendency
used to compare the
Measures of
dispersion of
Location two or more data sets with
Measures of
different units of measurement.
Variation

COEFFICIENT OF This expresses


standard deviation
as a percentage of the mean or �� × 100%
���� = ��ҧ
��
���� = ��× 100% or
Review on Descriptive Statistics
MTH 101: Unit II
Mathematics in the Modern World: Data Management:
Introduction

Measures of
Central Tendency

Measures of
Location

Measures of
Variation
Review on Descriptive Statistics
MTH 101: Unit II

Mathematics in the Modern World: Data Management:

PROBLEM
Example
The numbers of incorrect answers on a
true-false competency test for a
random sample of 15 students were
recorded as follows: 2, 1, 3, 0, 1, 3, 6,
0, 3, 3, 5, 2, 1, 4, and 2.
Introduction Measures of Compute the following statistics:

Central Tendency
c. Variance IQR = X12 – X4 = 3 – 1 = 2 S2 =
Measures of
Location d. Standard Deviation e. 2.9714
Measures of
Coefficient of Variation. S = 1.7238
Variation

a. Range CV = 71.82%
R=6–0=6
b. IQR
Mathematics in the Modern World: Data Management: Review on Descriptive Statistics MTH 101: Unit II

Thank
you!

You might also like