
Lesson 2.1 Review on Descriptive Statistics
Lesson 2.2 Review on Inferential Statistics
Lesson 2.3 Graphs and Charts
Lesson 2.4 Descriptive Statistics in Excel
STATISTICS

Mathematics in the Modern World: Data Management: Review on Descriptive Statistics MTH 101: Unit II
STATISTICS
What is the importance of statistics in people’s lives?

• to be able to effectively conduct research
• to be able to read and evaluate journal articles
• to further develop critical thinking and analytic skills
• to be an informed consumer
• to know when you need to hire outside statistical help

STATISTICS

Statistics is a science that involves data collection, data interpretation, and finally, data validation.

Statistical Data Analysis is a procedure of performing various statistical operations. It is a kind of quantitative research, which seeks to quantify the data and typically applies some form of statistical analysis.

Statistical Data Analysis generally involves statistical tools that cannot be applied properly without some statistical knowledge.
Branches of Statistics

Descriptive Statistics
• focuses on collecting, organizing, summarizing, and presenting the data
• helps us compress data and summarize them to give us more useful information

Inferential Statistics
• helps interpret and draw conclusions from the data
• allows us to make conclusions about the population

Review: Measures of Central Tendency, Measures of Variation, Measures of Position or Location

Key terms: population, parameter, sampling techniques, exploratory data analysis, sample, statistics, inference, probability

John Alexis B. Gemino MTH101 Mathematics in the Modern World


DESCRIPTIVE STATISTICS

Outline: Introduction, Measures of Central Tendency, Measures of Location, Measures of Variation

• a statistical technique that summarizes and describes important characteristics of a data set
• based on collecting, organizing, and reporting data without using the data to draw any wide-ranging conclusions
• organized data can be presented through bar graphs, pie charts, time series graphs, etc.

Source: Slidesgo

Review
Performance in a Test
How would you describe the test scores of students in a particular class?

[Figure: scatterplot of test scores]

• Measures of Central Tendency
• Measures of Variability
• Measures of Location
MEASURES OF CENTRAL TENDENCY

MEAN ($\bar{x}$): represents the average of the data set
MEDIAN ($\tilde{x}$): represents the midpoint of the distribution/data set
MODE ($\hat{x}$): the number that occurs most frequently in a data set

MEAN

• represents the “center of gravity” of the data set
• obtained by summing the data values and dividing by the number of data values

Given observed data points $x_1, x_2, \ldots, x_n$, the mean is given by

$$\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i = \frac{x_1 + x_2 + \cdots + x_n}{n}$$

POPULATION MEAN VS SAMPLE MEAN

POPULATION: the collection of all elements under consideration in a statistical study
SAMPLE: a part or subset of the population from which the information is collected

POPULATION MEAN
$$\mu = \frac{1}{N}\sum_{i=1}^{N} x_i, \quad N = \text{population size}$$

SAMPLE MEAN
$$\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i, \quad n = \text{sample size}$$

WEIGHTED MEAN

• in the computation of the mean, values in the data set may have different degrees of importance, quantified by weights
• obtained by multiplying each data value by its corresponding weight, summing these products, and dividing the result by the sum of the weights

WEIGHTED MEAN

Given observed data points $x_1, x_2, \ldots, x_n$ with the corresponding weights $w_1, w_2, \ldots, w_n$, the weighted mean is given by

$$\frac{\sum_{i=1}^{n} w_i x_i}{\sum_{i=1}^{n} w_i} = \frac{w_1 x_1 + w_2 x_2 + \cdots + w_n x_n}{w_1 + w_2 + \cdots + w_n}$$

WEIGHTED MEAN

Calculate Reiner Braun’s General Weighted Average (GWA) given the grades in the following subjects with varying numbers of units:

Subject            No. of Units   Grade
Marleyan History   2              1.00
Military Science   5              2.50
Marksmanship       3              1.50

$$\text{GWA} = \frac{\sum_{i=1}^{3} w_i x_i}{\sum_{i=1}^{3} w_i} = \frac{w_1 x_1 + w_2 x_2 + w_3 x_3}{w_1 + w_2 + w_3} = \frac{2(1.00) + 5(2.50) + 3(1.50)}{2 + 5 + 3}$$

$$\text{GWA} = \frac{2.00 + 12.50 + 4.50}{10} = \frac{19.00}{10} = \mathbf{1.90}$$
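The GWA computation above can be reproduced with a short Python sketch. The function name `weighted_mean` and the variable names are illustrative choices, not part of the original slides; the grades and units are the ones given in the table.

```python
def weighted_mean(values, weights):
    """Return sum(w_i * x_i) / sum(w_i) for paired values and weights."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("weights must not sum to zero")
    # Multiply each value by its weight, add the products, divide by total weight.
    return sum(w * x for w, x in zip(weights, values)) / total_weight


grades = [1.00, 2.50, 1.50]  # Marleyan History, Military Science, Marksmanship
units = [2, 5, 3]            # number of units acts as the weight

gwa = weighted_mean(grades, units)
print(f"GWA = {gwa:.2f}")  # GWA = 1.90
```

Setting every weight to 1 reduces this to the ordinary mean, which is one way to see that the weighted mean generalizes it.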
Properties of the Mean

MEDIAN

• represents the “midpoint” of the distribution/data set
• at least 50% of the population/sample is greater than or equal to this value; at least 50% of the population/sample is less than or equal to this value
• a robust statistic (not strongly affected by outliers or extreme values)

MEDIAN

Given observed and ordered (lowest to highest) data points $x_{(1)}, x_{(2)}, \ldots, x_{(n)}$, the median is:

if $n$ is odd, the middle point:
$$\tilde{x} = x_{\left(\frac{n+1}{2}\right)}$$

if $n$ is even, the mean of the two middle numbers:
$$\tilde{x} = \frac{x_{\left(\frac{n}{2}\right)} + x_{\left(\frac{n}{2}+1\right)}}{2}$$
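The two cases of the median formula can be sketched in Python as follows. The function name `median_of` and the sample lists are made up for illustration; note that Python lists are 0-based, so the 1-based position $(n+1)/2$ becomes index `n // 2`.

```python
def median_of(data):
    """Median via the odd/even rule on the ordered data."""
    ordered = sorted(data)          # order from lowest to highest
    n = len(ordered)
    if n == 0:
        raise ValueError("median of an empty data set is undefined")
    mid = n // 2
    if n % 2 == 1:
        # n odd: the single middle point x_((n+1)/2), i.e. 0-based index n // 2
        return ordered[mid]
    # n even: mean of x_(n/2) and x_(n/2 + 1), i.e. 0-based indices mid - 1 and mid
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median_of([3, 1, 4, 1, 5]))  # 3   (ordered: 1, 1, 3, 4, 5)
print(median_of([3, 1, 4, 1]))     # 2.0 (ordered: 1, 1, 3, 4)
```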

Properties of the Median

MODE

The number that occurs most frequently in a data set.

Data may have:
• 1 mode (unimodal)
• 2 modes (bimodal)
• > 2 modes (multimodal)
• no mode at all

Properties of the Mode

Five persons were asked about the usual number of hours they spend watching television in a week. Their responses are: 5, 7, 3, 38, and 7 hours.

a. Find the mean, median, and mode.
b. If another person were asked the same question and responded 200 hours, how would this affect the mean, median, and mode?
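One way to check both parts of this exercise is with Python's standard `statistics` module. This is only a sketch of the computation, not the worked solution from the original slides; it assumes Python 3.8 or later, where `multimode` is available.

```python
from statistics import mean, median, multimode

hours = [5, 7, 3, 38, 7]            # part (a): the five responses
hours_with_outlier = hours + [200]  # part (b): a sixth response of 200 hours

for label, data in (("a", hours), ("b", hours_with_outlier)):
    print(
        f"({label}) mean = {mean(data):.2f}, "
        f"median = {median(data)}, "
        f"mode(s) = {multimode(data)}"
    )
```

The mean rises from 12 hours to about 43.33 hours once the 200-hour response is included, while the median and the mode both stay at 7 hours. This illustrates why the median and mode are described as not affected by outliers.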

When do we typically use them?

                                        MEAN                        MEDIAN                      MODE
Type of data / scale of measurement     Continuous; interval/ratio  Ordinal*; interval/ratio    Categorical; nominal*; all others
Distribution or skewness of data        Affected by skewness        Not affected by skewness    Not affected by skewness
Presence of outliers or extreme values  Affected by outliers        Not affected by outliers    Not affected by outliers
Does it exist as a data point?          May not exist               May not exist               Exists as a data point

MEASURES OF LOCATION

QUARTILE ($Q_k$)
DECILE ($D_k$)
PERCENTILE ($P_k$)

MEASURES OF LOCATION

Fractiles (or quantiles) extend the idea of the median by dividing the ordered data set/distribution into more than two equal parts.

• Quartiles ($Q_k$) divide the data set into four equal parts.
• Deciles ($D_k$) divide the data set into ten equal parts.
• Percentiles ($P_k$) divide the data set into 100 equal parts.

MEASURES OF LOCATION

Let $x_{(1)}, x_{(2)}, \ldots, x_{(n)}$ be the observed and ordered (lowest to highest) data points and $k$ be the fractile/quantile location.

MEDIAN
$$\tilde{x} = x_{\left(\frac{n+1}{2}\right)}$$

QUARTILE
$$Q_k = x_{\left(\frac{k(n+1)}{4}\right)}, \quad k = 1, 2, 3$$

DECILE
$$D_k = x_{\left(\frac{k(n+1)}{10}\right)}, \quad k = 1, 2, 3, \ldots, 9$$

PERCENTILE
$$P_k = x_{\left(\frac{k(n+1)}{100}\right)}, \quad k = 1, 2, 3, \ldots, 99$$

Note: $\tilde{x} = Q_2 = D_5 = P_{50}$, $Q_1 = P_{25}$, $Q_3 = P_{75}$,
$D_1 = P_{10}$, $D_2 = P_{20}$, $D_3 = P_{30}$, $D_4 = P_{40}$,
$D_6 = P_{60}$, $D_7 = P_{70}$, $D_8 = P_{80}$, $D_9 = P_{90}$
MEASURES OF LOCATION

Using the raw data below, compute $Q_3$, $D_5$, $P_{25}$, and $\tilde{x}$.

Arrange the data in order from lowest to highest (here, read down each column):

33  47  52  62  73  83
36  48  54  63  75  84
38  49  56  63  78  86
41  50  58  66  79  88
42  51  58  67  80  88
43  51  59  68  81  89
44  52  60  69  81  90
45  52  60  72  82

With $n = 47$ data points:

$$Q_k = x_{\left(\frac{k(n+1)}{4}\right)}$$

$$Q_3 = x_{\left(\frac{3(47+1)}{4}\right)} = x_{(0.75 \times 48)} = x_{(36)} = \mathbf{79}$$
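The slide works out only $Q_3$. Applying the same position formulas to the remaining requested fractiles, with $n = 47$, gives the following; this is a sketch that follows the decile, percentile, and median formulas stated earlier, not a reproduction of the original solution.

```latex
\begin{align*}
D_5 &= x_{\left(\frac{5(47+1)}{10}\right)} = x_{(24)} = 60 \\
P_{25} &= x_{\left(\frac{25(47+1)}{100}\right)} = x_{(12)} = 50 \\
\tilde{x} &= x_{\left(\frac{47+1}{2}\right)} = x_{(24)} = 60
\end{align*}
```

As expected from the identities between fractiles, $D_5$ and $\tilde{x}$ point to the same position and therefore the same value.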

MEASURES OF LOCATION

What if the value inside the parentheses is NOT a whole number?

Suppose we have another data point, say 103.

33  47  52  62  73  83
36  48  54  63  75  84
38  49  56  63  78  86
41  50  58  66  79  88
42  51  58  67  80  88
43  51  59  68  81  89
44  52  60  69  81  90
45  52  60  72  82  103

Compute $Q_3$, $D_5$, $P_{25}$, and $\tilde{x}$.

With $n = 48$ data points:

$$Q_k = x_{\left(\frac{k(n+1)}{4}\right)}$$

$$Q_3 = x_{\left(\frac{3(48+1)}{4}\right)} = x_{(0.75 \times 49)} = x_{(36.75)}$$

MEASURES OF LOCATION

In the example, $Q_3 = x_{(36.75)}$.

To estimate the 36.75th number, we perform linear interpolation. To do this, note that $x_{(36.75)}$ is between $x_{(36)}$ and $x_{(37)}$.

(1) Assign a weight of 0.75 to the second number.
(2) Assign a weight of (1 - 0.75) to the first number.

$$x_{(36.75)} = x_{(36)}(1 - 0.75) + x_{(37)}(0.75)$$

$$x_{(36.75)} = x_{(36)}(0.25) + x_{(37)}(0.75)$$

$$x_{(36.75)} = 79(0.25) + 80(0.75) = 79.75$$

So, $Q_3 = x_{(36.75)} = \mathbf{79.75}$.
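The position formula and the interpolation steps above can be combined into one small Python function. This is a minimal sketch: the name `fractile` and its `parts` parameter (4 for quartiles, 10 for deciles, 100 for percentiles, 2 for the median) are assumptions made for illustration. Statistical software may use other position rules, so results from other tools can differ slightly.

```python
import math


def fractile(data, k, parts):
    """k-th fractile at 1-based position k(n+1)/parts, with linear interpolation."""
    ordered = sorted(data)
    n = len(ordered)
    position = k * (n + 1) / parts          # 1-based, may be fractional
    if not 1 <= position <= n:
        raise ValueError("position falls outside the ordered data")
    lower = math.floor(position)            # position of the first number
    weight = position - lower               # weight given to the second number
    if weight == 0:
        return ordered[lower - 1]           # whole-number position: no interpolation
    first, second = ordered[lower - 1], ordered[lower]
    return first * (1 - weight) + second * weight


scores = [
    33, 36, 38, 41, 42, 43, 44, 45,
    47, 48, 49, 50, 51, 51, 52, 52,
    52, 54, 56, 58, 58, 59, 60, 60,
    62, 63, 63, 66, 67, 68, 69, 72,
    73, 75, 78, 79, 80, 81, 81, 82,
    83, 84, 86, 88, 88, 89, 90, 103,
]  # the 48 data points, including the added 103

print(fractile(scores, 3, 4))       # Q3 at position 36.75 -> 79.75
print(fractile(scores[:-1], 3, 4))  # Q3 for the original 47 points -> 79
```

Calling `fractile(scores, 1, 2)` gives the median at position 24.5, which interpolates halfway between the 24th and 25th values.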

MEASURES OF LOCATION

This method of linear interpolation is consistent with the formula for the median when $n$ is even, even when we use the formula for when $n$ is odd.

Using the same example,

$$\tilde{x} = x_{\left(\frac{n+1}{2}\right)} \;\rightarrow\; \tilde{x} = x_{\left(\frac{48+1}{2}\right)} = x_{(0.5 \times 49)} = x_{(24.5)}$$

Note that $x_{(24.5)}$ is between $x_{(24)}$ and $x_{(25)}$.

(1) Assign a weight of 0.5 to the second number.
(2) Assign a weight of (1 - 0.5) to the first number.

$$x_{(24.5)} = x_{(24)}(1 - 0.5) + x_{(25)}(0.5)$$

$$x_{(24.5)} = x_{(24)}(0.5) + x_{(25)}(0.5) = \frac{x_{(24)} + x_{(25)}}{2} = \frac{60 + 62}{2} = \mathbf{61} = \tilde{x}$$
MEASURES OF VARIATION OR DISPERSION

RANGE: $x_{(n)} - x_{(1)}$
INTERQUARTILE RANGE: $Q_3 - Q_1$
VARIANCE: $\sigma^2$ (population), $s^2$ (sample)
STANDARD DEVIATION: $\sigma$ (population), $s$ (sample)
COEFFICIENT OF VARIATION: $\frac{\sigma}{\mu}$ (population), $\frac{s}{\bar{x}}$ (sample)

[Table: rates of return for two stocks, Stock A and Stock B, from 2005 to 2014]
MEASURES OF VARIATION OR DISPERSION

Dispersion refers to the spread or variability of data.

• Do the observations tend to be quite similar (homogeneous)?
• Do the observations tend to vary considerably (heterogeneous)?

RANGE

The difference between the greatest data point and the least data point.

Given observed and ordered (lowest to highest) data points $x_{(1)}, x_{(2)}, \ldots, x_{(n)}$, the range is:

$$\text{Range} = x_{(n)} - x_{(1)}$$

INTERQUARTILE RANGE (IQR)

The difference between the 3rd quartile and the 1st quartile.

$$\text{IQR} = Q_3 - Q_1$$

IQR represents the spread of the middle 50% of the data set.

Higher IQR → Larger Dispersion

VARIANCE AND STANDARD DEVIATION

• They both measure how closely the values are clustered around the mean. Higher Value → Larger Dispersion
• Standard deviation has the same unit of measurement as the data set, while the variance does not (squared units).
• They are the most commonly used measures of variation.

VARIANCE

POPULATION VARIANCE
$$\sigma^2 = \frac{1}{N}\sum_{i=1}^{N}(x_i - \mu)^2 = \frac{(x_1 - \mu)^2 + \cdots + (x_N - \mu)^2}{N}$$

SAMPLE VARIANCE
$$s^2 = \frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2 = \frac{(x_1 - \bar{x})^2 + \cdots + (x_n - \bar{x})^2}{n-1}$$

STANDARD DEVIATION

POPULATION STANDARD DEVIATION
$$\sigma = \sqrt{\frac{1}{N}\sum_{i=1}^{N}(x_i - \mu)^2} = \sqrt{\frac{(x_1 - \mu)^2 + \cdots + (x_N - \mu)^2}{N}}$$

SAMPLE STANDARD DEVIATION
$$s = \sqrt{\frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2} = \sqrt{\frac{(x_1 - \bar{x})^2 + \cdots + (x_n - \bar{x})^2}{n-1}}$$

Computation of STANDARD DEVIATION

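The variance and standard deviation formulas can be computed step by step as in the sketch below. The data set `values` is a made-up illustration, not taken from the slides, and the function names and the `sample` flag are assumptions chosen for this example.

```python
import math


def variance(data, sample=True):
    """Sum of squared deviations from the mean, divided by n - 1 (sample) or N (population)."""
    n = len(data)
    divisor = n - 1 if sample else n
    if divisor <= 0:
        raise ValueError("not enough data points")
    center = sum(data) / n                                # the mean
    squared_deviations = [(x - center) ** 2 for x in data]
    return sum(squared_deviations) / divisor


def standard_deviation(data, sample=True):
    """Square root of the variance, in the same units as the data."""
    return math.sqrt(variance(data, sample))


values = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data; mean is 5

print(variance(values, sample=False))            # population variance: 32 / 8 = 4.0
print(standard_deviation(values, sample=False))  # population standard deviation: 2.0
print(round(variance(values), 4))                # sample variance: 32 / 7, about 4.5714
```

The only difference between the population and sample versions is the divisor, which is why the sample variance is always slightly larger for the same data.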

COEFFICIENT OF VARIATION (CV)

A unitless measure that can be used to compare the dispersion of two or more data sets with different units of measurement.

This expresses the standard deviation as a percentage of the mean:

$$CV = \frac{\sigma}{\mu} \times 100\% \quad \text{or} \quad CV = \frac{s}{\bar{x}} \times 100\%$$

PROBLEM Example

The numbers of incorrect answers on a true-false competency test for a random sample of 15 students were recorded as follows:
2, 1, 3, 0, 1, 3, 6, 0, 3, 3, 5, 2, 1, 4, and 2.

Compute the following statistics:

a. Range: $R = 6 - 0 = 6$
b. IQR: $\text{IQR} = x_{(12)} - x_{(4)} = 3 - 1 = 2$
c. Variance: $s^2 = 2.9714$
d. Standard Deviation: $s = 1.7238$
e. Coefficient of Variation: $CV = 71.82\%$
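The five answers can be checked with a short Python sketch using the standard `statistics` module. The variable names are illustrative. The quartile positions use the $k(n+1)/4$ rule from the Measures of Location slides, which happens to give whole-number positions (4 and 12) for $n = 15$, so no interpolation is needed here.

```python
from statistics import mean, stdev, variance

errors = [2, 1, 3, 0, 1, 3, 6, 0, 3, 3, 5, 2, 1, 4, 2]
ordered = sorted(errors)
n = len(ordered)  # 15

data_range = ordered[-1] - ordered[0]   # greatest minus least

# Positions k(n+1)/4 are 4 and 12 (1-based); subtract 1 for 0-based indexing.
q1 = ordered[(n + 1) // 4 - 1]
q3 = ordered[3 * (n + 1) // 4 - 1]
iqr = q3 - q1

s2 = variance(errors)            # sample variance (divides by n - 1)
s = stdev(errors)                # sample standard deviation
cv = s / mean(errors) * 100      # coefficient of variation, in percent

print(f"Range = {data_range}")   # Range = 6
print(f"IQR = {iqr}")            # IQR = 2
print(f"s^2 = {s2:.4f}")         # s^2 = 2.9714
print(f"s = {s:.4f}")            # s = 1.7238
print(f"CV = {cv:.2f}%")         # CV = 71.82%
```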


Thank you!
