Professional Documents
Culture Documents
OF THE PHILIPPINES
SAN PEDRO CAMPUS
FUNDAMENTALS OF
DESCRIPTIVE ANALYTICS
BUMA 30063
INSTRUCTIONAL MATERIAL
PREPARED BY
Mc Joben R. Reyes, MIS, LPT
2023
BASIC
DESCRIPTIVE
STATISTICS
CHAPTER 2
INTRODUCTION
We will focus on Summary Statistics. These are the different measures that are used to
describe any set of data. If we want to know the typical value of a certain variable, how
different the values are from one another, how a certain data point compares to the
rest, we can use these measures.
FREQUENCY DISTRIBUTION
Frequency is simply the number of occurrences of an event. A frequency distribution is
a list, table or graph that displays the frequency of various outcomes in a sample. It
tells us how many there are of each item in the data set.
Frequency distribution can show us the raw number of each item and its percentage
toward the total.
In measuring the height of 50 children, some are tall and some are short, but there is a
high probability of a higher frequency or concentration in the middle range. The most
important factors for gathering data are that the intervals used must not overlap and
must contain all of the possible observations.
25
BUMA 30063 - Fundamentals of Descriptive Analytics
In general, a histogram chart will typically show a normal distribution, which means
that the majority of occurrences will fall in the middle columns. Frequency distributions
can be a key aspect of charting normal distributions which show observation
probabilities divided among standard deviations.
26
BUMA 30063 - Fundamentals of Descriptive Analytics
26
BUMA 30063 - Fundamentals of Descriptive Analytics
The Mean
The mean (also known as the arithmetic mean) is the most commonly used measure
of central position. It is used to describe a set of data where the measures cluster or
concentrate at a point. As the measures cluster around each other, a single value
appears to represent distinctively the typical value.
x̄ = ∑x / N
Example: The grades in Geometry of 10 students are 87, 84, 85, 85, 86, 90, 79, 82, 78,
76. What is the average grade of the 10 students?
Solution: x̄ = ∑x / N
x̄ = 87 + 84 + 85 + 85 + 86 + 90 + 79 + 82 + 78 + 76 / 10
= 832 / 10
x̄ = 83.2
27
BUMA 30063 - Fundamentals of Descriptive Analytics
Example: The library logbook shows that 58, 60, 54, 35, and 97 books, respectively,
were borrowed from Monday to Friday last week. Find the median.
The Mode
The mode is the measure or value which occurs most frequently in a set of data. It is
the value with the greatest frequency.
28
BUMA 30063 - Fundamentals of Descriptive Analytics
MEASURES OF LOCATION
Sometimes, we want to know how a certain data point compares with the rest. This is
for example, in the case of rankings and quotas. In some situations, we could also
divide data into a certain number of equal sections to answer our questions, as with
certain problems that would involve brackets, classes, and other groupings.
Measures of Location specify points in the data set in which a specified amount of
data lie. This allows us to find the position of a data in relation to the entire data set.
Some examples of these are percentiles, deciles and quartiles. Percentiles divide the
data into 100 equal parts, deciles divide the data into 10 equal parts, and quartiles
divide the data into 4 equal parts.
MEASURES OF DISPERSION
There are two types of Measures of Dispersion. First is Absolute, which is the measure
of the variability within a data set, and relative dispersion which compares this data set
with other data sets.
Variance and Standard Deviation are measures of dispersion with reference to the
mean. The higher these values are, the farther away from the mean the data values are.
Standard deviation is the square of variance, resulting in a number that is always
positive and is in the same units as the mean.
29
BUMA 30063 - Fundamentals of Descriptive Analytics
The first step is to calculate the mean. The sum is 33 and there are 5 data points.
Therefore, the mean is 33 ÷ 5 = 6.6. Then you take each value in data set, subtract the
mean and square the difference. For instance, for the first value:
(2 - 6.6)^2 = 21.16
69.20 ÷5 = 13.84
The variance is 13.84. To get the standard deviation, you calculate the square root of
the variance, which is 3.72.
30
BUMA 30063 - Fundamentals of Descriptive Analytics
LEARNING ASSESSMENT
Answer the following discussion point.
When is it best to use mean? What about median or mode? Name some specific
examples of situations in which one would choose a certain measure over the two
others.
31