Professional Documents
Culture Documents
statistical data in
2. Continuous data – these data has interval and ratio scale;
these uses parametric statistics. making important
Interval scales are measured on continuum and differences
between any two numbers on the scale are of known size. decisions.
Example: temperature, tons of garbage, number of arrests,
income, and age.
1. Quantitative variable – is one that can be measured and ordered according to quantity.
Quantitative variable may be discrete or continuous variable.
Discrete variable includes finite or countably finite.
Continuous variable covers the values in an interval of real number line.
2. Qualitative variable – is one simply used as labels to distinguish one group from the another.
Presentation of data:
1. Textual presentation – uses statements with numerals in order to describe the data for the
concrete information and in expository form.
2. Tabular presentation – uses statistical table to directly display the quantities or variables
collected as data.
3. Graphical presentation – illustrates data in a form of graphs aiding readers to understand the
text easily.
Example: circle graph, bar graph, line graph, pictograph.
The data gathered should be properly organized into grouped data called frequency distribution.
11 19 11 15 16 10
16 16 15 17 10 27
21 11 13 21 10 16
11 19 24 12 22 13
19 13 18 20 21 11
19 15 11 25 29 23
16 23 10 17 11 27
16 24 12 21 13 12
26 15 11 14 10 12
11 15 18 12 20 13
2
Solution:
1. Determine the value of k = 1 + 3 log(n) where n = 30, log 60 = 1.77815125, k = 1 + 3
(1.7781512)
k = 1 = 5.3344536
k = 6. Therefore, 6 is the estimate number of classes in these data.
EXERCISE no 1
Construct a frequency distribution table for the following data. The scores of students in a
Geometry Test.
55 63 44 37 50 57 44 57 42 46
58 40 54 65 39 27 28 56 38 45
30 35 56 78 55 27 50 28 44 28
39 37 65 43 33 70 60 61 60 44
Interpretation of Data
Any given data in statistics are useless if we don’t interpret them. The most appropriate measures
found to be useful in describing a distribution of observations are the measure of central tendency,
measures of variation, measure of relative position, z-scores, box and whisker plot, probability
and normal curve, linear regression and correlation.
3
Measures of Central Tendency
Central Tendency determines a numerical value in the central region of a distribution of scores. It
refers to the center of a distribution of observation.
There are three measures of central tendency: the mean, the median, and the mode.
1. MEAN
The mean, Mn is also called the arithmetic mean or average. It can be affected by extreme
scores. It is the balance point of a distribution.
Example: Jeffrey has been working on programming and updating a Web site for his company for the
past 24 months. The following numbers represent the number of hours Jeffrey has worked on this Web
site for each of the past 7 months: 24, 25, 31, 50, 53, 66, 78. What is the mean (average) number of
hours that Jeffrey worked on this Web site each month?
Solution:
24 + 25 + 33+ 50 + 53 + 66 + 78 329
Mn = = = 47 was the average number of
7 7
hours that Jeffrey worked on this
website each month.
Σ𝑓𝑋 Where:
Weighted Mean, WMn = WMn = weighted mean
𝑁
f = frequency
X = score
ΣfX = sum of the product of frequency and score
N = total frequency
Example: There are 1000 notebooks sold at Php 10 each; 500 notebooks at Php 20 each; 500
notebooks at Php 25 each, and 100 notebooks at Php 30 each. Compute the weighted mean.
Solution:
Prepare the frequency distribution.
Σ𝑓𝑋 35,500
WMn = = = 16.90
𝑁 2,100
Σ𝑓𝑋𝑚
a. Mn = Where:
𝑁 Mn = mean
f = frequency
Xm = class mark
ΣfXm = sum of the product of frequencies and class
marks
N = total frequency
Example: The table below summarizes the weights of the Cubs. Find the average weight of the cubs.
N = 45
Reminder: the class mark is just equal to the average value of the upper-class limit and the lower-class limit form each of the
class limits in the given frequency distribution.
Solution:
In solving for the mean given the grouped data or frequency distribution, we have to add two
columns for class mark (Xm) and fXm, that is
Weights of the Cubs f Xm fXm
201 – 210 3 205.5 616.5
191 – 200 8 195.5 1564
181 – 190 12 185.5 2226
171 – 180 11 175.5 1930.5
161 – 170 9 165.5 1489.5
151 – 160 2 155.5 311
Σ𝑓𝑋𝑚 = 8137.5
N = 45
5
Therefore:
Σ𝑓𝑋𝑚 8137.5
Mean, Mn = = = 180.83
𝑁 45
EXERCISE no 2
1. The sizes of pants sold during one business day in a department store are 32, 28, 34,
42, 36, 34, 40, 44, 32, 34. Find the average size of the pants sold.
2. Given the frequency distribution for the weights of the 50 pieces of luggage. Compute
the mean.
Weight (kilograms) Number of Pieces, f
7–9 2
10 – 12 8
13 – 15 14
16 – 18 19
19 – 21 7
N 50
2. MEDIAN
The median, Md, is the value in the distribution that divides an arranged
(ascending/descending) set into two equal parts. It is the midpoint or middlemost of a
distribution of scores.
Examples:
1. Find the median of the following prices:
Php 50, Php 55, Php 60, Php 65, Php 12, Php 35, Php 48.
Solution:
Php 12, Php 35, Php 48, Php 50, Php 55, Php 60, Php 65, N = 7
Therefore:
Md = (N+1)/2 = (7+1)/2 = 4th score
Md = 50
2. Find the median of the following weights in kilos, 101, 107, 115, 120, 111, 105.
Solution:
Arranging the numbers in ascending order.
101, 105, 107, 111, 115, 120
6
N=6
Md = (N+1)/2th score
Md = (6+1)/2 = 3.5th score, that is between the 3rd and the 4th scores.
Md = (107+111)/2 = 109
N
( −𝑐𝑓𝑏 )
2
Where:
Md = XLB + 𝑖 Md = median
𝑓𝑚
XLB = the lower boundary or true lower limit of the
median class.
N = total frequency
𝑐𝑓𝑏 = cumulative frequency before the median class
𝑓𝑚 = frequency of the median class
𝑖 = size of the class interval
Solution:
The median class that contains the 30th score is 14 – 15 since it has the 30th score.
XLB = 13.5
cfb = 24
fm = 6
i=2
7
Therefore:
N
( −𝑐𝑓𝑏 )
2
Md = XLB + 𝑖
𝑓𝑚
60
( −24)
2
= 13.5 + 2
6
6
= 13.5 + ( )2
6
= 13.5 + (1)2
= 13.5 + 2
= 15.5
This means that 50 percent of the students got a score below 15.5 or if the passing score is 50 percent
of the total number of items, almost half of the class failed in the test.
EXERCISE no 3
1. The ages of 10 Administrators in a certain college are given as follows: 40, 38, 45, 51,
44, 53, 59, 45, 56, 45. Compute the median.
2. Compute the median given the following data:
Scores in Statistics f
75 – 79 6
70 – 74 7
65 – 69 2
60 – 64 8
55 – 59 12
50 – 54 7
45 – 49 10
40 – 44 8
N 60
3. Mode
The mode is the value with largest frequency. It is the value that occurs most frequently in the
distribution. This is used when the quickest estimate of typical performance is wanted. A
distribution can be unimodal with one mode value, bimodal with two mode values and
trimodal with three mode values. In other words, it can have more than one mode.
Solution:
By inspection, the mode is 7 since it has the largest frequency.
Where,
Mo = Mode
XLB = lower boundary of the modal class
df1 = difference between the frequency of the modal class and the frequency above it.
df2 = difference between the frequency of the modal class and the frequency below it.
i = size of the class interval
Solution:
Mo = 9.5 + [14/(14+4)]2
= 9.5 + [14/(18)]2
= 9.5 + (0.78)2
= 9.5 + 1.56
Mo = 11.06
9
EXERCISE no 4
10