Professional Documents
Culture Documents
INTRODUCTION TO STATISTICS
Statistics is a collecting, analysing, interpreting, presenting, and organizing data.
Summation Notation
In summarizing data, we often have to find the sum of several numbers. A more concise way to
n
express the sum a 1+ a2+ a3 +…+ an is to use the summation notation or sigma notation. ∑ ak
k =1
Examples: 4
5 ∑ k +2
Evaluate: ∑ k2 k =1
k =1
Solution: 2 2 2 2 2 Solution:
=1 + 2 +3 + 4 +5 =(1+2)+(2+2 )+(3+2)+(4 +2)
=1+ 4+9+ 16+25 =3+4+5+ 6
=55 =18
∑3k
k =1
x f fx x2 fx 2
15 3 45 225 675
12 8 96 144 1152
11 4 44 121 484
8 1 8 64 64
N=16 Σ fx=193 2
Σ x =¿554 2
Σ fx =¿2375
A table has a title and appropriate headings for the columns and rows. Notice how table neatly
organizes a data set.
Baby Boom!
Do you know in what month most babies are born? Let’s do a mini survey in your class. Gather the
birth months of everyone in the class and organize the data in the table.
Stem and leaf display is a method of organizing data in which the stem values or the leading digits
for each observation are listed in a column and the leaf values or the trailing digits for each
observation is then listed beside the corresponding stem.
Examples: 1. Marco’s travel time from school to his house varies daily due to traffic and other
factors. He records his daily travel time in minutes for 20 days as follows:
25, 32, 18, 26, 40, 20, 31, 21, 34, 28, 19, 36, 25, 38, 41, 40, 23, 22, 30, 31
Construct a stem and leaf diagram.
1 8, 9
2 0, 1, 2, 3, 5, 5, 6, 8
3 0, 1, 1, 2, 4, 6, 8
4 0, 0, 1
2. The following back-to-back stem and leaf diagram represents the scores of the students in two
different sections for a common examination. In each section, 30 students took the examination.
I- Charity I- Peace
Leaf Stem Leaf
5, 2, 1 4 0, 3
7, 6, 3, 3, 0 5 1, 2, 2, 5
8, 8, 7, 5, 1, 0 6 3, 4, 6, 6, 7, 9
9, 8, 7, 7, 6 7 0, 0, 3, 5, 8, 8, 9
9, 7, 7, 3, 2, 1, 1 8 0, 1, 1, 3, 4, 5, 5
6, 3, 0, 0 9 1, 2, 2, 5
a. Which section got the highest score? What was the score?
b. Which section got the lowest score? What was the score?
c. How would you decide which class did better in the exam?
Answers:
a. I- Charity. The highest score was 96.
b. I-Peace. The lowest score was 40.
c. Answers may vary. One way is to get the average score obtained by each class.
Try This
The following back-to-back stem and leaf diagram represents the total points made by two
basketball teams in 20 games.
Purefoods Alaska
Leaf Stem Leaf
6 8
8, 5, 4, 0 7 2, 6, 7
9, 7, 5, 3, 3 8 0, 4, 5, 8
8, 6, 5, 4, 2, 2, 0 9 1, 2, 5, 6, 6, 8
7, 6, 4, 0 10 2, 3, 7, 8, 8, 9
a. Which basketball team made the most points in a game? What was the score?
b. Which basketball team made the least number of points in a game?
c. How would you decide which team made more points?
Frequency distribution is a table that displays frequencies for the different categories, values of a
counting variable or class intervals.
Note: N is the
Types of Frequency Distribution total number of
data.
a. Ungrouped Frequency Distribution
In the ungrouped frequency distribution, the data are arranged in a chronological order,
usually from the highest to lowest.
Consider the given data below that show the score of 25 students in a 40-itme math test.
29 20 29 27 28
25 19 33 27 33
32 33 29 25 29
28 28 25 20 28
20 27 27 19 30
Here are the results of 60-item test of a class. Construct a frequency distribution.
71 50 33 55 66 58
77 45 36 63 63 46
68 40 40 70 61 41
64 35 45 72 65 38
55 31 50 74 56 34
Solution: a. Find the range. ( the difference between highest score and the lowest score)
HS−LS=range
77−31=46
b. Number of Classes
Using Sturge’s Rule. Find k.
k = 1+3.3LogN
In the given, N=30 since the total number of data is 30.
k = 1 + 3.3Log30
k= 5.87 6
c. Class Limit – it is the end numbers which define a class. It should not overlap so that
the variables can be classified in one and only one class.
d. Class Boundaries or True Limits – It is the lower class boundary and upper class
boundary. This is obtained by getting the midpoint of the lower class limits of the
adjacent classes.
e. Class Mark - This is the midpoint of each class. It is obtained by dividing the sum of the
upper and lower values of the class by 2.
Example: 31 - 37
31+ 37 68
= =34
2 2
f. Cumulative Frequency – There are two ways on how to get the cumulative frequency.
Less than cumulative frequency (<cf)
This can be obtained by taking the sum of the frequency of the class and the
frequencies of the lower classes.
Greater than cumulative frequency (>cf)
This can be obtained also by adding the frequency of the class with the frequencies
of the upper classes.
Name: _________________________________
QUIZ
Construct a frequency distribution table. Given below are the scores of 30 students in their 50-item
math exams.
32 45 12 34 25 28 30 48 36 45
18 22 29 43 29 33 42 37 31 23
37 39 21 26 20 28 32 30 40 25
Daniel tallies the results of the survey and presents the data in a frequency table.
Drama
Dance
Glee
Culinary
Art
Legend: = 3 students
Try This
Construct a bar graph and pictograph using this data. Let represent 5 students.
Daniel may also choose to present the data on students’ preferred interest clubs using a bar graph.
Bar graph uses bars of different lengths and of equal widths to show the frequencies for each of
several categories.
Try This
A pie chart is a circle divided into sectors proportional to the frequencies. It shows how a part of
something relates to the whole. It is important to define what the whole represents .
Oxygen
20%
nitrogen
80%
Example: Construct a pie chart using the data on students’ preferred interest clubs.
Solutions: Divide the circle into sectors. Each sector corresponds to the percentage of students who
prefer a particular interest club.
Dance 24 24 24
× 100 %=16.67 % × 360° =60 °
144 144
Glee 36 36 36
× 100 %=25 % × 360° =90 °
144 144
Culinary 18 18 18
× 100 %=12.5 % × 360° =45 °
144 144
Art 21 21 21
× 100 %=14.58 % × 360° =52.5°
144 144
a. 16.67 % of the students preferred dance.
b. 12.5 % of the students preferred Glee over Culinary
c. 14.58 % of the students preferred Art
15%
31%
12%
25% 17%
Try This
30 24 30 25 26 40 42 45 48 31
28 18 34 36 39 29 16 21 37 27
4
Intervals
3 Tally Frequency
46-50 I 1
2
41-45 II 2
36-40
1 IIII 4
31-35 II 2
0
26-30 IIII 5
16-20 21-25 26-30 31-35 36-40
21-25 3 41-45 46-50
III
16-20 III 3
Each bar starts with the lower limit of the interval and extends to the upper limit.
50-item test
Try This
Interval Frequency
18-20 2
15-17 8
12-14 9
9-11 7
6-8 3
3-5 1
A line graph shows trends in data clearly. This displays data which are collected over a period of
time to show how the data change at regular intervals.
100
80
frequency
60 Math
40 English
20
0
First Quarter Second Quarter Third Quarter Fourth Quarter
Try This