You are on page 1of 20

Sheikh Osman Secondary ‫مدرسة شيخ عثمان الثانوية‬

School Gardo ‫في قرطو‬

Name: ………………………………………………………………… F3………

Course: Statistics

Facilitator: Abdullahi Ali

HAADI Page 1
What is Statistics?

 Statistics is a way to get information from data.


 Statistics is the process of:

• Collecting

• Organizing

• Presenting

• Analyzing

• And Interpreting

 Numerical information from data

Types of statistics

Describes and summarizes data Draws conclusion and makes prediction

Data Types

 Quantitative
 Qualitative

Types of Variable

 A quantitative variable has a value or numerical measurement for which


operations such as addition or averaging make sense.
 A qualitative variable describes and individual by placing the individual
into a category or group such as male or female.

Obtaining the Data

 Direct collecting of data. This called Primary data


 Using data collected by others. This called Secondary data

HAADI Page 2
Method of Collecting Data

• Questionnaire

• Interview

• Observation

Mean, median and mode of ungrouped data

Frequency Distribution

Arrangement of data

A set of data

28 31 29 27 30 29 29 26 30 28
28 29 27 26 32 28 32 31 25 30

HAADI Page 3
27 30 29 30 28 29 31 27 28 28

Can be arranged in ascending order:

25 26 26 27 27 27 27 28 28 28
28 28 28 28 29 29 29 29 29 29
30 30 30 30 30 31 31 31 32 32

Arrangement of data

Table of values

25 26 26 27 27 27 27 28 28 28
28 28 28 28 29 29 29 29 29 29
30 30 30 30 30 31 31 31 32 32

It can be entered into a table.

The number of occasions on which any particular value occurs is called the

frequency, denoted by f.

Value Number of times


25 1
26 2
27 4
28 7
29 6
30 5

HAADI Page 4
31 3
32 2

Arrangement of data

Tally diagram

When dealing with large numbers of readings, instead of writing all the values in
ascending order, it is more convenient to compile a tally diagram, recording the range of
values of the variable and adding a stroke for each occurrence of that reading:

Arrangement of data

Grouped data

If the range of values of the variable is large, it is often helpful to consider these values
arranged in regular groups or classes

HAADI Page 5
HAADI Page 6
Data Presenting

• Tabulation

• Chart

• Importance of Chart : Easy to understand and Easy to compare

Types of Chart

• Pictogram

• Bar chart

• Pie chart

• Histogram

• Frequency Polygon

• Cumulative Frequency Curve

Simple Bar Chart

The number of students taken for admission in various faculties of a particular University
in 2007/2008 academic session is given below:

HAADI Page 7
Faculties Number of Students
Art 150
Agriculture 100
Business study 350
Engineering 100
Law 200
Science 300

Component Bar Chart

Considering the example used for Simple Bar Chart, if the number of students taken for
admission can further be classified by gender then we have:

Faculties Male Female

Arts 100 50

Agric 60 40

Business 150 200

Ingineering 80 20

Law 100 50

Science 150 150

HAADI Page 8
Frequency Polygon

HAADI Page 9
Cumulative Frequency Curve

years) Frequency Cumulative Frequency


10 5 5
11 10 5+10 = 15
12 27 15+27 = 42
13 18 42+18 = 60
14 6 60+6 = 66
15 16 66+16 = 82
16 38 82+38 = 120
17 9 120+9 = 129

Measures of Central Tendency

Mean – Grouped Data

Example: The following table gives the frequency distribution of the number of
student registered each day during the past 50 days at the office via mail.
Calculate the mean.

Number f
of Students
10 – 12 4
13 – 15 12
16 – 18 20
19 – 21 14
  n = 50
HAADI Page 10
Solution:

Number f x fx
of Students
10 – 12 4 11 44
13 – 15 12 14 168
16 – 18 20 17 340
19 – 21 14 20 280
  n = 50 = 832

X is the midpoint of the class. It is adding the class limits and divides by 2

x=
 fx = 832 =16.64
n 50

Median – Grouped Data

Step 1: Construct the cumulative frequency distribution.

Step 2: Decide the class that contain the median.

Class Median is the first class with the value of cumulative

frequency equal at least n/2.

Step 3: Find the median by using the following formula:

 n 
 2 -F 
Median = Lm + i
 fm 
 
Where:

n = the total frequency

HAADI Page 11
F = the cumulative frequency before class median

Fm = the frequency of the class median

i = the class width

Lm = the lower boundary of the class median

Example: Based on the grouped data below, find the median:

Time to travel to school Frequency


1 – 10 8
11 – 20 14
21 – 30 12
31 – 40 9
41 – 50 7

Solution:

HAADI Page 12
1st Step: Construct the cumulative frequency distribution

Time to travel Frequency Cumulative


to school Frequency
1 – 10 8 8
11 – 20 14 22
21 – 30 12 34
31 – 40 9 43
41 – 50 7 50

N 50
= = 25 class median is the 3rd class
2 2

So, F = 22, fm= 12, Lm= 20.5 and i = 10

Therefore,
n 
 - F 
Median = Lm   2 i
 m f
 
 25 - 22  to school and another 25 students
Thus, 25 students take less=than   to travel
.5 minutes
2124 10
take more than 24 minutes to travel to school.
12 
Mode – Grouped Data = 24
Mode

• Mode is the value that has the highest frequency in a data set.

• For grouped data, class mode (or, modal class) is the class with the highest
frequency.

• To find mode for grouped data, use the following formula:


 Δ1 
Mode = Lmo +  i
Where:  Δ1 + Δ2 
i is the class width

HAADI Page 13
1 is the difference between the frequency of class mode and the frequency of the class
after the class mode.

2 is the difference between the frequency of class mode and the frequency of the class
before the class mode.

Lm is the lower boundary of class mode.

Calculation of Grouped Data – Mode

Example: Based on the grouped data below, find the mode

Time to travel to work Frequency


1 – 10 8
11 – 20 14
21 – 30 12
31 – 40 9
41 – 50 7

Solution:

Based on the table,

Lm = 10.5, 1= (14 – 8) = 6, 2= (14 – 12) = 2 and i = 10

 6 
Mode = 10.5   10  17.5
62

MEASURES OF DISPERSION

• Measures how spread-out of data values from the center.

HAADI Page 14
• Example

– Set 1: 60 40 30 50 60 40 70 50

– Set 2: 50 49 49 51 48 50 53 50

• The two data sets given above have a mean of 50, but obviously set 1 is more
“spread out” than set 2. How do we express this numerically? The object of
measuring this dispersion is to obtain a single summary figure which adequately
exhibits whether the distribution is compact or spread out.

Some of the commonly used measures of dispersion

• Range

• Inter-quartile range

• Variance

• Standard deviation

• Coefficient of variation.

• Range

Range is the easiest way of measuring the spread of a data set is the range of values.

Example

HAADI Page 15
Range = Largest - Smallest
= 48 - 35 = 13

Quartiles

Using the same method of calculation as in the Median, we can get Q1 and Q3 equation as
follows:

n   3n 
 4-F   4 -F 
Q3  LQ3 +  i
Q1  LQ1 +  i  fQ3 
 f Q1   
 

Example: Based on the grouped data below, find the Interquartile Range

Time to travel to school Frequency


1 – 10 8
11 – 20 14
21 – 30 12
31 – 40 9
41 – 50 7

HAADI Page 16
Solution:

1st Step: Construct the cumulative frequency distribution

Time to travel Frequency Cumulative


to school Frequency
1 – 10 8 8
11 – 20 14 22
21 – 30 12 34
31 – 40 9 43
41 – 50 7 50
2nd Step: Determine the Q1 and Q3

n 50
Class Q1    12.5
4 4
Class Q1 is the 2nd class n 
 4 -F 
Therefore, Q1  LQ1   i
f
 1 
Q
 
 12.5 - 8 
 10.5   10
 14 
 13.7143
3n 3  50 
Class Q3    37.5
4 4 n 
 4 -F 
Class Q3 is the 4th class Q3  LQ3   i
 fQ3 
Therefore,  
 37.5 - 34 
Interquartile Range  30.5   10
 9 
 34.3889
IQR = Q3 – Q1

IQR = Q3 – Q1

Calculate the IQ

IQR = Q3 – Q1 = 34.3889 – 13.7143 = 20.6746

HAADI Page 17
Mean Deviation

• This is a much better measure of dispersion.

• Mean deviation takes into consideration the limitations of range and quartile
deviation; it takes account of all observations.

• In other words, mean deviation is the mean of the absolute values of the deviation
from some measure of central tendency. Mean deviation is average of the absolute
deviations from the mean.

Example:

Find the mean deviation of the following scores: 5, 9, 16, 17, 18.

Variance and Standard Deviation


  fx 
2

Population Variance:
 fx 2

N
 
2

N
  fx 
2
Variance for sample data:
 fx 2

n
s2 
n 1

HAADI Page 18
Standard Deviation

Population:   2
2

Sample:
s 2
 s 2

Example:

Find the variance and standard deviation for the following data:

No. of Student f
10 – 12 4
13 – 15 12
16 – 18 20
19 – 21 14
Total n = 50
Solution:

No. of Student F x fx fx2


10 – 12 4 11 44 484
13 – 15 12 14 168 2352
16 – 18 20 17 340 5780
19 – 21 14 20 280 5600
Total n = 50 832 14216
  fx 
2

Variance,  fx 2

n
s2 
n 1
 832 
2
Standard Deviation,
14216 
 50
s  s  7.5820 502.75
2
1
 7.5820
Thus, the standard deviation of the number of student registered a college during
the past 50 days is 2.75.

HAADI Page 19
Example 2

Find Variance and Standard Deviation

Solution

HAADI Page 20

You might also like