Professional Documents
Culture Documents
Descriptive Statistics:
•Descriptive statistics uses data that provides a description of the population either through numerical calculation
or graph or table.
b) Measures of dispersion:
A) Measure of central tendency –
The measure of central tendency also known as summary statistics is used to represent the center point or a
In statistics, there are three common measures of central tendency as shown below:
(i)Mean :
It is a measure of the average of all values in a sample set.
Formula:
Example 1: Find the mean of the first five natural odd numbers, using the mean formula.
Solution:
Mean = (1 + 3 + 5 + 7 + 9) ÷ 5 = 25/5 = 5
Example 2: The heights of five students are 161 in, 130 in, 145 in, 156 in, and,162 in respectively.
The heights of five students = 161 in, 130 in, 145 in, 156 in, and,162 in (given)
Sum of the heights of five students = (161 + 130 + 145 + 156 + 162) = 754
Solution:
Sum of the student score = 67 + 56 + 77 + 68 +88 + 90 + 65+ 53+ 45+ 71+ 66 + 80 = 826
x̄ = ∑xi/n
∴∑xi = x̄ × n
Example 1: Given below are marks (out of 100%) obtained by 100 Students in the examination find the mean?
Frequency 14 16 18 23 18 11
Solution:
Marks Mid-point(x) Freq (f) Fx
30-35 32.5 14 455.0
35-40 37.5 16 600.0
40-45 42.5 18 765.0
45-50 47.5 23 1092.5
50-55 52.5 18 945.0
55-60 57.5 11 632.5
Total -------- F=100 𝑓𝑥=4490
Wages No of workers
200-210 5
210-220 15
220-230 32
230-240 42
240-250 15
250-260 12
260-270 4
Solution:
Wages No of workers (f) Mid-point(x) Fx
Mean= /
=29115/125
=232.92
Example 3: Below are the heights of semester eight information technology students at PSU.
Use it to calculate the mean height for the data.
Mean= å fx
åf
Mean= 4870/30
= 162.33cm
Exercise:
1) Find the mean of the following data.
Frequency 15 10 20 22 16 17
2) There are 40 students in Grade 8. The marks obtained by the students in
mathematics are tabulated below. Calculate the mean marks.
100 6
95 8
88 10
76 9
69 7
3) Find the mean of the following data.
Frequency 12 16 6 7 9
Assumed mean Method for Finding the Arithmetic Mean
The following steps describe this method.
Step 1: Calculate the class marks (mid-point) of each class (xi).
Step 2: Let A denote the assumed mean of the data.
Step 3: Find deviation (di) = xi – A
Step 4: Use the formula:
x̄ = A + (∑fidi/∑fi)
Example 1: Let's understand this with the help of the following example.
Calculate the mean of the following using the assumed mean method.
Frequency 5 8 30 25 14 12 6
Solution:
Let us make the calculation table. Let the assumed mean be A = 62.5
Class (Ci) Frequency (fi) Class mark (xi) di = xi – a fidi
0-10 12 5 5 – 25 = – 20 -240
10-20 28 15 15 – 25 = – 10 -280
20-30 32 25 = a 25-25 = 0 0
30-40 25 35 35-25 = 10 250
40-50 13 45 45-25 = 20 260
Total Σfi =110 Σfidi = -10
Class Interval 20 – 60 60 – 100 100 – 150 150 – 250 250 – 350 350 – 4507
frequency 7 5 16 12 2 3
2. A group of students was surveyed as a part of their environmental
awareness.
Number of houses 1 2 1 5 6 2 3
Step Deviation Method for Finding the Arithmetic Mean
This is also called the change of origin or scale method. The following steps describe this
method:
x̄ = A + h × (∑fiui/∑fi)
Example 1: Consider the following example to understand this method. Find the arithmetic
mean of the following using the step-deviation method.
Class Intervals 0-10 10-20 20-30 30-40 40-50 50-60 60-70 Total
Frequency 4 4 7 10 12 8 5 50
Solution: To find the mean, we first have to find the class marks and decide on A (assumed mean).
Let A = 35 Here h (class width) = 10
Number of apples 5 8 10 12 8 4 3
Question 2: The following table gives marks scored by students in an examination:
Number of students 3 7 15 24 16 8 5 2
ii) Median :
It is a measure of the central value of a sample set. In these, the data set is ordered from lowest to highest value
and then finds the exact middle.
Example 1
Find the median of this data:
1, 4, 2, 5, 0
There is an odd number of data points, so the median is the middle data point.
0, 1, 2,4, 5
The median formula is {(n + 1) ÷ 2}th, where “n” is the number of items in the set and “th” just means the (n)th
number.
To find the median, first order the numbers from smallest to largest. Then find the middle number.
For example, the middle for this set of numbers is 5, because 5 is right in the middle:
1, 2, 3, 5, 6, 7, 9.
You get the same result with the formula. There are 7 numbers in the set, so n = 7:
{(7 + 1) ÷ 2}th
= {(8) ÷ 2}th
= {4}th
The 4th number in 1, 2, 3, 5, 6, 7, and 9 is 5.
Example question: Find the median for the following data set:
102, 56, 34, 99, 89, 101, 10.
Step 1: Sort your data from the smallest number to the highest number. For this example data set, the
order is:
10, 34, 56, 89, 99, 101, 102.
Step 2: Find the number in the middle (where there are an equal number of data points above and below the
number):
10, 34, 56, 89, 99, 101, 102.
The median is 89.
Example question: Find the median for the following data set:
102, 56, 34, 99, 89, 101, 10, 54.
Step 1: Place the data in ascending order (smallest to highest).
10, 34, 54, 56, 89, 99, 101, 102.
Step 2: Find the TWO numbers in the middle (where there are an equal number of data points above and below
the two middle numbers).
10, 34, 54, 56, 89, 99, 101, 102
Step 3: Add the two middle numbers and then divide by two, to get the average
56+89=145
Wages($) No of workers
800-1000 18
1000-1200 25
1200-1400 30
1400-1600 34
1600-1800 26
1800-2000 10
Solution
Wages($) No of workers Cf
800-1000 18 18
1000-1200 25 43
1200-1400 30 73
1400-1600 34 107
1600-1800 26 133
1800-2000 10 143
Now the median value is N/2=143/2=71.5
Therefore the item which lies in the class (1200-1400), thus (1200-1400) is the median class
L=1200, h=200 f= 30 n=143, C=43
Median= L+h/f (n/2-c) = 1200+200/30(143/2-43)
Median= $1390
Example 3: Calculate the median for the following data:
Marks 0 - 20 20 - 40 40 - 60 60 - 80 80 - 100
Number of students 6 20 37 10 7
Solution:
• We need to calculate the cumulative frequencies to find the median.
0 - 20 6 0+6 6
20 - 40 20 6 + 20 26
40 - 60 37 26 + 37 63
60 - 80 10 63 + 10 73
80 - 100 7 73 + 7 80
• N = sum of cf = 80, N/2 = 80/2 = 40
• Since n is even, we will find the average of the n/2 th and the (n/2
+1)th observation i.e. the cumulative frequency greater than 40 is 63 and the
class is 40 - 60. Hence, the median class is 40 - 60.
l = 40, f = 37, c = 26, h = 20
Using the Median formula:
Median = l + h/f(n/2−c)/]
= 40 + 20/37(80/2-26)
= 40 + 20/37(40-26)
= 40 + 20/37(14)
= 40 + 280/37
= 40+7.56
= 47.56
Therefore, the median is 47.56.
Exercise:
1) Calculate the median for the following frequency distribution.
Number of girls 2 8 12 20 8
iii) Mode :
It is the value most frequently arrived in the sample set. The value repeated most of the time in the central set is
the actual mode.
Example 1: What is the Mode of 3, 4, 4, 5, 6, 6, 7
15 is the mode since it is appearing more times in the set compared to other numbers.
Example 3: Find the mode of 4, 4, 4, 9, 15, 15, 15, 27, 37, 48 data set.
Solution: Given 4, 4, 4, 9, 15, 15, 15, 27, 37, 48 is the data set.
As we know, a data set or set of values can have more than one mode if more than one value occurs with equal
frequency and number of times compared to the other values in the set.
Hence, here both the numbers 4 and 15 are modes of the set.
Mode for grouped data
The mode for grouped data can be calculated by the following formula
M= L+ (fm-f1)*h
(Fm-f1) + (fm-f2)
Example1
Following are heights in (m) of 50 students find the modal height
Height No of students
109.5-114.5 5
114.5-119.5 12
119.5-124.5 23
124.5-129.5 6
129.5-134.5 4
Solution
We can see from column (2) of the table that the maximum frequency of 23 lies in the heights of „between 19.5-124.5 this
suggests that the mode lies in these heights applying the formula given.
M= L+ (fm-f1)*h
(Fm-f1) + (fm-f2)
Lm = Lower medium
Fm = Frequency medium
Lm=119.5, Fm=23, F1=12, F2=6 h=5
M= 119.5+ (23-12)*5
(23-12) + (23-6)
=119.5+55/11+17
=119.5+55/28
=121.46
Example2: The following are marks obtained by semester eight information technology students find the modal marks?
30-40 4
40-50 6
50-60 8
60-70 12
70-80 9
80-90 7
90-100 4
Solution
M = L+ (fm-f1) *h
(Fm-f1) + (fm-f2)
M = 60+ (12-8)*10
(12-8) + (12-9)
M = 65.7
Example 3: A survey on the heights (in cm) of 30 students of the same batch
was conducted at a school. The data so obtained has been organized in the table
given below. Find the mode.
120 - 125 3
125 - 130 5
130 - 135 11
135 - 140 6
140 - 145 5
Total 30
Solution:
• Modal class = 130 - 135
• Lower limit of the modal class = (L) = 130
• Frequency of the modal class = (f)1= 11
• Frequency of the preceding modal class = (f)0= 5
• Frequency of the next modal class = (f)2(= 6
• Size of the class interval = (h) = 5
Mode = L + (f1−f0)/(2f1−f0−f2)*
Putting the values in the formula,
Mode = 130 + (11-5)/(2×11-5-6)5
= 130 + (6/11)(5) = 130 + 0.54×5 = 132.72
Therefore, mode = 132.72
Exercise:
1) Find the mode for the grouped data organized in the table given below.
10 - 20 8
20 - 30 15
30 - 40 12
40 - 50 5
Total 40
2) Observe the following table and find the mode.
Class Interval 10 - 20 20 - 30 30 - 40 40 - 50
Frequency 8 12 22 17
iv) Quartile:
Quartiles divide the entire set into four equal parts. So, there are three quartiles, first, second, and third
represented by Q1, Q2, and Q3, respectively.
Q2 is nothing but the median, since it indicates the position of the item in the list and thus, is a positional
average.
To find quartiles of a group of data, we have to arrange the data in ascending order.
Quartiles Formula
Suppose, Q3 is the upper quartile is the median of the upper half of the data set. Whereas, Q1 is the lower quartile
and median of the lower half of the data set. Q2 is the median.
Example 1: Find the quartiles of the following data: 4, 6, 7, 8, 10, 23, 34.
Solution: Here the numbers are arranged in ascending order and the number of items, n = 7
Solution:
The formula for the upper quartile formula is Q3 = ¾(n + 1)th Term.
The formula instead of giving the value for the upper quartile gives us the place.
So firstly we put your numbers in ascending order: 1, 2, 5, 6, 7, 9, 12, 16, 18, 19, 26.
Solution:
The upper quartile (18) is the 9th term or on the 9th place from the left.
v) Percentile.
Percentile is defined as the value below which a given percentage falls under. For example, in a group of 20
children, Ben is the 4th tallest and 80% of the children are shorter than you.
Percentile Formula
P = (n/N) × 100
Where,
n = ordinal rank of the given value or value below the number
N = number of values in the data set
P = percentile
Or
Percentile = (Number of Values Below “x” / Total Number of Values) × 100
Example 1: The scores obtained by 10 students are 38, 47, 49, 58, 60, 65, 70, 79, 80, 9and 2 Using the
percentile formula, calculate the percentile for score 70.
Solution:
Given:
Scores obtained by students are 38, 47, 49, 58, 60, 65, 70, 79, 80, 92
Percentile of 70
= (6/10) × 100
= 0.6 × 100 = 60
Solution:
Given:
The weights of the people are 35, 41, 42, 56, 58, 62, 70, 71, 77, 90
= (4/10) × 100
= 0.4 × 100 = 40