Professional Documents
Culture Documents
Business Statistics (Central Tendency) MMS 2022
Business Statistics (Central Tendency) MMS 2022
Unit 1
Unit 2
Unit 3
Unit 4
Unit 5
Unit 6
Unit 7
Unit 8
Unit 9
1|Page
Business Statistics
Definition of Statistics
The word statistics has been derived from the Latin word ‘status’. In the plural sense it means a
set of numerical figures called ‘data’ obtained by counting or measurement. In the singular sense
it means collection, classification, presentation, analysis, comparison and meaningful
interpretation of raw data
It is defined as the science which deals with the collection, analysis and interpretation of
numerical data
Presentation of findings
The planning of operations – relating to the special projects for the firm
Setting up of standards – size of employment, volume of sales, fixation of quality norms for the
manufactured product, norms for the daily output
In statistical quality control methods – statistics can be useful in various ways to ensure of
production of quality goods. This is achieved by identifying and rejecting defective or
substandard goods. The sales targets can be fixed on the basis of sale forecasts, which are done
by using varying methods of forecasting, analysis of sales etc
2|Page
Seasonal behavior – construct a seasonal index for the consumption of products
Export marketing – analyzing the quality of the products, to select the right products which has
demand in the overseas markets, analyzing the statistics of imports and exports
Maintenance of cost records – ensure cost of production includes cost of raw materials and
wages
Expenditure on advertising and sales – to find association between two or more variables such as
advertising expenditure and sales
Mutual funds, banking and financial institutions – statistics provide certain tools or techniques to
a consultant or financial adviser, provide an avenue to a person to invest his savings for
reasonable returns.
Population(N)
It is a collection of people, items or events about which you want to make inferences
Population is defined as the potential set of respondents in a geographical area. It is any large
collection of objects or individuals, such as Indian housewives, consumers etc about which the
information is desired.
Parameter
It is any summary number, like an average or percentage, that describes the entire population
Ex: to determine the average annual expenditure on clothing in a city, proportion of employees
working overtime in a factory
The mean and variance for a given population are known as population parameters.
Sample (n)
3|Page
It is a subgroup of population
Statistics
Quantitative data : It is highly statistical and highly structured. The data which is easily
measurable and tabulated
Variable
Variable is denoted by X
It means any measurable characteristic or quantity which can assume a range of numerical
values within certain limits, e.g., income, height, age, weight, wage, price etc.
Classification of Variables
Discrete variable
A variable which can take up only exact values and not any fractional values, is called a discrete
variable. It has specific values in a given class interval.
Continuous variable
A variable which can take up any numerical value[integral/fractional of whole number] with a
certain range is called a continuous variable. Height, weight, rainfall, time, temperature, etc., are
examples of continuous variables.
4|Page
Height of students in a school is a continuous variable as it can be measured to the nearest
fraction of time, i.e., years, months, days, etc.
Research variables
Independent Variable(X)
It is a variable which can be controlled by the researcher. Independent variables are also called as
Predictors or Exogenous variables.
Dependent Variable(Y)
It is a variable which can’t be controlled by the researcher. Dependent variables are also called as
Criterion or Endogenous variables.
Ungrouped data
Discrete series : the series without class interval and having frequency. It takes specific values.
The items which can be easily counted
Ex:
30 10
45 30
Continuous series : the series with class interval and having frequency. It may take any value in
a given class interval. The items cannot be easily counted.
Ex:
5|Page
30-40 10
40-50 30
What is lower class interval, upper class interval? Calculate midpoint, width of the class
interval
Ex:
30-39 10
40-49 30
What is lower class interval, upper class interval? Calculate midpoint, width of the class
interval
Ex:
29.5-39.5 10
39.5-49.5 30
What is lower class interval, upper class interval? Calculate midpoint, width of the class
interval
Ex:
150 - 199 10
200- 249 30
6|Page
For ex: 10 – 19
It can be made continuous by reducing the lower limit of the class interval by 0.5 and increasing
the upper limit of the class interval by 0.5 then the interval will be as follows :
Frequency Distribution : the arrangement and display of data, where the observed value is
paired with the frequency
Class limit : they are the lowest and highest values of a class. In 30 – 50, 30 is lower limit and
50 is the higher limit
Class interval : the difference between the upper limit and the lower limit is called the class
interval. 30 is lowest and 50 is highest
Computational measures – Arithmetic Mean (A.M), Harmonic Mean (HM) & Geometric
Mean (GM)
Arithmetic Mean is also called as Mean or Average. A.M is useful while making comparisons
among several data sets. A.M is calculated by taking all the values in the given data set.
N – number of observations
7|Page
Ex: 12, 15, 18, 21 and 25
Problem
The monthly wages of 4 workmen are Rs. 400, Rs. 440, Rs. 380 and Rs. 360. Find the A.M. of
the wages of four workmen.
Relative weight(w) : 2 3 5
Marks(x) : 30 25 20
wx : 60 75 100
Years : 1 2 3 4 5
Income(’00) : 5 10 15 20 25
The monthly wages of 4 workmen are Rs. 400, Rs. 440, Rs. 380 and Rs. 360. Find the A.M. of
the wages of four workmen.
Solution
X (d = x – 440)
400 -40
440 0
380 -60
8|Page
360 -80
∑d = -180
Income(’00) : 15 8 12 20 14
X (d = x – 12)
15 3
8 -4
12 = A 0
20 8
14 2
∑d = 9
Here A = 12, n = 5, ∑d = 9
Consumption (units) : 4 10 3 16 7 9 8 6
10 30
15 25
20 40
30 20
40 35
Solution
( loaves of bread) f
10 30 200
15 25 375
20 40 800
10 | P a g e
30 20 600
40 35 1400
Calculating Arithmetic mean for discrete series using direct method for the given distribution
400 8
440 6
380 12
360 10
Calculating Arithmetic mean for discrete series using direct method and shortcut method or
deviation method for the given distribution
92 12
125 7
180 6
80 9
11 | P a g e
Solution
Direct Method
i.e. d = (x – 125)
92 12
-33 -396
A = 125 7
0 0
180 6
55 330
80 9
-45 -405
∑f = 34 ∑ fd = -471
Calculating Arithmetic mean for discrete series using direct method and shortcut method or deviation
method for the given distribution
Profit(Lakhs No. of
) investors
22 10
12 | P a g e
15 18
18 2
24 4
Calculating Arithmetic mean for discrete series using direct method and shortcut method or deviation
method for the given distribution
Units No. of
sold salesman
315 20
260 15
200 7
400 30
10 – 20 10
20 – 30 15
30 – 40 5
40 – 50 30
50 – 60 15
60 – 70 12
70 – 80 13
13 | P a g e
80 – 90 10
Solution
X_ = A + ∑ fd’ * i
A – Assumed mean
f – Frequency
Locate the assumed mean (assumed mean can be identified from the centre of the
series having the midpoints)
20 – 30 15 25 -2 -30
30 – 40 5 35 -1 -5
40 – 50 30 45 - Assumed Mean 0 0
50 – 60 15 55 1 15
60 – 70 12 65 2 24
70 – 80 13 75 3 39
14 | P a g e
80 – 90 10 85 4 40
X_ = A + ∑ fd’ * i
X_ = 45 + (53 / 110) * 10
X_ = 45 + [4.818]*10 = 49.818
Problem
10 – 15 14
15 – 20 2
15 – 20 2 17.5 -1 -2
25 – 30 6 27.5 1 6
30 – 35 4 32.5 2 8
15 | P a g e
Here A = 22.5, i = width of class interval = 5, ∑fd’ = -16, N = 34
X_ = A + ∑ fd’ * i
Problem
10 – 20 8
20 – 30 10
30 – 40 15
40 – 50 20
50 – 60 4
60 – 70 6
70 – 80 9
16 | P a g e
Employees
20 – 30 10 25 -2 -20
30 – 40 15 35 -1 -15
40 – 50 20 45 = Assumed Mean 0 0
50 – 60 4 55 1 4
60 – 70 6 65 2 12
70 – 80 9 75 3 27
X_ = A + ∑ fd’ * i
X_ = 45 + (-16 / 72 )* 10
X_ = 45 + [-0.222] * 10 = 47.22
5–9 15
10 – 14 9
15 – 19 24
20 – 24 30
25 – 29 8
30 - 34 5
35 - 39 2 17 | P a g e
Calculate the Arithmetic Mean for the following distribution
15 – 19 25
20 – 24 7
25 – 29 14
30 – 34 40
Median 35 – 39 6
Ex: 5, 6, 8, 10, 12
If n is even
4, 5, 6, 7, 8, 9
Here n = 6
18 | P a g e
The terms are 6th and 7th term
Frequency : 6 10 5 4 2
Wt in kg F Cumulative frequency
112 6 6
118 10 16
122 5 21
130 4 25
40 2 27
N=∑f =
27
Frequency : 10 5 8 6 7 4
Wt in kg F Cumulative frequency
200 10 10
19 | P a g e
250 5 15
260 8 23
270 6 29
280 7 36
300 4 40
N = ∑f= 40
10 – 20 10
20 – 30 15
30 – 40 5
40 – 50 30
50 – 60 15
60 – 70 12
70 – 80 13
80 – 90 10
Solution
20 | P a g e
Md = l 1 + N/2 – C * i
Frequency
10 – 20 10 15
20 – 30 15 25
30 – 40 5 30 (C)
40 – 50 30(f) 60
50 – 60 15 75
60 – 70 12 87
70 – 80 13 100
80 – 90 10 110
21 | P a g e
Total N =∑f = 110
Md = l 1 + N/2 – C * i
Md = 40 + 55 – 30/30 * 10
Md = 40 + 8.33 = 48. 33
Problem
10 – 15 14
15 – 20 2
20 – 25 18
25 – 30 6
30 – 35 4
Frequency
10 – 15 14 14
15 – 20 2 16 (C)
20 – 25 18 (f) 34
25 – 30 6 40
22 | P a g e
30 – 35 4 44
Total N =∑f = 44
Solution
Md = l 1 + N/2 – C * i
Md = 20 + [22 – 16/18] * 5
Md = 20 + 1.66 = 21. 66
Problem
10 – 20 8
20 – 30 10
30 – 40 15
40 – 50 20
50 – 60 4
60 – 70 6
70 – 80 9
23 | P a g e
Calculation of Median for Continuous Series
10 – 20 8 8
20 – 30 10 18
30 – 40 15 33 = C
40 – 50 20 = f 53
50 – 60 4 57
60 - 70 6 63
70 - 80 9 72
Total N =∑f = 72
Solution
Md = l 1 + N/2 – C * i
Md = 40 + [36 – 33/20] * 10
Md = 40 + 1.5 = 41. 50
24 | P a g e
Calculate the Median for the following distribution
5–9 15
10 – 14 9
15 – 19 24
20 – 24 30
25 – 29 8
15 – 19 25
20 – 24 7
25 – 29 14
30 – 34 40
35 – 39 6
40 - 44 8
45 - 49 7
Mode
For the ungrouped data, mode is the value of the variable that occurs most frequently.
The value of the individual observations or items with the highest frequency is the mode for the
discrete series.
25 | P a g e
Continuous Series
The class interval with highest frequency is mode for the continuous series.
Ex: 8,9,11,15,16,12,15,3,7,15
In the above ex: out of 10 items the number 15 appears 3 times then 15 is called mode.
10 – 20 10
20 – 30 15
30 – 40 5
40 – 50 30
50 – 60 15
60 – 70 12
70 – 80 13
80 – 90 10
Solution
2f1 – f0 – f2
26 | P a g e
Determination of Mode
10 – 20 10
20 – 30 15
30 – 40 5(f0)
40 – 50 30(f1)
50 – 60 15(f2)
60 – 70 12
70 – 80 13
80 – 90 10
Solution
The class interval having the highest frequency is called as modal class
27 | P a g e
2f1 – f0 – f2
= 40 + (30 – 5) * 10
2*30 – 5 - 15
Problem
10 – 15 14
15 – 20 2
20 – 25 18
25 – 30 6
30 – 35 4
10 – 15 14
15 – 20 2(f0)
20 – 25 18(f1)
25 – 30 6(f2)
30 – 35 4
Total N =∑f = 44
Solution
The class interval having the highest frequency is called as modal class
28 | P a g e
l1 - Lower class interval of the modal class = 20
2f1 – f0 – f2
= 20 + (18 – 2) *5
2*18 – 2 - 6
Problem
10 – 20 8
20 – 30 10
30 – 40 15
40 – 50 20
50 – 60 4
60 – 70 6
70 – 80 9
29 | P a g e
Calculation of Mode for Continuous Series
10 – 20 8
20 – 30 10
30 – 40 15
40 – 50 20 = f
50 – 60 4
60 - 70 6
70 - 80 9
Total N =∑f = 72
Solution
The class interval having the highest frequency is called as modal class
2f1 – f0 – f2
30 | P a g e
= 40 + (30 – 5) * 10
2*30 – 5 - 15
5–9 15
10 – 14 9
15 – 19 24
20 – 24 30
25 – 29 8
30 – 34 5
35 – 39 2
15 – 19 25
20 – 24 7
25 – 29 14
30 – 34 40
35 – 39 6
40 - 44 8
45 - 49 7
31 | P a g e
Relationship between Arithmetic Mean, Geometric Mean and Harmonic Mean
The Harmonic Mean (HM) is defined as the reciprocal of the average of the reciprocals
of the data values. It is based on all the observations, and it is rigidly defined. Harmonic
mean gives less weightage to the large values and large weightage to the small values to
balance the values correctly. In general, the harmonic mean is used when there is a
necessity to give greater weight to the smaller items. It is applied in the case of times and
average rates.
Harmonic mean is the reciprocal of the average of reciprocals, the formula to define the
harmonic mean “HM” is given as follows:
The three means such as arithmetic mean, geometric mean, harmonic means are known
as Pythagorean means. The formulas for three different types of means are:
If G is the geometric mean, H is the harmonic mean, and A is the arithmetic mean, then
the relationship between them is given by:
Or
G2 = A.H
Example 1:
Solution:
Given data: 2, 5, 7, 9
32 | P a g e
Step 1: Finding the reciprocal of the values:
Step 3: Finally, take the reciprocal of the average value obtained from step 2.
Find the harmonic mean for data 12, 15, 18, 21 and 25.
Find the geometric mean for data 12, 15, 18, 21 and 25.
Measures of Dispersion
33 | P a g e
Practice Problems in Central Tendency
No. of Workers 2 4 9 31 12 6 4 2
No. of 2 6 14 16 8 3 1
companies
No. of 14 20 42 54 45 18 7
Salesman
No. of 18 38 46 27 15 8
Companies
34 | P a g e
No. of 5 7 19 29 16 9 8 7
companies
Calculate Arithmetic Mean, Median ,mode and Quartile Deviation for the following :
No. of 1 3 11 51 43 32 9
students
No. of 25 20 20 80 55 30 15 5
employees
No. of 10 13 18 40 15 5 12
students
No. of 14 21 33 47 27 36 38
Companies
35 | P a g e
Calculate Arithmetic Mean, Median and mode for the following :
No. of 5 16 56 19 4
companies
No. of 42 38 120 84 48 36 31
companies
No. of 7 13 22 10 8
companies
No. of 4 8 11 15 12 6 5
companies
36 | P a g e
Calculate Arithmetic Mean, Median and mode for the following :
No. of 8 24 27 21 10
companies
Frequency 2 12 22 20 14 4 1
37 | P a g e