Professional Documents
Culture Documents
Notes
Unit 1: Introduction
Structure:
1.1 Introduction to Descriptive Statistics
1.2 The Objects of Statistics
1.3 Characteristics of Statistics in the Plural Sense or Numerical Data
1.4 Functions of Statistics
1.5 Scope of Statistics
1.6 Limitations of Statistics
1.7 Distrusts of Statistics
1.8 Diagrammatical and Graphical representation of data
1.9 Bar Diagram
1.10 Types of Bar Diagram
1.11 Frequency Polygons
1.12 Ogives
1.13 Summary
1.14 Check Your Progress
1.15 Questions and Exercises
1.16 Key Terms
1.17 Check Your Progress: Answers
1.18 Case Study
1.19 Further Readings
1.20 Bibliography
Objectives
The origin of statistics is revealed by the word itself which is said to have been derived
either from the Latin Word 'Status' or the Italian word 'statista' or the German word 'Statistik'
which means political state. Statistics was used as a by-product of administrative activity.
Govt. maintains records of various types of numerical data on population, births, deaths,
literates, illiterates, employment, unemployment, Income, Taxes, Imports, exports etc.
Statistics was used as a technique to collect periodical data to ascertain the manpower
and material strength for military and fiscal purposes.
The theoretical development of statistics has its origin in the mid-seventeenth century
when many gamblers and mathematicians of France, Germany and England are credited
for its development. Pascal and P. Fermat, the two great French mathematicians made
innovative efforts to solve the famous 'Problem of point' which was posed by the famous
French gambler Chevalier De Mere. Their contribution became the foundation stone of the
Science of Probability. James Bernoulli (1654-1705) developed the 'Normal Curve'. The use
of 'Statistics' was popularized by Sir John Sinclair in his work Statistical Account of
Scotland (1791-1799). Modern Theory of Statistics was gradually developed during the
18th, 19th and 20th centuries mathematicians. Laplace (1749-1827) gave the principles
of 'Least squares' and established the 'Normal Law of Errors'. The famous statisticians
Sir Francis Galton (1822-1911), Karl Pearson (1857-1936) and W.S. Gosset contributed
to the study of Regression Analysis, Correlation Analysis as well as Chi-square test of
Goodness of Fit, and t-test respectively. R.A. Fisher, who is called "Father of Statistics",
has developed statistics for use in genetics, biometry, agriculture, psychology and
education. He also contributed to the Estimation Theory, Sampling Distribution, Analysis
of Variance (ANOVA) and design of experiments. Thus Prof. Ronald A. Fisher is the real
exponent in the development of the 'Theory of Statistics".
Meaning of Statistics
Statistics is used as general name for a large group of mathematical tools, not aiming
at absolutely accurate results but approximate results based on the theory of probability,
used to collect, analyse and interpret numerical facts for solving specific problems. Facts
that one dealt with in statistics must be capable of numerical expression. Otherwise they
do not come within the preview of statistics. Statistics is also concerned with a group
of data as for example, population of a country, sales price of the finished goods produced
by a concern etc.
Definitions
Prof. A.L. Bowley has given as series of definitions. At one place Bowley says
"Statistics may be called the science of counting". This view is not perfect and correct.
Statistics is not concerned with counting only. It deals more with estimates. At another
place, he says that "Statistics may rightly be called the science of averages". But calling
statistics as a science of counting or averages, confines the scope of statistics. Bowley
himself realized this drawback and stated that statistics cannot be confined to any one
sense.
Webster defined Statistics as "The classified facts respecting the condition of the
people in a state- especially those facts which can be stated in numbers or in tables
of numbers or in any tabular or classified arrangement". This definition has limited the
scope of statistics. It relates statistics only to those facts which are concerned with the
condition of the people in a state. This definition is not exhaustive because, it does not
take into account all aspects of human activity.
Seligman defines statistics as the science which deals with the methods of
collecting, classifying, presenting, comparing and interpreting numerical data collected to
throw some light on any sphere of enquiry.
According to Horace Secrist “By statistics we mean aggregate of facts affected to
a marked extent by multiplicity of cause, numerically expressed, enumerated or estimated
according to reasonable standard of accuracy, collected in a systematic manner for a
predetermined purpose and placed in relation to each other”.
Notes (iii) To bring out the broad characteristics of a group in which the individual numbers
exhibit variation in attributes.
(iv) To throw light on general economic and social conditions as guide to
administrators to decide administrative policy.
(v) To indicate business trends and tendencies so that it may be possible to plan
ahead without being caught unawares.
(f) Statistics and research: Statistics is indispensable in research work. Most of Notes
the advancement in knowledge has taken place because of experiments
conducted with the help of statistical methods.
(g) Statistics and other uses: Statistics is useful to bankers, brokers, insurance
companies, social workers, labour unions, trade associations, chambers of
commerce and to the politicians.
In the field of Economics, Statistics entered rather late. The relationship among
supply of, demand for and price of commodities was established with the help of statistical
analysis by the end of the 17th century. The classical economists laid more stress upon
the deceptive method of reasoning and economic laws were reasoned out in the abstract.
The economic laws were verified from observations and proved by inductive method only
during the first decade of the 20th century. The quantitative analysis was introduced along
with the qualitative analysis in the methods of study of economics. Thus, the inductive
method was introduced in addition to the classical deductive method in the science of
economics and various problems of economics were solved by the end of the 20th century.
Statistics of production help in adjusting the supply to demand, statistics of consumption
help us to find out the way in which people of different strata of society spend their income.
Statistics are very useful in knowing the standard of living and taxable capacity of people.
Statistical methods help not only in formulating appropriate economic policies but also
evaluating their effect. Econometrics which comprises application of statistical methods
to theoretical economic models is widely used in empirical economic research.
In the field of business, decision-making process is the most important function of
management. Modern business is complex and vast, involving a number of specialised
activities, government interference and cut-throat competition. A modern manager cannot
solve the complex business problems without the assistance of statistical methods. The
entire business planning is depending upon the forecasting of the future trends which is
performed only by the science of statistics. Statistics is helpful in preparing the various
budgets, conducting the market survey, fixing the wage structure based upon cost of living
indices and controlling the business activities. Statistics and statistical methods have
provided the businessman with one of his most valuable tools for decision - making. The
use of statistics in business can be extended to production, sales, purchase, finance,
personnel, accounting, marketing and product research and quality control.
Notes 8. The greatest limitation is statistical data can be used properly only by persons
having thorough knowledge of the methods of statistics and proper training.
9. If statistical data are not uniform and homogenous, study of the problem is not
possible. Homogeneity of data is essential for their proper study.
10. Statistical methods are not the only method for studying a problem. There are
other methods also. A problem can be studied in various ways.
Meaning
One of the most convincing and appealing ways in which statistical results may be
represented is through graphs and diagrams.
It represents only one variable. Since these are of the same width and vary only
in lengths (heights), it becomes very easy for comparative study. Simple bar diagrams
are very popular in practice. A bar chart can be either vertical or horizontal; for example
sales, production, population figures etc. for various years may be shown by simple bar
charts
Illustration - 1
The following table gives the birth rate per thousand of different countries over a certain
period of time.
Country India Germany U. K. New Zealand Sweden China
Birth rate 33 16 20 30 15 40
Solution:
Notes
Comparing the size of bars, China’s birth rate is the highest, next India, while
Germany and Sweden equal in the lowest positions.
Illustration - 2
Represent the data by using a simple bar diagram.
Countries: A B C D E F
Production of Rice (000’s tons): 38 42 29 28 18 11
Solution:
50
42
40 38
29 28
30
20 18
11
10
0
A B C D E F
In a subdivided bar diagram, each bar representing the magnitude of given value is
further subdivided into various components. Each component occupies a part of the bar
proportional to its share in the total
Illustration - 1
Present the following data in a sub-divided bar diagram.
Year/Faculty Science Humanities Commerce
2014-2015 240 560 220
2015-2016 280 610 280
Solution:
Amity Directorate of Distance and Online Education
Introduction 11
Notes
Illustration - 2
The Number of Students in University X during 2008 to 2011 are as follows. Represent
the data by a similar diagram.
Year Arts Commerce Science Total
2008 - 2009 20,000 10,000 5,000 35,000
2009 - 2010 26,000 9,000 7,000 42,000
2010 - 2011 31,000 9,500 7,500 48,000
Solution:
Notes Illustration - 3
The following table shows the result of B.B.M. students for the last 3 years. Represent
the data by sub-divided bar diagram.
5000 4800
4500 4300 Ist Class
1600
4000 IInd Class
1500
3500 3400 Failed
3000
1200
2500
2600
2000
2100
1600
1500
1000
600
700
600
500
In a multiple bar diagram two or more set of related data are represented the
components are shown as separate adjoining bars. The height of each bar represents the
actual value of the component. The components are shown by different shades or colours.
Illustration - 1
The table below gives data relating to the exports and imports of a certain country
X (Rs. in Lakhs) during the four years ending in 2007-2011. Represent the data by a suitable
diagram:
Year Export Import
2007-2008 319 250
2008-2009 339 263
2009-2010 345 258
2010-2011 308 206
Solution: Notes
Illustration - 2
Construct a suitable bar diagram for the following data of number of students in two
different colleges in different faculties.
College Arts Science Commerce Total
A 1200 800 600 2600
B 700 500 600 1800
1800
= College 'A'
1600
= College 'B'
1400
1200
1200
No. of students
1000
800
800 700
600 600
600 500
400
200
Different departments
Fig: A multiple bar diagram showing numbers of students in two different colleges
in different departments.
Illustration - 3
700
600
500
400 Ist Class
300 IInd Class
IIIrd Class
200
Failed
100
0
2006 2007 2008
In percentage bar diagram the length of the entire bar kept equal to 100 (Hundred).
Various segment of each bar may change and represent percentage on an aggregate.
Illustration - 1
16%
22%
90 Children
80 Women
70 Men
36%
35%
34%
60
50
40
48%
30
45%
44%
20
10
Illustration - 1
Following are the figures relating to exports and imports for six years. Represent
the information by deviation bars.
1999 25 5 +20
2000 70 20 +50
2001 70 110 -40
2002 90 50 +40
2003 100 130 -30
Solution:
Diagram showing balance of payment (in lakhs `) for five years
+50 Favourable
+50 +40
Unfavourable
+40
+30 +20
+20
+10
2001 2003
0
1999 2000 2002
-10
-20
-30
-30
-40
-40
-50
Notes Illustration - 1
Solution:
Scale : along x axis 1 cm = 10 units
along y axis 1 cm = 5 units
35
30 Frequency polygon
No. of persons
25
20 Histogram
15
10
5
0
10 20 30 40 50 60 70 80
age
Frequency polygon showing the distribution of persons of different age group.
1.12 Ogives
When frequencies are added, they are called cumulative frequencies. The curve
obtained by plotting cumulating frequencies is called a cumulative frequency curve or an
ogive (pronounced as ojive).
To construct an Ogive: (i) Add up the progressive totals of frequencies, class by class,
to get the cumulative frequencies. (ii) Plot classes on the horizontal (x-axis) and cumulative
frequencies on the vertical (y-axis).
(a) Less than Ogive: To plot a less than ogive, the data is arranged in ascending
order of magnitude and the frequencies are cumulated from the top i.e., adding.
Cumulative frequencies are plotted against the upper class limits. Ogives under
this method, gives positive curve
(b) Greater than Ogive: To plot this ogive, the data are arranged in the ascending
order of magnitude and frequencies are cumulated from the bottom or subtracted
from the total from the top. Cumulative frequencies are plotted against the lower Notes
class limits. Ogives under this method, gives negative curve
Uses: Certain values like median, quartiles, deciles, quartile deviation, co-efficient
of skewness etc. can be located using ogives. Ogives are helpful in the comparison of
the two distributions.
Illustration - 1
Draw less than and more than ogive curves for the following frequency distribution
and obtain median graphically. Verify the result.
Solution:
20 5 100 0
40 17 95 20
60 35 83 40
80 60 65 60
100 75 40 80
120 87 25 100
140 95 13 120
160 100 5 140
180
160
140
120
80
60
40
More than
20
0 X
20 40 60 80 100 120 140 160 180
Notes Illustration - 2
Find the median from the following series. Also draw less than ogive, more than ogive
and locate median on a graph.
Income (`) No. of Persons
0-20 82
20-40 112
40-60 150
60-80 95
80-100 48
Solution:
C.I. f Class l.c.f Class m.c.f.
(Less than) (More than)
0-20 82 20 82 0 487
20-40 112 40 194 20 405
40-60 150 60 344 40 293
60-80 95 80 439 60 143
80-100 48 100 487 80 48
No. of Persons
20 40 60 80 100
Median 50 Median Income
1.13 Summary
Statistics is used as general name for a large group of mathematical tools, not aiming
at absolutely accurate results but approximate results based on the theory of probability,
used to collect, analyse and interpret numerical facts for solving specific problems. Facts
that one dealt with in statistics must be capable of numerical expression. Otherwise they
do not come within the preview of statistics. Statistics is also concerned with a group
of data as for example, population of a country, sales price of the finished goods produced
by a concern etc.
One of the most convincing and appealing ways in which statistical results may be Notes
represented is through graphs and diagrams.
Subdivided bar diagram, each bar representing the magnitude of given value is further
subdivided into various components. Each component occupies a part of the bar
proportional to its share in the total.
Multiple bar diagram two or more set of related data are represented the components
are shown as separate adjoining bars. The height of each bar represents the actual value
of the component. The components are shown by different shades or colours.
Deviation bars are used to represent net quantities - excess or deficit i.e. net profit,
net loss, net exports or imports etc. Such bars have both positive and negative values.
Positive values lie above the base line and negative values lie below it.
Notes 3. Procedures of descriptive statistics and control charts which are used to improve
process are classified as
[a] Statistical tools
[b] Parallel tools
[c] Serial tools
[d] Behavioral tools
4. A circle in which sectors represents various quantities is called –
[a] Non of these
[b] Frequency Polygon
[c] Pie Chart
[d] Histogram
5. ___________ are used when you want to visually examine the relationship
between two quantitative variables.
[a] Bar graphs
[b] Pie graphs
[c] Line graphs
[d] Scatter plots
6. A Histogram contains a set of
[a] Adjacent rectangles
[b] Adjacent triangles
[c] Non Adjacent Rectangles
[d] Adjacent squares
7. Component bar charts are used when data is divided into
[a] Circles
[b] Parts
[c] Groups
[d] None of these
1.20 Bibliography
1. Srivastava V. K. etal – Quantitative Techniques for Managerial Decision Making,
Wiley Eastern Ltd
2. Richard, I.Levin and Charles A.Kirkpatrick – Quantitative Approaches to
Management, McGraw Hill, Kogakusha Ltd.
3. Prem.S.Mann, Introductory Statistics, 7th Edition, Wiley India, 2016.
4. Aczel A.D. and Sounderpandian J., “Complete Business Statistics”, 6th edition,
Tata McGraw Hill Publishing Company Ltd., New Delhi, 2012.
5. Anderson D.R., Sweeney D.J. and Williams T.A., Statistics for business and
economics, 11th edition, Thomson (South – Western) Asia, Singapore, 2012.
6. Budnik, Frank S Dennis Mcleaavey, Richard Mojena – Principles of Operation
Research - AIT BS New Delhi.
7. Sharma J K – Operation Research- theory and applications-Mc Millan,New Delhi
8. Kalavathy S. – Operation Research – Vikas Pub Co
9. Gould F J – Introduction to Management Science – Englewood Cliffs N J Prentice
Hall.
10. Naray J K, Operation Research, theory and applications – Mc Millan, New Dehi.
11. Taha Hamdy, Operations Research, Prentice Hall of India
12. Tulasian: Quantitative Techniques: Pearson Ed.
13. Vohr.N.D. Quantitative Techniques in Management, TMH.
11. Stevenson W.D, Introduction to Management Science, TMH.