Professional Documents
Culture Documents
The subject matter of statistics has to do a great deal with the ‘science’ of statecraft. The very word
“statistics” is said to have been derived from, say the Latin “status”, Italian “statista” and so on, but
obviously, only an organized and strong body of the state could venture into collection of statistics
which in the past was mainly on population, its composition and wealth or poverty, etc.
Statistics in modern times is not a mere tool of state administration; it has become a fact of day-to-day
life. ‘Statistics’ is being used both as a singular noun and a plural noun.initaily,a distinction was drawn
while as a singular noun, it represented a method of study based on analysis and interpretation of
facts.but,modern literature on the subject does away with any such distinction. Now ‘statistics’ can
signify ‘data ‘even when used as singular noun in which case it would be treated as a group noun. The
actual meaning of ‘Statistics’ in each case shall be constructed from the context in which it has been
used.thus, the word”Statistics” may mean any one of the following:
But, only the first two of these, being more relevant to general purpose, are given greater prominence.
The illustrations on the three possible uses are as follows:
• Production statistics is compiled for judging the process of a business firm (i.e. here ‘statistics’
has been used for ‘data’).
• Statistics helps in simplification, analysis and presentation of data(here ‘statistics’ has been
used to represent statistical method),
• Statistics derived from a small representation group taken from the whole lot used for
drawing inference about characteristics of the whole (here ‘statistics’ represent measure
based on sample observations.)
Definition:
The purpose of definition is to lay down precisely the meaning, the scope and the limitation of a subject.
This may not be served if there are many definition emphasizing one or more of its element, so statistics
have been grouped below under two main head:
• Statistics as data.
• Statistics as methods.
Statistics as data:
Some of the definitions of statistics as a data are described as quantitative facts which according to
Webster, he defines that:
“Classified facts respecting the condition of the people in a state especially those facts which can
be stated in numbers or in any way other tabular or classified arrangement.”
• Facts, especially the numerical ones. The non-numerical facts obviously cannot be
processed statistically.
• Facts which concern only the condition of the people in the state like emphasizing on
restricting the application of statistics to human science.
• Lastly statistics as data are classified and tabulated. It is necessary to bear in mind that no
valid comparisons are possible without proper arrangement of data and the use of the
statistical methods on such data.
Numerical values are commonly described as statistics, if it possesses any of the following:
• Descriptive statistics.
• Analytical statistics.
• Inductive statistics.
• Inferential and applied statistics.
Descriptive statistics:
Descriptive statistics include different types of methods like collection and tabulation, averages,
trends, index numbers, dispersion and skewness which help in summarizing and describing the main
feature of the data. The descriptive measures are basic to any statistical analysis. They sometime
constitute the preliminary steps to final analysis and interpretation. For example, mean and standard
deviation are important for deriving coefficient of correlation.
Analytical statistics:
This include methods like association of attributes, correlation, regression, etc., which help in
establishing functional relation between variables. Analysis is often attempted through comparisons
between two or more set of data, thus, the methods which help in making valid comparison are
included in analytical statistics.
Inductive statistics:
It’s a statistical method that help to generalization the random observation, also inductive statistics
help in estimating partial data.
Inferential statistics:
Inferential statistics help in drawing inferences on the characteristics of the population, on the basis
of sample are covered under ‘Inferential statistics’ where hypothesis of a sample are tested on the basis
of population parameters.
Applied statistics:
The applied statistics consists of application of statistical methods and technique to the problem.
Methods included in applied statistics are quality control, sample, surveys, quantitative analysis for the
business decision, etc.
Functions of statistics:
Statistics are important for the decision making and base on these, principles are formulated for
analyzed quantitative facts. To be more precise, some of the important uses of statistics are listed
below:
Statistics is widely used in modern times. Initially in olden days, it was employed by the state to collect
information on public affairs, but now a days, its is used in all scientific experiment where observations
were to be recorded and compared so that valid inferences could be drawn on important matters. The
significance of any hypothesis can also be judge through statistical methods. Statistics has been
extensively used in biology, meteorology and research number of important investigations in the field of
economics and commerce are largely statistical.
In short, statistics finds use whenever a large quantitative data needs simplification and analysis is
involved, or where estimates have to be made in regard to certain unknown and uncertain events on the
basis of past experience or on available information. Statistics finds use whenever a mass of quantitative
data needing simplification and analysis is involved, or where estimates have to be made in regard to
certain unknown and uncertain events on the basis of past experience or on available information.
Following are field of requirement of statistics
1. In State Affairs
With the growth of State and concept of Welfare State has taken roots, it has to undertake
greater responsibilities in social matters. To fulfill these obligations it has to collect information
on the economic condition of the people, and resource available in the State.
2. In Economics
I. Compilation of National Income Accounts: These are aggregate accounts of the nation
containing systematically arranged information on certain macro variable like
production, income, expenditure, saving with rest of the world.
A modern business firm, even if it is operating on a moderate scale, has to plan its affairs
systematically. Some important aspects of business planning are given below:
A decision on location from amongst various alternative places will depend on the
collection and analysis of information on the availability and cost of land, labour, raw
materials, etc. It is also be necessary to find out the relative cost of transportation on
the procurement of raw material and the distribution of the finished product.
It deals with the preparation of schedule for production of various types of goods to
avoid both over-stocking of goods and losses due to non-fulfillment of order in time.
4. Physical Science
Physical science use the statistics in increasing order and in includes in filed of astronomy,
chemistry, engineering, geology, meteorology and certain branches of physics
5. Natural Science
Statistical techniques have proved to be extremely useful in the study of all natural science like
astronomy, biology, medicine, meteorology, zoology, botany, etc. Example, in diagnosing the
correct disease the doctor has to rely heavily on actual data like temperature of the body, pulse
rate, blood pressure.
6. Research
Statistics in indispensable in research work. Most of the advancement in knowledge has taken
place because of experiment conducted with the help of statistical methods. Example,
experiment about crop yields and different fertilizers are frequently designed and analysed with
help of statistical methods.
Types of data collection:
Internal data.
External data.
Internal data:
Internal data comes from the internal records related to the operations of a business firm, records of
production, purchase and the accounting system. This is generally associated with the organizational
and functional activities of the firm. The internal data can be either insufficient or inappropriate for the
problem under investigation, thus we need external data to make decision.
External data:
The external data are collected and published by agencies external to the enterprise. The external data
can be collected either from the primary or the secondary source.
Primary data.
Secondary data.
Primary data:
Primary data are those statistical data which are collected for the first time and are original in nature.
Primary data are collected originally by the authorities who are required to collect them. The source
from which primary data are collected is called ‘primary source’. The method of collection of primary
data is also known as the ‘Primary Method of the collection of the data’. Primary data are collected for
the first time by the authorities who require the data for their own use and treatment. Data collected by
the field worker, investigator and enumerator are all primary data.
For example, the number of students enrolled in various courses and students graduating every year in
Sherubtse College published by the Royal university of Bhutan (RUB) are one of the examples of the
primary source of data.
Secondary data:
Secondary data are those statistical data which are collected and published by one organization and
subsequently treated and utilized by the other organizations. Secondary data are originally collected and
published by the organizations other than the authorities who require them subsequently for their use.
So secondary data of one organization become the primary data of other organizations who first
collected and publish them.
The source from which secondary data are collected is ‘Secondary source’. The method of collection of
secondary data is also known as ‘Secondary method of the collection of the data ’.
For example, the publications of different Trade association and Bhutan Chamber of Commerce (BCCI)
are the source of secondary data.
The difference between the primary and the secondary data is that data which is primary in the hands of
one may become secondary in the hands of others.however,the main distinction between two are listed
below on the basis of the criteria in table.
When the data is neither internally available nor there exist a suitable secondary source, the
expenditure on original collection becomes justified. There are many various methods to collect primary
data, but some of the most important collection of primary data is listed as below:
It is becoming very important for the need of reliable and adequate data for taking policy
decision in different fields of human activity. There are two ways of collecting required
information:
In complete enumeration survey method the data are collected for each and every unit like
person, household, field, shop, factory, etc belonging to the population or society which is the
complete set of items which are of interest in any particular situation.
The advantage of this method is that no unit is left out and hence greater accuracy are realized
and ensured. However, the effort, money and time required for carrying out complete
enumeration will be extremely large.
2. Sampling techniques
A sample is not studied for its own sake. The basis objective is to draw the proper judgment
about the population. Sampling is only tool which helps to know the characteristics of
population or universe by examining only small part of it.
Principle of Sampling
This principle is derived from the mathematical theory of probability. In the word of King, “The
law of statistical regularity lays down that a moderately large number of items chosen at
random from a large group are almost sure on the average to possess the characteristics of the
large group.” (Gupta, 1999, p43)
This principle points out that is a sample is taken at random from a population, it is likely to
possess almost the same characteristics as that of population. Through random selection we
mean a selection where each and every item of the population has an equal chance of being
selected in the sample. It is possible for one to predict fairly accurately the characteristics of the
population by studying only a part of it.
This principle is great practical significance because, it can reduce the considerable amount of
work and resource before the final conclusion is drawn regarding the population and universe.
This principle is the outcome of the principle of statistical regularity. It states that, other thing
being equal, larger the size of sample, more the accurate the results are likely to be. This is
because large numbers are more stable as compared to smaller one. The difference in
aggregate result is likely to be less when numbers in sample is large and therefore the variation
is insignificant.
Sampling Method:
The Various method of sampling can be grouped under the two broad heads:
Probability sampling methods are the methods in which every item in the universe has a known chance
of the item being chosen for the sample. In this case the selection of any item can be done randomly as
it has fewer chances for the variations of sample gathered. Personal bias or the investigator does not
influence the selection.
2) Simple Random Sampling:
Simple random sampling refers to that sampling technique in which each and every unit of the
population has an equal opportunity of being selected in the sample.
Lottery Method:
This is a popular method of taking a random sample. In this method, all items of the universe are
numbered on separate slip of identical size and shape of paper. Then it is mixed up in container and
blind selection is made. This selection of items thus depends entirely on chance. While adopting
lottery method, size of slips should be identical size. This is a simple random sampling method.
Systematic Sampling:
A systematic sample is formed by selecting one unit at random and then selecting additional unit at
evenly spaced intervals until the sample has been formed. This method is popularly used in those
cases where a complete list of the population from which sample is to be drawn is available. The list
may be prepared in alphabetical, geographical, numerical or some other order. The items are serially
numbered. The first item is selected at random generally by following the lottery method.
Non-probability sampling methods are the one which do not provide every item in the universe with the
known chance of being included in the sample. This process include little subjective for sample
collection.
Judgment Sampling:
In this method the investigator exercises his judgment in the choice and includes those items in the
sample which he thinks are most relevant of the universe with regard to the characteristics under
investigation. Example, if sample of five taxi driver is be selected from town for analyzing the
passenger travelled for a particular season, the investigator would select five drivers who in his
opinion are representative of the town.
This method is used in solving many types of economics and business problems. Some important
element may miss when small number of sampling are randomly selected. It is useful in everyday
business problems and making public policy decisions etc.
Quota Sampling
In quota sampling, quotas are set up according to some specified characteristics such as so many in
each of several income groups, so many in each age, so many with certain political or religious
affiliations etc. Each interviewer is then told to interview a certain number of persons which
constitute his quota. With the quota, the selection of sample items depends on personal judgment.
For example in radio listening survey, the interviewers may be told to interview 400 people living in
certain area and that out of every 100 person interviewed 60 are to be housewife, 40 farmers and
15 children under the age of 20. Within these quotas the interviewer is free to select the people to
be interviewed. The cost for interview may be small but there can be numerous chances of bias for
validation of the result. They may miss the person or person refuse to respond and interviewer
simply selects someone else. Because of bias and prejudice quota sampling are hardly used in
practical work.
Quota sampling is often used in public studies. The results are satisfactory and presentable if the
interviewer are well trained and follow instruction strictly.
Convenience Sampling
Hence the result obtained by following convenience sampling method can be hardly representative
of the population. They are generally biased and unsatisfactory. But it is used in making pilot
studies.
Advantage of sampling technique:
The sampling technique has following advantage over complete enumeration survey:
Considerable time and labor are saved when a sample survey is carried out as it is the
study of part of the population.
A total expense made in collecting data in a sample survey is generally less than that of
complete survey due the fact that, we study only a part of population.
The most convenient way to present the data and statistical results are through diagrams and graphs.
There are many ways in which statistical data may be displayed pictorially such as different type of
diagrams, graphs and maps. They give bird’s-eye of the entire data and information given is easily
understood. It helps in proper understanding of the data as it gives an interesting form to it.
The impressions created by diagrams last much longer than those created by the figures presented in a
tabular form. There are wide varieties of diagram used for data representation and few of diagram with
it data associated.
One-dimensional Diagram
Two-dimensional diagram.
One-dimensional Diagram:
They are called one dimensional because it is the length which contributes for actual data
representation and not the width of the diagram. They are readily understood, simplest and easiest to
make out the result represented. When large items are to be compared they are only the form that can
be used effectively. Example of one-dimensional diagram is bar diagram.
The different Types of Bar Diagrams are listed as shown below with the diagrams:
diagrams
2001 20
2002 78.2
2003 15
2004 54
Figure 1: The potato production from the khagma in a year from 2001-2004
2001
In multiple bar diagram two or more sets of data interrelated are presented. The only difference
from simple bar is that since more than one data are represented, different shades, colors, dots are
used to distinguish between the bars. It is used whenever a comparison between two or mor more
related variables is to be made.
Figure 2: The regional rainfall during the year 2008 to 2010 in Southern Dzongkhag .
Percentage bars are particularly used in statistical work which requires the portrayal of relative
changes in data. When such diagrams are prepared, the lengths are kept equal to 100 and segments
are cut in these bars to represent the percentage of the aggregate.
Net
Year Sales Profit
2002-
2003 20% 65%
2004-
2005 12% -35%
2006-
2008 9% -10%
2009-
2010 26% 72%
Sale and net profit for N.P General shop for the year 2002 to 2009
80% 65%
60% 43%
40% 26%
20%
20% 12% 9% Sales
0% Net Profit
-20% -10%
-40%
-35%
2002
2002-03 2004-05 2006-07 2008-09
Two-dimensional Diagrams::
i) Rectangle:
The area of rectangle is equal to the product of its length and width. To set the figure of given data
either ordinary method or percentage method can be adopted. Percentage method is used often as
it enable to made comparison on percentage basis.
Item o
of
Expenditure Student A Student
Ordinary % Ordinary %
Shoe 3000 20.13 2900 26.13
Clothing 4500 30.20 4000 36.04
Cosmetics 2000 13.42 1200 10.81
Food 5400 36.24 3000 27.03
Total 14900 100.00 11100 100.00
30000
25000
20000 Student B %
15000
10000
5000 Student B Ordinary
0
Student A %
Student A Ordinary
ii) Square:
Rectangle method gives difficulties in presenting data where the values of the items vary widely. In
case if the ratio of data is 1:10 or bigger, the diagram look unwieldy. To overcome the difficulties,
squares are used to represent the data having very gap between the correspondence figures.
III) Circle:
Another way of representing two-dimensional diagrams is in the form of circles. In such a diagram both
the total and component parts or sectors can be shown. The area of a circle is proportional to the square
of its radius. Circle can be used in all those cases in which squares are used. But circle are usually difficult
to compare and as such are not very popular in statistical works.
Pie diagram is used to show the breakup of a total into components parts. The pie chart is so called
because the entire graph looks like the pie, and the components resemble slices cut pie. While making
comparision, pie diagrams should be used on a percentage basis and not on an absolute basis. Since a
series of pie diagrams showing absolute figures would require that larger totals be represented by larger
circles.
Three dimensional Diagrams:
Three dimensional diagrams also known as volume diagram, consist of cubes, cylinders, spheres, etc.in
such diagrams, three things namely length, width and height have to be taken into account. Such
diagrams are used where the range of difference between the smallest and the largest value is very
large.
The examples of three dimensional diagrams are Cubes, Pictograms and Cartograms.