Professional Documents
Culture Documents
INTRODUCTION
people in the state…. specially those facts which can be stated in number
Webster
According to Horace Secrits, “Statistics are aggregates of facts
affected to a marked extent by multiplicity of causes numerically
expressed, enumerated or estimated according to reasonable
standards of accuracy, collected in a systematic manner for a
predetermined purpose and placed in relation to each other.”
The definition of statistics given by Horace Secrits is quite exhaustive and
on analyse, it makes clear that as numerical statements of facts, Statistics
should process the following characteristics:
4 1-50
(i) Statistics are concerned with “population” of facts rather
than with a single fact: A single age, height or sale does not
constitute statistics. A “set” “series” or “mass” of accidents,
sales, ages form statistics and they may be studied in relation to
time, place and number of occurances.
(ii) Statistics are affected to a marked extent by a
multiplicity of causes: Supposse, Cholera has broken out in an
area. The disease may have spread through a number of possible
sources, such as water, milk, vegetables etc. Production
statistics are affected by climate, soul, monsoon, fertility etc.
(iii) Statistics must be numerically expressed, enumerated or
estimated: Statistical methods apply only to date which are
capable of reducing to quantitative forms. Qualitative
expressions such as beauty, health intelligence are not statistics
unless a numerical equivalent is assigned to each expression.
5 1-50
(iv) Statistics must be enumerated or estimated or
approximated according to reasonable standards of
accuracy: Reasonable standard of accuracy depend on the
purpose for which the statistics are being collected. No hard
and fast rule can be laid down for all cases. But one thing is
certain, i.e., personal bias and prejudices of the investigator
should not enter into the enumeration or estimation of
figures, otherwise conclusions from the figures would not
be accurate.
(v) Statistics should be collected in a systematic manner
for a predetermined purpose: The purpose of a series of
weights and heights of a particular class in a school may be
to find out whether the students having more weights are
taller or not. In the absence of a predetermined purpose, the
6
data1-50will have to go to the waste paper basket.
Business Statistics and Their Uses
9 1-50
(3) Presentation of data: Data may be
presented in the form of (i) Statistical tables
and (ii) diagrams and graphs.
Information
Data that are transformed into useful facts that can be used for a
specific purpose, such as making a decision
11 1-50
Data
Data: Raw facts or measurements of sales
12 1-50
Information
Analyzing the data can provide information for
decision making
Table |Sales Score Data
Date Score
13/6 94
20/6 96
27/6 93 What will be the average sales
10/7 89 scores?
16/7 86
24/7 89
13 1-50
The Sources of Data
Primary data :
Data collected first time relating to a particular problem on any
area of knowledge. So the data collected for the first time by
the investigations as original data are known as Primary data.
Secondary data:
Secondary data is that kind of data, that are already
published for the purposes other than the problem under
study. So when as investigation is using data which have
been collected by someone else, then such data are known as
secondary data.
14 1-50
The Sources of Data
Primary data Secondary data
Advantages: Advantages:
• collected by the person or • Readily available
organization who uses the • Less expensive to collect
data
Disadvantages:
Disadvantages: • No control over how the
• Can be expensive and data was collected
time-consuming to gather • Less reliable unless
collected and recorded
15 1-50 accurately
Methods of Collecting Primary Data:
(1). Interview by enumerators with a prepared
schedule or questionnaire: Under the method the
enumerator is provided with a prepared questionnaire
and he puts the questions to be informant and records
the answers. The informant does not fill up the
schedule himself but the enumerator fills it up.
Obviously, this methods needs qualified and trained
enumerators. Much of the success under this scheme
depends upon the standardization of the questions and
the skill and tactfulness of the enumerators.
16 1-50
Methods of Collecting Primary Data:
(2) Schedules to be filled in by the informants themselves:
Under this method the questionnaires are sent to the individual
respondents through mail with a request to fill them up and
send them back to the supplier. Under this method framing the
questionnaire is very important. Questions included in the
questionnaire should be simple, easy and self-explanatory. The
nature of questions should be such that man with average
intelligence can easily answer them. Usually, the answers to
questions term to be “yes” or “no” type and possible
alternative answers are quoted in the schedule. This method of
collection suffers from a number of drawbacks. The success of
the method depends upon the efficient preparation of the
questionnaire as well as the responsiveness of the informants.
17 1-50
Methods of Collecting Primary Data:
18 1-50
Methods of Collecting Primary Data:
(4) Direct personal observation: Under this
method the person interested in studying a
problem undertakes the task of personally
observing the phenomenon. In fact, the
investigation might himself get to the spot,
mix up with the people, may even live with
them and from such personal knowledge he
may take careful observations about the
condition of the people in all aspects or the
aspect in which the investigator is interested.
19 1-50
Methods of Collecting Primary Data:
(5) Indirect and Oral Investigation: This
method is used where information to be
collected are of complex type and a direct
approach for information may not produce the
desired result. The Investigator may have list of
questions and he interviews the persons well
informed of the phenomenon about which the
information is sought. Where the field of inquiry
is very vast, for example, a whole nation, a good
number of informed individuals may be called
upon to give their views on the problem.
20 1-50
Methods of Collecting Primary Data:
(6) Information through Local
Correspondents on Sources:
Sometimes Local correspondents are employed to
give the estimate of a particular phenomenon. They
provide their own estimates in their indigenous way
and these estimates from the basis of compilation.
This method is mostly employed in crop estimates.
This method has very little systematic basis and the
information obtained is also not very reliable.
21 1-50
Types of Variables
Categorical (qualitative) Variables have values
that can only be placed into categories, such as
“yes” and “no.”
Variables
Categorical Numerical
( Qualitative ) ( Quantitative )
Examples:
Discrete Continuous
Marital Status
Political Party
Examples: Examples:
Eye Color
(Defined categories) Number of Weight
Children Voltage
Defects per hour (Measured
23 1-50 (Counted items) characteristics)
Discrete vs. Continuous Data
Examples of Discrete data
Number of children per family
Number of cars listed per insurance policy
Vacation days per month
24
Classifying Data by Level of Measurement
Types of Data
Qualitative Quantitative
25 1-50
Nominal level G ender
Data that is
classified into
categories and
cannot be E ye
arranged in any Color
particular order.
Nominal data
26
Ordinal level: involves data arranged in some
order, but the differences between data values
cannot be determined or are meaningless.
27
Levels of Measurement
Interval level
Similar to the ordinal level, with the additional
property that meaningful amounts of differences
between data values can be determined. There is
no natural zero point.
Temperature on
the Fahrenheit
scale.
28 Levels of Measurement
Ratio level: the interval level with an
inherent zero starting point. Differences and
ratios are meaningful for this level of
measurement.
M ont hly income
of surgeons
Levels of Measurement
Levels of Measurement
A nominal scale classifies data into distinct categories
in which no ranking is implied.
Categorical Variables
Categories
Do you have a Yes, No
Facebook profile?
Growth, Value ,
Other
Type of investment
AT&T, Sprint,
Verizon, Other, None
Cellular Provider
30 1-50
Levels of Measurement (con’t.)
An ordinal scale classifies data into distinct categories
in which ranking is implied
31 1-50
Interval and Ratio Scales
32 1-50
Statistical Procedures
Descriptive Statistics
Procedures and techniques designed to describe data.
Collecting, summarizing, and displaying data.
.
Inferential Statistics
Tools and techniques that help decision makers to draw
inferences from a set of data. Making claims or conclusions
about the data based on a sample
1-33 1-50
Descriptive Procedures
Charts and graphs
Numerical measures
x i
Sum of all data values
Average
i 1
35 1-50
Inferential Procedures
Estimation
e.g., Estimate the population
mean weight using the sample mean
weight
Hypothesis Testing
e.g., Use sample evidence to test the
claim that the population mean weight
1-35
36
is 120 pounds
1-50
Time Series vs. Cross-Sectional Data
37
Time Series vs. Cross-Sectional Data
38
Descriptive and Inferential Statistics
Descriptive statistics
Collecting, summarizing, and displaying
data
Inferential statistics
making claims or conclusions about the
data based on a sample
39
Population vs. Sample
Population
• represents all possible subjects that are of interest in a
particular study
Sample
• refers to a portion of the
population that is
representative of the
population from which
it was selected
40 1-50
Data Is Collected From Either A Population or A
Sample
POPULATION
A population consists of all the items or individuals
about which you want to draw a conclusion. The
population is the “large group”
SAMPLE
A sample is the portion of a population selected for
analysis. The sample is the “small group”
41 1-50
Parameter vs. Statistic
• Parameter – a described characteristic about a population
• Statistic – a described characteristic about a sample
Population Sample