Professional Documents
Culture Documents
Anjan Mahanta
anjan_mahanta@hotmail.com
International Education Institute
Course Outline
Definitions, Scope and Limitations
Introduction to sampling methods
Collection of data, Classification and Tabulation
Frequency Distribution
Diagrammatic and Graphical Representation
Measures of Central Tendency
2
Chapter 1
Definitions, Scope and Limitations
3
Introduction
In the modern world of computers and information
technology, the importance of statistics is very well
recognized by all the disciplines.
Statistics has originated as a science of statehood and
found applications slowly and steadily in
◦ Agriculture,
◦ Economics,
◦ Commerce,
◦ Biology,
◦ Medicine,
◦ Industry,
◦ Planning, education and so on.
As on date there is no other human walk of life, where
statistics cannot be applied.
4
Origin and Growth of Statistics
5
Origin and Growth of Statistics
The word ‘ Statistics’ and ‘ Statistical’ are all derived from the
Latin word Status, means a political state.
6
Meaning of Statistics
The word ‘ statistic’ is used to refer to
◦ Numerical facts, such as the number of people living
in particular area.
◦ The study of ways of collecting, analysing and
interpreting the facts.
7
Definition by A.L.Bowley
“Statistics are numerical statement of facts in any
department of enquiry placed in relation to each other.”
8
Definition by Croxton and Cowden
“Statistics may be defined as the science of collection,
presentation analysis and interpretation of numerical data
from the logical analysis.”
◦ 1. Collection of Data
◦ 2. Presentation of data
◦ 3. Analysis of data
◦ 4. Interpretation of data
9
Functions of Statistics
◦ 1. Condensation
◦ 2. Comparison
◦ 3. Forecasting
◦ 4. Estimation
10
Scope of Statistics
◦ 1. Statistics and Industry
◦ 2. Statistics and Commerce
◦ 3. Statistics and Agriculture
◦ 4. Statistics and Economics
◦ 5. Statistics and Education
◦ 6. Statistics and Planning
◦ 7. Statistics and Medicine
◦ 8. Statistics and Modern applications
11
Limitations of Statistics
◦ Statistics is not suitable to the study of qualitative
phenomenon
◦ Statistics does not study individuals
◦ Statistics laws are not exact
◦ Statistics table may be misused
◦ Statistics is only, one of the methods of studying a
problem
◦
12
Exercises
◦
13
Exercises
◦
14
Assignments
◦
15
Assignments
◦
16
Assignments
◦
17
Project Example
PROJECT TITLE - WIMBLEDON
18
Information from Magazines
PROJECT TITLE - WIMBLEDON
19
Information from Newspaper
20
Information from Television
21
Information from Television
22
Information from Internet
23
Information from Internet
24
Examples
◦
25
Examples
◦
26
Chapter 2
Introduction to Sampling Methods
27
Introduction
Sampling is very often used in our daily life.
For example:
28
Population
In a statistical enquiry, all the items, which fall
within the purview of enquiry, are known as
Population or Universe.
29
Population
30
Population
Sometimes it is possible and practical to examine
every person or item in the population we wish to
describe. We call this a Complete enumeration,
or census.
31
Sampling
Statisticians use the word sample to describe a
portion chosen from the population.
A finite subset of statistical individuals defined in a
population is called a sample.
The number of units in a sample is called the sample
size.
32
Reasons for selecting a sample
Complete enumerations are practically impossible when
the population is infinite.
33
Sampling
Representing Size
Mathematically, A capital N is used to represent the size of a population.
Let’s say you want to find the average GPA of a student at your university.
Your university has 20,000 students, and you select 100 students and ask
them their GPAs.
34
Principles of Sampling
Principle of statistical regularity
Principle of Validity
Principle of Optimization
35
Methods of selection of samples
Simple Random Sampling
36
Methods of selection of samples
Simple Random Sampling
37
Methods of selection of samples
Stratified Random Sampling
38
Methods of selection of samples
Stratified Random Sampling
39
Methods of selection of samples
Systematic Random Sampling
40
Methods of selection of samples
Systematic Random Sampling
41
Lottery Method
This is the most popular and simplest method.
42
Lottery Method
43
Table of Random Numbers
A random number table is so constructed that all
digits 0 to 9 appear independent of each other with
equal frequency.
If we have to select a sample from population of size
N= 100, then the numbers can be combined three by
three to give the numbers from 001 to 100.
When the size of the population is less than thousand,
three digit number 000,001,002,….. 999 are assigned.
If any random number is greater than the population
size N, then N can be subtracted from the random
number drawn.
44
Table of Random Numbers
45
Table of Random Numbers
46
Random Number selection using
calculators and computers
Random number can be
generated through scientific
calculator or computers.
For each press of the key get a
new random numbers.
The ways of selection of sample
is similar to that of using
random number table.
47
Merits of using random numbers
Personal bias is eliminated as a selection depends solely on
chance.
A random sample is in general a representative sample for a
homogenous population.
There is no need for the thorough knowledge of the units of
the population.
The accuracy of a sample can be tested by examining another
sample from the same universe when the universe is
unknown.
This method is also used in other methods of sampling.
48
Limitations of using random numbers
Preparing lots or using random number tables is tedious
when the population is large.
When there is large difference between the units of
population, the simple random sampling may not be a
representative sample.
The size of the sample required under this method is more
than that required by stratified random sampling.
It is generally seen that the units of a simple random sample
lie apart geographically. The cost and time of collection of
data are more.
49
Stratified Random Sampling
There are two types of stratified sampling. They
are proportional and non-proportional.
In the proportional sampling 20 equal and
proportionate representation is given to
subgroups
The population size is denoted by N and the
sample size is denoted by ‘ n’
The sample fractions is a constant for each
stratum. That is given by n/N = c.
50
Example
51
Stratified Random Sampling
Merits
It is more representative.
It ensures greater accuracy.
It is easy to administer as the universe is sub - divided.
Greater geographical concentration reduces time and
expenses.
When the original population is badly skewed, this method
is appropriate.
For non – homogeneous population, it may field good
results.
52
Stratified Random Sampling
Limitations
To divide the population into homogeneous strata,
it requires more money, time and statistical
experience which is a difficult one.
53
Systematic Sampling
This method is widely employed because of its
ease and convenience.
A frequently used method of sampling when a
complete list of the population is available is
systematic sampling.
It is also called Quasi-random sampling.
54
Systematic Sampling
Selection Procedure
The whole sample selection is based on just a random start.
55
56
Systematic Sampling
Merits
This method is simple and convenient.
Time and work is reduced much.
If proper care is taken result will be accurate.
It can be used in infinite population.
57
Systematic Sampling
Limitations
Systematic sampling may not represent the
whole population.
There is a chance of personal bias of the
investigators.
58
Exercise - 2
59
Exercise - 2
60
Exercise - 2
61
Chapter 3
Collection of Data, Classification and Tabulation
62
Collection of Data, Classification and Tabulation
Introduction
63
Collection of Data, Classification and Tabulation
Nature of Data
Time Series Data
64
Nature of Data
Time Series Data
It is a collection of a set of numerical values, collected
over a period of time.
65
Nature of Data
Spatial Data
If the data collected is connected with that of a place,
then it is termed as spatial data.
66
Nature of Data
Spacio Temporal Data
If the data collected is connected to the time as well as
place then it is known as spacio temporal data.
67
Categories of Data
Primary Data
68
Categories of Data
Secondary Data
69
Classification of Data
Objectives of Classification
It condenses the mass of data in an easily assimilable
form.
It eliminates unnecessary details.
It facilitates comparison and highlights the significant
aspect of data.
It enables one to get a mental picture of the information
and helps in drawing inferences.
It helps in the statistical treatment of the information
collected.
70
Types of Classification
Chronological Classification
In chronological classification the collected data are arranged
according to the order of time expressed in years, months, weeks, etc.,
71
Chronological Classification
72
Types of Classification
Geographical Classification
In this type of classification the data are classified according to
geographical region or place.
73
Types of Classification
Qualitative Classification
In this type of classification data are classified on the basis
of same attributes or quality like sex, literacy, religion,
employment etc., Such attributes cannot be measured
along with a scale.
Examples:
74
Types of Classification
Quantitative Classification
Quantitative classification refers to the classification of
data according to some characteristics that can be
measured such as height, weight, etc.
Examples:
◦ He has 4 legs
◦ He has 2 brothers
◦ He weighs 25.5 kg
◦ He is 170 cm tall
75
Types of Classification
76
Tabulation
Tabulation is the process of summarizing classified or
grouped data in the form of a table so that it is easily
understood and an investigator is quickly able to locate
the desired information.
77
Preparing a Table
An ideal table should consist of the following main parts:
◦ 1. Table number
◦ 2. Title of the table
◦ 3. Captions or column headings
◦ 4. Stubs or row designation
◦ 5. Body of the table
◦ 6. Footnotes
◦ 7. Sources of data
78
A model structure
79
Types of Table
Simple or one-way table
◦ A simple or one-way table is the simplest table which
contains data of one characteristic only.
80
Types of Table
Two way table
◦ A table, which contains data on two characteristics, is
called a two way table.
81
Types of Table
Manifold table
◦ A table ,which has more than two characteristics of data
is considered as a manifold table.
82
Exercise - 3
83
Exercise - 3
84
Data Collection Project
Prepare a questionnaire
Collect the data
Separate the data into qualitative and quantitative
types
Display data in graphs
Project Examples –
◦ 24 hour activities of our students
◦ Money spending habits of our students
85
Chapter 4
Frequency Distribution
86
Frequency Distribution
Introduction
Frequency is how often something occurs.
87
Frequency Distribution
Introduction
By counting
frequencies we can
make a Frequency
Distribution table.
88
Frequency Distribution
A frequency distribution is constructed for three main reasons:
89
Frequency Distribution
90
Frequency Distribution
Discrete (or) Ungrouped frequency distribution:
91
Frequency Distribution
Discrete (or) Ungrouped frequency distribution:
92
Frequency Distribution
Exercise - I
93
Frequency Distribution
Continuous frequency distribution: In this form of distribution
refers to groups of values.
Wage distribution of 100 employees
94
Nature of Class
Class Limit
◦ The class limits are the lowest and the highest values that can
be included in the class. For example, take the class 30-40.
The lowest value of the class is 30 and highest class is 40.
95
Nature of Class
Class Limit
◦
96
Nature of Class
Class Interval
◦ The class interval may be defined as the size of each grouping
of data. For example, 50-75, 75-100, 100-125… are class
intervals.
97
Nature of Class
Width or size of the class interval
◦ The difference between the lower and upper class limits is called
Width or size of class interval and is denoted by ‘ C’ .
98
Nature of Class
Range
◦ The difference between largest and smallest value of the
observation is called The Range
◦ It is denoted by ‘ R’
R = Largest value – Smallest value
R=L-S
99
Nature of Class
Mid-value or Middle Point
◦ The central point of a class interval is called the mid
value or mid-point.
◦ Mid-Value = L + U / 2
100
Nature of Class
Frequency
◦ Number of observations falling within a particular class
interval is called frequency of that class.
101
Nature of Class
No of class intervals
◦ The number of class interval in a frequency is matter of
importance.
◦ The number of class interval should not be too many.
◦ For an ideal frequency distribution, the number of class
intervals can vary from 5 to 15.
102
Nature of Class
Method of determining class intervals
◦
103
Nature of Class
Size of Class Interval
◦
104
Types of Class Intervals
1.Exclusive Method
When the class intervals are so fixed that the upper limit of
one class is the lower limit of the next class; it is known as
the exclusive method of classification.
◦
105
Exclusive Method
106
Types of Class Intervals
2.Inclusive Method
In this method, the overlapping of the class
intervals is avoided.
Both the lower and upper limits are included in
the class interval.
107
Inclusive Method
108
Types of Class Intervals
3. Open end classes
A class limit is missing either at the lower end of
the first class interval or at the upper end of the
last class interval or both are not specified.
◦
109
Open end classes
110
Constructing the frequency table
111
Preparation of frequency table
Example 1
112
Preparation of frequency table
Example 1
113
Preparation of frequency table
Example 1
114
Preparation of frequency table
Example 2
115
Preparation of frequency table
Example 2
116
Preparation of frequency table
Example 2
117
Percentage frequency table
118
Cumulative frequency table
Cumulative frequency distribution has a running total of the values. It
is constructed by adding the frequency of the first class interval to the
frequency of the second class interval.
119
Exercise
120
Exercise
121
Quiz
https://www.mathsisfun.com/data/frequency-distribution.html
122
Chapter 5: Graphs
What is a graph?
It is a diagram that exhibits a relationship, often functional,
between two sets of numbers as a set of points having
coordinates determined by the relationship.
123
Graphs
Origin (0,0)
Example
124
Types of Graphs
Line Graph
A line graph is useful in displaying data or
information that changes continuously over time.
The points on a line graph are connected by a line.
Another name for a line graph is a line chart.
125
Types of Graphs
Bar Graph
A bar graph is a chart that uses either horizontal
or vertical bars to show comparisons among
categories.
126
Types of Graphs
Pie Chart
A pie chart (or a circle chart) is a circular
statistical graphic, which is divided into slices to
illustrate numerical proportion.
127
Review
128
Creating Graph in Excel
Enter the following information in Excel
Worksheet to create a Line Graph
129
Creating Graph in Excel
Enter the following information in Excel
Worksheet to create a Bar Graph
130
Creating Graph in Excel
Enter the following information in Excel
Worksheet to create a Pie Chart
131
Chapter 6
132
Measures of Central Tendency
•It is not possible to grasp any idea about the characteristic
133
Measures of Central Tendency
• That number must be a good representative one for all the
134
Arithmetic Mean
•
135
Arithmetic Mean
•
136
Arithmetic Mean
•
137
Arithmetic Mean
•
138
Arithmetic Mean
•
139
Arithmetic Mean
•
140
Arithmetic Mean
•
141
Arithmetic Mean
•
142
Arithmetic Mean
•
143
Median
• The median is that value of the variant which divides the
group into two equal parts, one part comprising all values
greater, and the other, all values less than median.
Arrange the given values in the increasing or decreasing
order.
If the number of values are odd, median is the middle
value.
If the number of values are even, median is the mean of
middle two values.
144
Median
• Example of ODD Number
145
Median
•
146
Median
• Even Number
147
Median
•
148
Median
•
149
Mode
• The mode refers to that value in a distribution, which
occur most frequently. It is an actual value, which has the
highest concentration of items in and around it.
150
Mode
•
151
Mode
•
152
Mode
•
153
Mode
•
154
Mode
•
155
Exercises
•
156
Answers
•
157