Professional Documents
Culture Documents
Session-6
Sampling
• Census Vs. Sampling
– Sampling is the selection of a subset (a statistical sample) of
individuals from within a statistical population to estimate
characteristics of the whole population
– Why Proper Sampling is important?
• Target Population
• Sampling Frame
– A representation of the elements of the target population. It consists
of a list or set of directions for identifying the target population
• Sampling technique & Sample Size
1
31-Jan-20
Sampling Techniques
Nonprobability Probability
Sampling Sampling
Convenience Sampling
• Convenience sampling attempts to obtain a sample of
convenient elements. Often, respondents are selected
because they happen to be in the right place at the right
time.
– use of students, and members of social organizations
– mall intercept interviews without qualifying the
respondents
– “people on the street” interviews
2
31-Jan-20
Judgmental Sampling
• Judgmental sampling is a form of convenience sampling in
which the population elements are selected based on the
judgment of the researcher.
– test markets
– purchase engineers selected in industrial marketing
research
– expert witnesses used in court
Quota Sampling
• Quota sampling may be viewed as two-stage restricted
judgmental sampling.
– The first stage consists of developing control categories, or quotas,
of population elements.
– In the second stage, sample elements are selected based on
convenience or judgment.
Population Sample
composition composition
Control
Characteristic % % Number
Sex
Male 48 48 480
Female 52 52 520
____ ____ ____
100 100 1000
3
31-Jan-20
Snowball Sampling
• In snowball sampling, an initial group of respondents is
selected, usually at random.
• After being interviewed, these respondents are asked to
identify others who belong to the target population of
interest.
• Subsequent respondents are selected based on the
referrals.
Sampling Techniques
Nonprobability Probability
Sampling Techniques Sampling Techniques
4
31-Jan-20
Systematic Sampling
• The sample is chosen by selecting a random starting point
and then picking every ith element in succession from the
sampling frame.
– For example, there are 100,000 elements in the population and a
sample of 1,000 is desired. In this case the sampling interval, i, is
100. A random number between 1 and 100 is selected. If, for
example, this number is 23, the sample consists of elements 23, 123,
223, 323, 423, 523, and so on.
10
5
31-Jan-20
Stratified Sampling
• A two-step process in which the population is partitioned
into subpopulations, or strata.
– The strata should be mutually exclusive & collectively
exhaustive in that every population element should be assigned to
one and only one stratum and no population elements should be
omitted.
– Next, elements are selected from each stratum by a random
procedure, usually SRS.
• A major objective of stratified sampling is to increase
precision without increasing cost.
• The elements within a stratum should be as homogeneous
as possible, but the elements in different strata should be as
heterogeneous as possible.
11
Cluster Sampling
• The target population is first divided into mutually exclusive
and collectively exhaustive subpopulations, or clusters.
• Then a random sample of clusters is selected, based on a
probability sampling technique such as SRS.
• For each selected cluster, either all the elements are
included in the sample (one-stage) or a sample of elements
is drawn probabilistically (two-stage).
• Elements within a cluster should be as heterogeneous as
possible, but clusters themselves should be as
homogeneous as possible. Ideally, each cluster should be a
small-scale representation of the population.
12
6
31-Jan-20
13
14
7
31-Jan-20
15
16
8
31-Jan-20
17
17
18
9
31-Jan-20
19
• Ho μ > 25
• H1 μ < 25
– Take action if Ho is rejected (H1 is accepted).
20
10
31-Jan-20
• Ho μ < 20
• H1 μ > 20
– Take action if Ho is rejected (H1 is accepted).
21
21
• Ho μ = 50
• H1 μ =/ 50
– Take action if Ho is rejected (H1 is accepted).
• Two-Tail Study
22
22
11
31-Jan-20
• Hypothesis (alternate)
– H1: Customers who are store loyal are less knowledgeable about
the shopping environment.
– H2: Store-loyal customers are more risk-averse than are non-loyal
customers.
– H3: Customers of Big Bazar are loyal.
23
– H0: Customers who are store loyal are more or as (at least as)
knowledgeable about the shopping environment as other
customers.
– H1: Customers who are store loyal are less knowledgeable about
the shopping environment.
24
12
31-Jan-20
Types of Hypotheses
Null
– H0: μ = 50
– H0: μ < 50
– H0: μ > 50
Alternate
– HA: μ =/ 50
– HA: μ > 50
– HA: μ < 50
25
25
26
26
13
31-Jan-20
Two-tail Study
27
27
28
28
14
31-Jan-20
29
29
• Note:
– A null hypothesis may be rejected, but it can never be accepted
based on a single test.
– In classical hypothesis testing, there is no way to determine whether
the null hypothesis is true.
30
30
15
31-Jan-20
31
31
Hypothesis Tests
Tests of Tests of
Association Differences
Median/
Distributions Means Proportions Rankings
32
32
16
31-Jan-20
Frequency Distribution
33
Frequency Distribution
• In a frequency distribution, one variable is considered at a
time.
– A frequency distribution for a variable produces a table of
frequency counts, percentages, & cumulative percentages for all
values associated with that variable.
34
34
17
31-Jan-20
Measures of Location
• Mean
– Most commonly used measure of central tendency.
– Used when data is in interval or ratio scale.
• Median
– Middle value when data are arranged in ascending or descending
order. It is the 50th percentile.
– When data is in Ordinal Scale & also interval or ratio scale
• Mode
– The value that occurs most frequently & represents the highest
peak of the distribution.
– Mode is a good measure of location when the variable is inherently
categorical or has otherwise been grouped into categories.
35
35
Measures of Variablity
• Variability is a measure of the dispersion or spread of
scores in a distribution.
– Variability ranges from 0 to ∝.
• Range
• Interquartile Range
• Variance
– Mean squared deviation from the mean. The variance can never be
negative.
• Standard Deviation
– Square root of the variance.
• Coefficient of variation
– Ratio of SD to the mean expressed as a percentage & is a unitless
measure of relative variability.
– Can be used with ratio scale only.
36
36
18
31-Jan-20
Symmetric Distribution
Skewed Distribution
37
• A positively skewed
distribution is a
distribution of scores
where a few outliers are
substantially larger (toward
the right tail in a graph)
than most other scores.
• A negatively skewed
distribution is a
distribution of scores
where a few outliers are
substantially smaller
(toward the left tail in a
graph) than most other
scores. 38
38
19
31-Jan-20
39
39
40
20