Professional Documents
Culture Documents
Sampling
(based largely on PS Mann Appendix 1)
1
Recall the 5 steps in Statistical
Investigation
• Question
• Sampling procedure
• Collection of data
• Make generalizations
• Decide on reliability
2
Why Sample?
• Why not take a census?
3
Sample Objectives
• To be as representative as possible of the
underlying population
• Non-Random sampling
– Quota sampling
5
Population versus Sample
• Recall the difference between a Population
and a Sample of a Population.
7
Example / Activity
• I am interested in the ages of first year students at
UWI. I ask the ages of my ECON 1005 class.
– What is the population?
– What is the sample?
– Do you see any problems with this sample?
• Examples
– The population census data
– Telephone directory
– Students registration database at the UWI
9
Activity
• Name the populations that
correspond to the following sampling
frames:
–The population census data
–Telephone directory
–Students registration database at
the UWI
–VAT database
10
Activity
• Explain how you will develop the
sampling frame for each of the following
populations:
– Level I Courses offered by the Faculty of Social
Sciences
– Citizens of CARICOM countries
– Level I students of the Faculty of Social Sciences
11
Sampling frame vs Sample – Situation 1
Suppose that we want to measure the proportion of cricket fans across
the Caribbean Region who support the West Indies cricket team
• In this case, I would have measured the support for the Windies
team across a subset of the population i.e. a sample of the
population.
12
With Replacement or Without Replacement?
• A sample may be selected with replacement or without
replacement.
• In sampling with replacement, each time we select an element
from the population, we put it back in the population before we
select the next element of the sample
• Therefore, the population contains the same number of items
each time a selection is made, and we can also select the same
item more than once
• Sampling without replacement occurs when the selected
element is not replaced in the population
• Therefore, each time we make a selection, the size of the
population is reduced by one element. We cannot select the
same item more than once.
• Most times, sample taken in statistics are without replacement.
13
Census vs Sample Survey
14
Challenge in Sampling
• Consider the measurement of support for the
Windies team.
• While we wish to only measure the support
among a subset of the population for reasons
discussed above, we wish that the resulting
proportion of fans will be almost the same as if
we had conducted a census.
• We can only achieve such a result if the sample is
representative of the population.
• We might say that this is the challenge of
sampling.
15
Activity
• Create a statistical investigation around Level I
of the Faculty of Social Sciences
16
Random vs Non-random Samples
• A sample may be random or nonrandom.
• A random sample is a sample that has been
drawn so that each element of the population
has a measureable chance of being selected for
the sample.
• If a sample does not assign a chance of being
included, to a specific set of elements within the
population, it is nonrandom.
• It is important to note that the randomness is in
the procedure and any corruption of that
procedure is likely to “corrupt” the randomness,
thereby introducing selection bias.
17
Activity
• Choosing a random sample is similar to drawing names
from a hat.
19
Sampling Frames and Random Sampling
• No matter what random sampling design is selected,
the actual implementation of the exercise and the
explicit choice of the individual members of the sample
cannot be carried out without a sampling frame.
20
Sampling Frames and Random Sampling
21
Situation 2
Suppose, for example, that I want to determine the
average age of the statistics class of 300 students by
using a random sample of 10 students drawn from
the class.
22
Simple Random Sampling
What is it?
• A simple random sample is a sample that has
been drawn so that each element of the
population has an equal chance of being
selected for the sample.
23
Activity
• What if I were to pick a sample from this class at this
moment, by “randomly” choosing students?
– I can walk along the aisle, choosing people with my hands
– I can stay at the podium, choosing people with my eyes
25
Simple Random Sample - Its Limitations
• Simple random samples can be very costly, for instance
when the population being sampled is distributed across a
wide (geographical) area.
• The stratification factor must be chosen so that the strata are mutually
exclusive. i.e. Each member of the population must be assigned to
exactly one stratum.
29
Stratified Random Sampling – Its Advantages
• The stratified random sampling design makes use of knowledge of the
population to increase the precision of the results obtained from the
sample.
• The chance of any individual being drawn is still measurable and all
possible samples of equal size still have the same chance of selection. So,
although the choice of the stratification factor is almost entirely dependent
on human judgment, the procedure still maintains an element of
randomness to it, especially since the mode of selection within each
stratum is clearly random.
31
Best Results from Stratified Random Sampling
• Stratified random sampling produces its best
results when the variation within each
stratum/group is small compared to the variation
between strata/groups.
• When the within-group variation is small, it will
provide results nearly identical to those of Simple
random sampling.
• Go back to the examples of age of an ECON 1005
student, and height of an ECON 1005 student,
and the choice of the stratification factors.
Describe the within-group and between-group
variations. 32
Stratified vs Simple Random Sampling
• Return to the Examples above; in the extreme
case, simple random sampling could have
resulted in an all-male or all female selection / an
all FT or EU selection, causing, in either case, a
misleading result.
33
Cluster or Multistage Sampling
What is it?
• The cluster sampling design involves the
division of the population into groups called
clusters
• A simple random sampling process is then
applied to select a sample of clusters.
• All members of the selected cluster(s)
constitute the cluster sample.
34
Cluster Sampling - Example
36
Cluster Sampling - Its Advantages
37
Cluster Sampling - Its Limitations
39
Just so you know…
• In your reading you may also encounter
systematic sampling, probability proportional
to size sampling, panel surveys.
40
Moving on…
• We have looked at the relevant random
sampling techniques. Do you recall why we
started discussing them? Remember the 5
stages?
• Now we consider non-random sampling
techniques.
• Quota sampling
• Convenience sampling
• Judgement sampling
41
Quota Sampling
What is it?
• As its name implies, this method involves the
identification of a quota(s) that the selected
sample is required to fulfill.
• This quota is based on the diversity of the
population, and so represents an attempt to
ensure that the population is truly represented by
the resultant sample.
• Once the quota has been established, the actual
selection of the members of the sample is left up
to the discretion of the enumerators/interviewers
(the people who conduct the survey).
42
Quota Sampling - Example
• We may wish to draw a sample of 500 investors in
Credit Unions in your country but 200 must be men;
175 must be women; and 125 must be persons under
the age of twenty-five.
43
Quota Sampling - Its advantages
44
Quota Sampling - Its Limitations
45
Convenience and Judgment Sampling
46
Errors in Sampling
• Sampling Error
• Non-Sampling Error
47
Sampling Error
48
Non-sampling Error
• Non-sampling Error is due to mistakes made by the
researcher/ interviewers in the acquisition of the data
or due to the improper selection of the sample. This
type of error is within the control of the
researcher/interviewer.
60
Methods of Data Collection
• Direct Observation
• Experiments
• Surveys
– Personal Interviews
– Telephone Interviews
– Self Administered Survey
– Mail Survey
– Internet Survey
61
Instruments
• Questionnaire
- Keep it short and simple; avoid open-ended
questions; leave options for “Other”; should be
anonymous
• Observation schedules
• Experiments
-controlled and treatment group
• Interviews
- formal or informal; predetermined list of
questions
Research is being conducted to determine the influence of exercise on cholesterol
level.
Questionnaire
1. Name:…………………………………………….
2. Sex: ………………………………………………
3. Do you exercise?…………………………………
4. What type of exercises do you do?……….……………..
5. Do you exercise at least 3 times per week for 30 minutes or
more?……………………………..
6.What is your total cholesterol level?………………………
• These are Lecture Outlines only, which means you have to use
these broad topics, read the relevant chapter in the Mann and
do a complete review.
67