Professional Documents
Culture Documents
Chapter 1 Review
david.chew@nus.edu.sg
1
A DMIN : A SSESSMENT
2
I N THIS TUTORIAL . . .
1. Sampling
3. Experiments
4. Observational studies
3
1 S AMPLING
• Population
A collection of units we wish to obtain a parameter of.
• Parameter
A numerical fact about a population. For example, mean, variance, range, etc.
• Census
An attempt to take measurement of EVERY unit in the population.
• Sample
A selection of units in a population, used to estimate a population parameter.
Done because it is cheaper, faster, or because a census is not possible.
4
• Sampling frame
To identify all units in a population (directly or via a variable with link to them), we construct a
list called the sampling frame. For example, a list of home addresses or physicians.
• Types of bias
– Selection bias: Some types of units excluded (due to imperfect sampling frame or bad
sampling plan).
– Non-response bias: Happens when not all selected units are contactable or willing to take
part. Results biased since non-respondents usually differ from respondents.
– Other types of bias: For example, understated responses about undesirable habits, phras-
ing of questions.
5
• Probability sampling plans
Definition: Probability of selecting each unit from the population is known (but may not be the
same for each unit!). This is used to avoid selection bias.
• Random error
– For any sample that is taken randomly, there is the element of random error present.
– The larger the sample size, the smaller the random error of sample estimate.
– If there is no bias, then the sample estimate will fluctuate about the parameter.
6
P ROBABILITY S AMPLING P LANS
7
Plan 3: Stratified random sampling
• Divide population into homogeneous subgroups (strata), e.g., faculties/schools in NUS.
• Take a random sample from each subgroup (stratum).
• Combine these samples to form the final sample.
Plan 4: Cluster sampling
• Divide population into naturally occurring subgroups (clusters), e.g., tutorial groups in
GEA1000.
• Take a random sample of clusters.
• Combine ALL units in selected clusters to form the final sample.
8
S AMPLING P LANS : A COMPARISON
9
W HEN WILL OUR ESTIMATION BE GOOD ?
• ensure response rate is not too low (say ≥ 70%) ← avoid non-response bias
10
2 VARIABLES
11
S UMMARY S TATISTICS
12
3 E XPERIMENTS
Comparison of outcomes between a treatment group and a control group, to see the effect of a treat-
ment.
• How similar must the treatment group and the control group be?
• How different must the treatment group and control group be?
• What if the control group and treatment group are different in other factors?
It becomes difficult to ascertain if the observed differences between both groups is due to the treatment.
Ask the question: “Are the groups different, aside from the treatment?"
13
E NSURING SIMILARITY: R ANDOM A LLOCATION
Thus the random allocation / assignment of subjects into control and treatment groups ensures that the only
difference between the two groups is due to the treatment.
Check out
https://david-chew.shinyapps.io/RandomAssignment/
14
AVOIDING BIAS : B LINDING
• the participants are blind to whether they are in the treatment or control group; XOR
• the evaluators are blind to whether they are assessing someone in the treatment or control
group.
• the participants are blind to whether they are in the treatment or control group; AND
• the evaluators are blind to whether they are assessing someone in the treatment or control
group.
15
T HE G OLD S TANDARD
• randomised controlled
— treatment and control groups are randomly assigned
• double-blinded
— both participants and evaluators are "blinded"
16
4 O BSERVATIONAL STUDIES
In an observational study, the investigators do not assign the subjects to either group.
17
A SSOCIATION
• With observational studies, we often need to grapple with the issue of a confounder.
18
C OMPARISON : E XPERIMENTS AND O BSERVATIONAL STUDIES
19