Professional Documents
Culture Documents
DETERMINATION
• Sampling
• Selecting cases
• Sample size determination
SAMPLING
WHAT IS SAMPLING ?
• Population
• Sampling criteria
• Sampling frame
a) Makes the research of any type and size
manageable;
b) Significantly saves the costs of the research;
c) Results in more accurate research findings;
d) Provides an opportunity to process the
information in a more efficient way;
e) Accelerates the speed of primary data collection.
6
WHAT IS A POPULATION ?
• Probability
• Non-probability
PROBABILITY (RANDOM) SAMPLING
METHODS
• Multi-stage sampling (also known as multi-stage cluster sampling) is a more complex form
of cluster sampling which contains two or more stages in sample selection.
• In simple terms, in multi-stage sampling large clusters of population are divided into
smaller clusters in several stages in order to make primary data collection more
manageable.
NONPROBABILITY (NONRANDOM)
SAMPLING
• Used when characteristics to be possessed by samples are rare and difficult to find.
• Involves primary data sources nominating another potential primary data sources to be used
in the research
• Establish a contact with one or two initial cases from the sampling frame. This stage is usually
the most difficult one.
• Request the initial cases to identify more cases
• Ask new cases to identify further cases (and so on)
• Stop when:
• a) Your pre-specified sample size has been completed;
• b) There are no further cases left;
• c) Pursuing further cases will make the project unmanageable due to the large size.
TASKS
big enough’ to
• Detect effect of expected magnitude
• To be also statistically significant
not be ‘too big’
Where we spend a lot of resources
HOW TO CALCULATE THE SAMPLE SIZE?
• Outcomes – 3 groups
a) two alternatives exist - Case/ dead/ alive / vaccinated
b) Two mutually exclusive alternatives- blood groups , religion
c) continuous variables- height / weight / BP
• The statistical method for sample size detection will depend on the outcome
we decided
THE DESIRED POWER
• Power is the probability that the null hypothesis will be correctly rejected i.e.
rejected when there is indeed a real difference or association.
• It can also be thought of as "100 minus the percentage chance of missing a real
effect" - therefore the higher the power, the lower the chance of missing a real
effect.
• Power is typically set at 80%, 90% or 95%. Power should not be less than 80%.
• If it is very important that the study does not miss a real effect, then a power of
90% or more should be applied.
THE DESIRED SIGNIFICANCE LEVEL
• The significance level is a cut-off point for the p-value, below which the null
hypothesis will be rejected and it will be concluded that there is evidence of an
effect.
• The significance level is typically set at 5%
• If the observed p-value is smaller than 5% then there is only a small probability
that the study could have observed the data it did if there was truly no effect,
and so it would be concluded that there is evidence of a real effect.
ERRORS ASSOCIATED WITH
SIGNIFICANCE TESTS
• This is the smallest difference between the group means or proportions (or
odds ratio/relative risk closest to unity) which would be considered to be
clinically or biologically important.
• The sample size should be set so that if such a difference exists, then it is very
likely that a statistically significant result would be obtained.
WHETHER ANALYSIS WILL INVOLVE
ONE- OR TWO-SIDED TESTS
• In a two-sided test, the null hypothesis states there is no effect, and the
alternative hypothesis (often implied) is that a difference exists in either
direction.
• Non-response is added
• Design effect
Basic formulae to calculate sample size are based on the assumption that simple
random sampling is being used.
•
DESIGN EFFECT
• Basic formulae to calculate sample size are based on the assumption that
simple random sampling is being used.
• In general for a given sample size the standard error tends to be higher for
other sampling methods compared to simple random sampling.
• Sampling scheme should be taken into consideration during analysis
X= Zα/2 *p*(1-p)
2 / E2
• P = population proportion
• Calculate the sample size for a determine the prevalence of DM in Sri Lanka
• Knowledge of NCD among school students is 40%. Calculate the sample size
DESCRIPTIVE STUDY TO ESTIMATE MEAN