Professional Documents
Culture Documents
STEP BY STEP By Courtney Keeler, PhD, and Alexa Colgrove Curtis, PhD, MPH, FNP, PMHNP
A series coordinated by the Heilbrunn Family Center for Research Nursing at Rockefeller University
Case–Control Studies
Appropriate uses of and approaches to this form of
observational design.
Editor’s note: This is the eighth article in a series on clinical research by nurses. The series is designed to
give nurses the knowledge and skills they need to participate in research, step by step. Each column will
present the concepts that underpin evidence-based practice—from research design to data interpreta-
tion. The articles will be accompanied by a podcast offering more insight and context from the authors. To
see all the articles in the series, go to https://links.lww.com/AJN/A204.
L
ike cohort studies, case–control studies con- ies begin with an exposure and track subsequent
nect a health outcome with a specific expo- outcome manifestations.1, 2 This process can be
sure, allowing researchers to compare the either prospective (tracking future outcome mani-
association between the two. Here, we compare festations) or retrospective (reflecting on past expe-
cohort and case–control studies, discuss the selec- rience). Unlike cohort studies, case–control studies
tion of cases and controls, describe how one might begin with a specific disease or other health out-
evaluate whether an association exists between come and retrospectively track exposure. (Figure 1
them, and outline potential sources of bias. We details this distinction in study design.) More specif-
end by discussing when a case–control study might ically, case–control studies begin with two groups:
be appropriate. those with a specific disease or health outcome
(cases) and those without the specific disease or
COHORT VS. CASE–CONTROL STUDIES health outcome (controls). Next, researchers com-
Both case–control and cohort studies compare a pare exposure between those with and those with-
given exposure with a specific health outcome. Like out the disease or health outcome. In summary, in a
cohort studies, case–control studies are observa- cohort study one compares rates of disease, in a
tional, meaning that exposure is “observed” rather case–control study one compares rates of exposure.
than assigned by the researcher. For instance, sup- Let’s consider an example. As part of a larger
pose a researcher is interested in exploring the rela- cohort study, D’Souza and colleagues used a case–
tionship between smoking status and heart disease. control framework to investigate the association
In an observational study, the researcher would between human papillomavirus (HPV) and the patho-
“observe” smoking status within a sample, rather genesis of oropharyngeal squamous-cell carcinoma.3
than assign participants to smoking and nonsmok- The investigators recruited 300 participants: 100
ing groups. Indeed, this example highlights the patients with a newly confirmed diagnosis of oropha-
necessity of an observational study design in some ryngeal squamous-cell carcinoma and 200 controls.
contexts, since a researcher cannot ethically assign The control group met four important criteria; con-
subjects to dangerous exposure groups like smoking. trols had no history of cancer (1) and were treated for
As with all observational study designs, causal- benign conditions (2) in the same clinic as the cases (3)
ity is difficult to establish in case–control and cohort during the same time frame (4). The research team
studies. Instead, in interpreting study findings, the then assessed the presence of HPV in each group, col-
researcher assesses whether an association exists lecting specimens from the cases before treatment and
between an exposure and a given health outcome. The from the controls at enrollment.3
researcher cannot determine cause and effect because,
unlike in an experimental study such as a randomized CASE–CONTROL STUDIES
controlled trial, exposure is not randomly assigned, Identifying cases. Case–control studies begin
and subsequent health outcomes are not tracked. with the identification of cases. To identify cases,
In determining whether any association exists researchers need to create a working definition of
between an exposure and an outcome, cohort stud- the disease or health outcome of interest. They can
POPULATION POPULATION
NOT NO
EXPOSED EXPOSED DISEASE
DISEASE
rely on specific diagnostic criteria or use validated who manifest target outcomes (cases). In case–
survey instruments to assess the presence of disease control studies, controls must meet two important
or other health outcome. Before the study begins requirements. First, controls must be drawn from
(or a priori), researchers need to identify who will the same “at-risk” target population as cases, ensur-
and will not be included in the study sample, estab- ing that both cases and controls are exposed to sim-
lishing inclusion and exclusion criteria. Importantly, ilar observable and unobservable factors that influ-
researchers will need to ensure that cases are drawn ence the association of interest.5 Second, exposure
from the target population of interest, so the study status must not influence their selection. Further,
findings are generalizable to this group. that exposure should ideally be measured with com-
For instance, a researcher may be interested in parable precision across case and control groups.
understanding what exposures predisposed clini- Consider a researcher exploring the relationship
cal nurses toward severe cases of COVID-19 during between cigarette smoking and stroke among women
the pandemic, such as length of a typical shift; type ages 18 and older. The researcher should ensure that
of personal protective equipment used; clinical unit; both cases (those who experienced a stroke) and con-
or underlying health characteristics such as age, body trols (those who did not experience a stroke) were
mass index, or presence of pulmonary disease. Like drawn from the same at-risk population. Addition-
many health outcomes, COVID-19 severity could ally, in assessing smoking behaviors, the researcher
be measured in several ways, including by hospital- should gauge exposure with equal accuracy. Admit-
ization, duration of disease, and oxygen utilization, tedly, in this example, it would be difficult to collect
among other factors. Suppose the researcher defines equally accurate information from cases and controls
severe cases of COVID-19 as those requiring hospital- if case patients were unable to self-report smoking
ization with use of oxygen. With this case definition status. In this case, because the researcher might need
in mind, the researcher can select participants who to rely on secondary accounts from a partner, rela-
were hospitalized with COVID-19 over a fixed period tive, friend, or if available, medical records, the sec-
(such as March 1 through December 31, 2020). ond criterion would be hard to meet.
In a related example, Cai and colleagues investi- Identifying populations from which to draw con-
gated the association between frontline medical trols. A researcher has several options when select-
work in China during the pandemic and mental ing controls: population-based controls and hospi-
health outcomes.4 Medical workers were defined as tal- or clinic-based controls.1, 2 In some cases,
“physicians, nurses, and other healthcare workers researchers utilize both options (that is, they have
[such as] medical technicians, respiratory therapists, two control populations).
or emergency room attendants.” Cases were identi- Researchers seek to minimize the influence of con-
fied based on responses to a series of questionnaires founders by optimizing the matching of cases with
sent to individuals working in hospitals in China controls. When selecting controls, the researcher tries
between February 11 and 26, 2020, who met the to mirror cases in as many observable dimensions as
above definition of a “medical worker.” possible (for example, age, weight, or gender). One
Identifying controls. Criteria for selecting con- way to accomplish this is through individual or fre-
trols. Controls provide a comparison with those quency matching with population- and clinical-based
While the odds ratio provides some indication Broader consideration of confounders. Con-
of the relative magnitude of the effect, the above founding variables distort the observed association
calculation provides no information on statistical sig- between the exposure and the outcome of inter-
nificance. More sophisticated statistical methodology est. To the extent possible, researchers should con-
should be considered in assessing whether an asso- trol for all known confounders. For instance, in
ciation exists (however, such approaches are beyond the example of placental abruption, suppose the
the scope of this article). Researchers should consult researchers are exploring the relationship between
biostatisticians or other experts in developing their that event and workplace stress. They believe that
evaluation strategy. For instance, in the above exam- age is correlated with both workplace stress and the
ple, the researcher should interpret the odds ratio likelihood of placental abruption. To the extent that
with caution and not assume a positive association age remains unaddressed in the study design, it will
between frontline medical work during the pandemic confound (or distort) the researchers’ findings. One
and increased risk of depression without incorporat- solution might be for the researchers to stratify the
ing additional controls and statistical analysis. study sample into age groups, such as less than 35
Sources of bias. In designing a study, researchers and 35 and older (by definition, a “geriatric preg-
want to ensure the external and internal validity of nancy” occurs among women 35 and older). More
their work. Bias limits the validity of study results. broadly, the matching process will help control for
Selection bias limits the external validity of the confounding variables, as discussed above.
study by limiting the researcher’s ability to general-
ize findings to the target population of interest. WHEN IS A CASE–CONTROL STUDY APPROPRIATE?
Selection bias occurs when the population of Case–control studies allow researchers to investi-
interest is systematically different from the target gate a range of exposures and can be a useful start-
population.6 If the study participants are not rep- ing point in analyzing an association between expo-
resentative of the target population, then research- sure and health outcomes or when limited evidence
ers will not be able to generalize their findings back on the association is available. For instance, Park
to the target population. Returning to the exam- and colleagues argued that the limited literature
ple of severe pneumonia and obesity, suppose the regarding medication use and falls among nursing
researcher selected cases and controls from adults home residents in Japan made it important to con-
admitted to the ICU. Further suppose that both duct their case–control study on the topic.7 While
cases and controls had a disproportionately higher their findings may not be generalizable to all nurs-
number of comorbidities relative to the general adult ing home residents, they provided initial evidence
population. As a result, the researcher might find it linking medication use and patient falls in Japan.
difficult to generalize the findings of an association Case–control studies are also useful when a health
between severe pneumonia and obesity to a popula- event is rare, a disease has a long latent period, or a
tion of adults without numerous comorbidities. population is difficult to follow.1 For example, given
Information bias poses a simple question: are the relative rarity of the events under investigation,
you measuring what you think you are measur- Wiebe used a case–control study design to explore
ing? Information bias results from the systematic whether having a gun in the home was associated
mismeasurement or misreporting of data6 and its with the incidence of suicide and homicide.8 Wiebe
presence can limit the internal validity of the study. concluded that having a gun in the home was indeed
While information bias can take many forms, its a risk factor for both outcomes.
major sources in case–control studies concern lim- Finally, compared with other study designs,
itations in recall (that is, participants may not case–control studies are relatively inexpensive, in
remember or may misremember an event).2 In the terms of both finances and time. For Hansen and
extreme, recall/rumination bias—systematic misre- Stevens, a case–control study offered a convenient
porting or differences in other recollections within opportunity to explore the association of nursing
or across groups—may be present. Discrepan- night shift work and the incidence of breast can-
cies in recall across groups is referred to as differ- cer within the context of an ongoing cohort study
ential recall.2 Suppose a research team is interested (this is referred to as a nested case–control study).9
in exploring exposures related to placental abrup- Table 2 provides an expanded discussion of these
tion. Women who have experienced a placental study examples.7-10
abruption (cases) are far more likely to be aware At the opposite end of the spectrum, Lewallen
of potential exposures that may have triggered this and Courtright catalog some of the disadvantages
adverse health event than women who had a rela- of case–control study designs, including the retro-
tively standard pregnancy (controls). spective nature of the data, the potential for bias,