Professional Documents
Culture Documents
doi: 10.1093/toxsci/kfw059
Advance Access Publication Date: May 5, 2016
Forum Article
FORUM ARTICLE
ABSTRACT
The Evidence-based Toxicology Collaboration hosted a workshop on “The Emergence of Systematic Review and Related
Evidence-based Approaches in Toxicology,” on November 21, 2014 in Baltimore, Maryland. The workshop featured speakers
from agencies and organizations applying systematic review approaches to questions in toxicology, speakers with experience
in conducting systematic reviews in medicine and healthcare, and stakeholders in industry, government, academia, and non-
governmental organizations. Based on the workshop presentations and discussion, here we address the state of systematic
review methods in toxicology, historical antecedents in both medicine and toxicology, challenges to the translation of
systematic review from medicine to toxicology, and thoughts on the way forward. We conclude with a recommendation that
as various agencies and organizations adapt systematic review methods, they continue to work together to ensure that there
is a harmonized process for how the basic elements of systematic review methods are applied in toxicology.
The Evidence-based Toxicology Collaboration (EBTC) is a group November 21, 2014 at the Johns Hopkins Bloomberg School of
of North American and European researchers from government, Public Health, in Baltimore, Maryland, USA. The workshop fea-
academia, and industry who are interested in applying system- tured speakers from agencies and organizations developing and
atic review methods to toxicology (EBTC, 2015). The EBTC applying systematic review methods to questions in toxicology,
hosted a workshop on “The Emergence of Systematic Review speakers with experience in conducting systematic reviews in
and Related Evidence-based Approaches in Toxicology,” on medicine and healthcare, and stakeholders from industry,
C The Author 2016. Published by Oxford University Press on behalf of the Society of Toxicology.
V
This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (http://creativecom-
mons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work
is not altered or transformed in any way, and that the work properly cited. For commercial re-use, please contact journals.permissions@oup.com
10
STEPHENS ET AL. | 11
government, academia, and non-governmental organizations. of systematic review methods, these organizations’ approaches
The full program and a brief summary are available online share commonalities including the fundamental steps of a sys-
(EBTC, 2015). In this article, we expand on the workshop presen- tematic review.
tations to look at the use of systematic review approaches in
toxicology, their roots in medicine and healthcare, the chal-
lenges facing practitioners, and some thoughts on the way for-
SYSTEMATIC REVIEW METHODOLOGY AS
ward, including implications for toxicologists in designing and
reporting their studies.
APPLIED TO TOXICOLOGY
Systematic review methods constitute a standardized ap- The initial steps of systematic review begin with scoping and
proach for identifying and analyzing evidence related to clearly problem formulation to identify the question of interest, gain a
formulated questions (Higgins and Green, 2011; Institute of sense of the relevant literature, define and refine the question,
Medicine, 2011). Systematic reviews proceed through a se- define a PECO statement (see below), and develop the review
quence of steps, typically formulating a specific research ques- protocol. Formulating the question to be answered is a simple-
tion, developing a review protocol, performing a comprehensive sounding process but it requires careful deliberation because
Advantages.
A protocol for how the review will be conducted—written in advance—reduces the likelihood that ad hoc changes will be made that bias the
outcomes. In cases where the protocol is published or otherwise shared with interested parties in advance of the actual review, stakeholders
are thereby given the opportunity to recommend changes.
The incorporation of explicit criteria for including and excluding individual studies gives readers of the review a clear rationale for why some
studies were included or excluded.
Assessing the risk of bias or broader methodological quality of the included studies gives reviewers and readers a sense of how much confi-
dence to have in the review’s conclusions.
Reviews that assess certain studies as having a high risk of bias are likely to encourage the authors of those studies to improve the quality of
their future research.
The explicit and transparent nature of the review process and its published review give readers a clear sense of how the review was carried
out. This also enables interested parties to replicate the review, with or without making any protocol amendments deemed desirable.
Under certain conditions, data synthesis lends itself to meta-analysis, which provides a quantitative summary of the data from individual
observational studies), in order to reach a conclusion. Data inte- In the OHAT framework, evidence integration begins with
gration frameworks should be true to the evidence-based prin- the process for rating confidence in the findings for each body
ciples of transparency, objectivity, and consistency. of evidence separately (eg, human and animal studies on a par-
Groups of scientists in both the United States and European ticular outcome) based on the GRADE approach (Guyatt et al.,
Union are collaborating to advance systematic review 2008) with modifications on the initial starting point for obser-
approaches in toxicology. Guidance for conducting systematic vational human studies. It includes guidance for human and
reviews in toxicology has been published (European Food Safety animal studies and a process for considering mechanistic stud-
Authority, 2010; Rooney et al., 2014; Woodruff and Sutton, 2014). ies. Rating confidence in the body of evidence is developed us-
There is substantial consistency across the approaches that ing the GRADE factors that reflect strengths and weaknesses of
have been developed in terms of how they frame the review (de- a body of evidence (eg, dose response, or indirectness) with an
veloping a specific research question and PECO statement), additional factor that may increase confidence in the associa-
identify the evidence, evaluate its study quality (eg, by using tion between exposure and health outcome when there is con-
risk of bias tools), and rate the certainty in the findings. The cer- sistency of the response across species, study designs, or
tainty rating in these approaches has been based on the GRADE human populations. These ratings are translated into levels of
approach, a system for evaluating the quality of the body of evi- evidence for each health effect based on whether the reviewed
dence, which is widely accepted in healthcare (Guyatt et al., studies do or do not show an adverse effect. Finally, the degree
2011). of support from mechanistic studies is considered and the 3 evi-
dence streams (human, animal, mechanistic) are integrated to
reach a hazard conclusion of “known,” “presumed,” “sus-
AGENCY FRAMEWORKS AND OTHER EFFORTS pected,” or “not classifiable” as a hazard to humans that reflects
The Office of Health Assessment and Translation (OHAT) of the the confidence and consistency across each body of evidence.
National Institute of Environmental Health Sciences has created OHAT released a detailed methods guide of standard operat-
a framework for applying systematic review methods to envi- ing procedures for using systematic review in its evaluations in
ronmental health questions, including methods to develop con- 2014 (National Toxicology Program, 2015). This guide reflected
clusions from the full range of relevant data (human, animal OHAT’s then-current practices, with the expectation that the
and in vitro data) (Rooney et al., 2014). The OHAT approach was procedures will be updated and refined as best practices in the
developed in a process involving public comment and consulta- field of environmental health and systematic review continue
tion with experts from toxicology and systematic review, build- to evolve.
ing on and extending guidance from major systematic review The US Environmental Protection Agency’s Integrated Risk
groups (eg, the Cochrane Collaboration (Higgins and Green, Information System (IRIS) program is adapting and implement-
2011), Agency for Healthcare Research and Quality ing systematic review methods (US Environmental Protection
(Viswanathan et al., 2013), GRADE Workgroup (Guyatt et al., Agency, 2015a). These procedures, modeled after the Cochrane
2011), and the Navigation Guide Work Group (Woodruff and Handbook for Systematic Reviews of Interventions (Higgins and
Sutton, 2014)). The evaluation process begins with a problem Green, 2011), also reflect recommendations from 2 US National
formulation step to form the specific research question and the Research Council (NRC) reviews of the IRIS process (National
PECO statement, and then involves the development of a proto- Research Council, 2011; National Research Council, 2014).
col for conducting the review. The protocol outlines the meth- IRIS assessments involve multiple systematic reviews to de-
ods for the evaluation tailored to the research question, termine whether a chemical causes specific adverse effects, eg,
including the literature search strategy, inclusion/exclusion cri- for endpoints such as carcinogenicity or neurotoxicity, in hu-
teria, risk of bias approach, establishing confidence in the evi- mans or in animals. The data include mainly animal and in vitro
dence, and methods for evidence integration. studies, and observational epidemiological studies. IRIS
STEPHENS ET AL. | 13
assessments must address disparate data, such as different ani- based on the recognition that evidence is needed in all assess-
mal species and strains that may tolerate different doses, differ- ments and the process for collecting, appraising, and analyzing
ing results (ie, an effect occurs in one species but not another), it should be the same regardless of the objectives of the assess-
or occupational studies conducted while exposure levels ment or who conducts it. Assessments focused on efficacy,
change, eg, use of protective equipment, or changing industrial safety, and risk should all follow the same process. Another ra-
processes. tionale for the Prometheus project is to address the issues pre-
The IRIS program’s emerging approach to systematic review sented when evidence is not available or there is insufficient
is similar to OHAT’s, and it includes a step for systematic inte- time for applying extensive or complex approaches. EFSA re-
gration of evidence for each health outcome. Both the NRC re- cently published a report on the resulting methodological
view (NRC 2014) and a subsequent workshop on the subject framework (European Food Safety Authority, 2015b). The
suggested that guided expert judgment, coupled with structured Authority is also working on a report on how to analyze data
processes, are required for integrating IRIS evidence streams gaps and the impacts thereof.
(US Environmental Protection Agency, 2015b). The EBTC is particularly interested in the Cochrane
After evidence integration, IRIS assessments characterize Collaboration’s emerging methodology for systematic reviews
Toxicology Collaboration in 2011 (Stephens et al., 2013). As in • Expert judgment: Exactly what constitutes the proper role for ex-
medicine, evidence-based approaches in toxicology include not pert judgment in the context of a systemic review also merits
only evidence synthesis across studies (systematic review) but some consideration and, potentially, guidance. The kind of ex-
also the application of individual elements of systematic review pert judgment used in conducting a systematic review is and
methodology to other contexts, such as appraising the risk of should be separate from the kind of expert judgment involved in
bias of an individual article or appraising the quality of evidence making policy. A related challenge is the misperception that evi-
in regulatory submissions. dence-based approaches leave no room for professional judg-
Among the first actual systematic reviews in toxicology and ment. Systematic reviews should strive to make expert
environmental health were those conducted by Navas-Acien, judgments clear along with the scientific basis for those judg-
Silbergeld, and colleagues, examining the association between ments in developing conclusions for a systematic review.
exposure to environmental chemicals and human health effects Analyzing the approach that has been developed for involving
(Navas-Acien et al., 2005; Navas-Acien et al., 2006; Navas-Acien expert judgment in risk analysis may prove helpful in efforts to
et al., 2008). These early reviews examined in vivo, in vitro, and determine how to best use expert judgment in systematic re-
epidemiological evidence to address specific questions. More re- views for toxicology (Cooke and Goossens, 2008; Morgan, 1992).
actual pathways in the human body and whether proposed time for clarity in definition, methods, and reporting. J. Clin.
pathway-based tests do a good job of tracking perturbations to Epidemiol. 67, 1291–1294.
the pathways in question. These questions are related to the Cooke, R. M., and Goossens, L. H. J. (2008). TU Delft expert judg-
thorny issue of test method validation. More generally, evi- ment data base. Special issue on expert judgment. Reliab.
dence-based toxicology can aid in evaluating new mechanistic Eng. Syst. Safe 93, 657–674.
in vitro tools for assessing toxicity (Hartung, 2010). The new eval- de Vries, R. B. M., Wever, K. E., Avey, M. T., Stephens, M. L., Sena,
uation approaches compare well to traditional validation E. S., and Leenaars, M. (2014). The usefulness of systematic
approaches in that they are more systematic and able to focus reviews of animal experiments for the design of preclinical
on mechanistic relevance, rather than on predicting animal and clinical studies. ILAR J. 55, 427–437.
data (Hartung et al., 2013). Approaches for how the methodology Evidence-based Toxicology Collaboration (2014). EBTC
can be used to validate high-throughput assays in support of Workshop, November 21, 2014, Baltimore, MD. Available at:
21st century toxicity testing are being developed (Judson et al., http://www.ebtox.com/nov2014_wksp/. Accessed November
2013). 13, 2015.
The issue of training came up repeatedly during the work- Evidence-based Toxicology Collaboration (2015). Available at:
Judson, R., Kavlock, R., Martin, M., Reif, D., Houck, K., Knudsen, literature-based environmental health science assessments.
T., Richard, A., Tice, R. R., Whelan, M., Xia, M., et al. (2013). Environ. Health Perspect. 122, 711–718.
Perspectives on validation of high-throughput assays sup- Sackett, D., Rosenberg, W. M., Gray, J. A., Haynes, R. B., and
porting 21st century toxicity testing. Altex 30, 51–66. Richardson, W. S. (1996). Evidence-based medicine: what it is
Klimisch, H. J., Andreae, M., and Tillmann, U. (1997). A systematic and what it isn’t. BMJ 312, 71.
approach for evaluating the quality of experimental toxicolog- Samuel, G. O., Hoffmann, S., Wright, R. A., Lalu, M. M., Patlewicz,
ical and ecotoxicological data. Regul. Toxicol. Pharmacol. 25, 1–5. G., Becker, R. A., DeGeorge, G. L., Fergusson, D., Hartung, T.,
Koustas, E., Lam, J., Sutton, P., Johnson, P. I., Atchley, D. S., Sen, Lewis, R. J., et al. (2016). Guidance on assessing the methodo-
S., Robinson, K. A., Axelrad, D. A., and Woodruff, T. J. (2014). logical and reporting quality of toxicologically relevant stud-
The Navigation Guide—evidence-based medicine meets en- ies: a scoping review. Environ. Int. http://dx.doi.org/10.1016/j.
vironmental health: systematic review of nonhuman evi- envint.2016.03.010.
dence for PFOA effects on fetal growth. Environ. Health Schneider, K., Schwarz, M., Burkholder, I., Kopp-Schneider, A.,
Perspect. 122, 1015–1027. Edler, L., Kinsner-Ovaskainen, A., Hartung, T., and
Krauth, D., Woodruff, T. J., and Bero, L. (2013). Instruments for as- Hoffmann, S. (2009). “ToxRTool”, a new tool to assess the reli-