Professional Documents
Culture Documents
Ferramenta de Metodologia
Ferramenta de Metodologia
com
Int J Health Policy Manag 2014, 3(3), 123–128 doi 10.15171/ijhpm.2014.71
Original Article
Abstract
Article History:
Background: Recently there has been a significant increase in the number of systematic reviews addressing
Received: 18 June 2014
questions of prevalence. Key features of a systematic review include the creation of an a priori protocol, clear
Accepted: 11 August 2014
inclusion criteria, a structured and systematic search process, critical appraisal of studies, and a formal process
ePublished: 13 August 2014
of data extraction followed by methods to synthesize, or combine, this data. Currently there exists no standard
method for conducting critical appraisal of studies in systematic reviews of prevalence data.
Methods: A working group was created to assess current critical appraisal tools for studies reporting prevalence
data and develop a new tool for these studies in systematic reviews of prevalence. Following the development of
this tool it was piloted amongst an experienced group of sixteen healthcare researchers.
Results: The results of the pilot found that this tool was a valid approach to assessing the methodological
quality of studies reporting prevalence data to be included in systematic reviews. Participants found the tool
acceptable and easy to use. Some comments were provided which helped refine the criteria.
Conclusion: The results of this pilot study found that this tool was well-accepted by users and further
refinements have been made to the tool based on their feedback. We now put forward this tool for use by
authors conducting prevalence systematic reviews.
Keywords: Prevalence, Survey, Critical Appraisal, Systematic Review
Copyright: © 2014 by Kerman University of Medical Sciences
Citation: Munn Z, Moola S, Riitano D, Lisy K. The development of a critical appraisal tool for use in *Correspondence to:
systematic reviews addressing questions of prevalence. Int J Health Policy Manag 2014; 3: 123–128. Zachary Munn
doi: 10.15171/ijhpm.2014.71 Email: zachary.munn@adelaide.edu.au
The Joanna Briggs Institute, Faculty of Health Sciences, University of Adelaide, Adelaide, Australia
Munn et al.
formal quality assessment altogether (14,15). Therefore, the external review. Details on how to answer each question in
working group sought to address this gap by developing and the tool are available in Appendix 1 and the Joanna Briggs
testing a critical appraisal form that could be used for studies Institute guidance on conducting prevalence and incidence
included in systematic reviews of prevalence data. reviews (Table 2) (21).
Figure 1. Number of systematic reviews of prevalence by year of publication identified in a PubMed search
124 International Journal of Health Policy and Management, 2014, 3(3), 123–128
Munn et al.
Question Measurement
Ease of use of the tool 5-point Likert scale (1 very difficult, 5 very easy)
Is this a valid tool for prevalence data? Yes/No
Timeliness 5-point Likert Scale (1 very unacceptable, 5 very acceptable)
Acceptability 5-point Likert Scale (1 very unacceptable, 5 very acceptable)
Redundant questions Free text
Comments for improvement Free text
participants providing a rating of 4, corresponding to ‘easy’. burden of disease issues both now and in to the future.
For the acceptability of the tool, the mean score was 4.33, with The conduct of a systematic review is a scientific exercise
all participants giving either a ranking of 4 (acceptable) or 5 that produces results which may influence healthcare
(very acceptable). For timeliness, the mean score was 3.94, decisions. As such, reviews are required to have the same
with 88% providing a score of 4 (acceptable). Out of all the rigor expected of all research. The quality of a review, and
participants, all except 1 viewed the tool as a valid quality any recommendations for practice that may arise, depends on
appraisal checklist for prevalence data (Table 4). the extent to which scientific review methods are followed to
There were a number of suggestions provided for refinement minimize the risk of error and bias. The explicit and rigorous
and improvement of the tool. These comments resulted in methods of the process distinguish systematic reviews from
some changes in the order of the questions of the tool and traditional reviews of the literature (2).
the supporting information used to assist in judging criteria, Systematic reviews normally rely on the use of critical appraisal
although no changes were made to the individual questions. checklists that are tailored to assess the quality of a particular
study design. For example, there may be separate checklists
Discussion used to appraise randomized controlled trials, cohort studies,
Systematic reviews of prevalence and incidence data are cross-sectional studies and so on. Prevalence data can be
becoming increasingly important as decision makers realize sourced from various study designs, even randomized
their usefulness in informing policy and practice. These controlled trials (11); however, critical appraisal tools directed
reviews have the potential to better support healthcare at assessing the risk of bias of randomized controlled trials are
professionals, policy-makers, and consumers in making aimed at assessing biases related to causal effects and hence
evidence-based decisions that effectively target and address are not appropriate for reviews examining the prevalence
International Journal of Health Policy and Management, 2014, 3(3), 123–128 125
Munn et al.
of a condition. For example, criteria regarding the use of an workshop and the wider staff of the Joanna Briggs Institute
intention-to-treat analysis as often seen in critical appraisal and its collaboration for their feedback.
checklists for randomized controlled trials are not a true
quality indicator for questions of prevalence. Ethical issues
Due to this, a new tool assessing validity and quality No ethical issues are raised.
indicators specific to issues of prevalence has been developed.
This checklist addresses critical issues of internal and external Competing interests
validity that must be considered when assessing validity of The authors declare that they have no competing interests.
126 International Journal of Health Policy and Management, 2014, 3(3), 123–128
Munn et al.
Key Messages
International Journal of Health Policy and Management, 2014, 3(3), 123–128 127
Munn et al.
to determine an adequate sample size. This will estimate how bias. Many health problems are not easily diagnosed or
many subjects are needed to produce a reliable estimate of the defined and some measures may not be capable of including
measure(s) of interest. For conditions with a low prevalence, or excluding appropriate levels or stages of the health problem.
a larger sample size is needed. Also consider sample sizes for If the outcomes were assessed based on existing definitions or
subgroup (or characteristics) analyses, and whether these are diagnostic criteria, then the answer to this question is likely to
appropriate. Sometimes, the study will be large enough (as be yes. If the outcomes were assessed using observer reported,
in large national surveys) whereby a sample size calculation or self-reported scales, the risk of over- or under-reporting
is not required. In these cases, sample size can be considered is increased, and objectivity is compromised. Importantly,
adequate. determine if the measurement tools used were validated
When there is no sample size calculation and it is not a large instruments as this has a significant impact on outcome
national survey, the reviewers may consider conducting their assessment validity.
own sample size analysis using the following formula (24,25):
7. Was the condition measured reliably?
Z 2 P(1 − P) Considerable judgment is required to determine the presence
n=
d2 of some health outcomes. Having established the objectivity
Where: of the outcome measurement instrument (see item 6 of this
n= sample size scale), it is important to establish how the measurement
Z= Z statistic for a level of confidence was conducted. Were those involved in collecting data
P= Expected prevalence or proportion (in proportion of one; trained or educated in the use of the instrument/s? If there
if 20%, P= 0.2) was more than one data collector, were they similar in terms
d= precision (in proportion of one; if 5%, d= 0.05) of level of education, clinical or research experience, or level
of responsibility in the piece of research being appraised?
4. Were the study subjects and setting described in detail? - Has the researcher justified the methods chosen?
Certain diseases or conditions vary in prevalence across - Has the researcher made the methods explicit? (For
different geographic regions and populations (e.g. women interview method, how were interviews conducted?)
vs. men, socio-demographic variables between countries).
Has the study sample been described in sufficient detail so 8. Was there appropriate statistical analysis?
that other researchers can determine if it is comparable to the As with any consideration of statistical analysis, consideration
population of interest to them? should be given to whether there was a more appropriate
alternate statistical method that could have been used. The
5. Is the data analysis conducted with sufficient coverage of the methods section should be detailed enough for reviewers
identified sample? to identify the analytical technique used and how specific
A large number of dropouts, refusals or “not founds” amongst variables were measured. Additionally, it is also important to
selected subjects may diminish a study’s validity, as can low assess the appropriateness of the analytical strategy in terms
response rates for survey studies. of the assumptions associated with the approach as differing
- Did the authors describe the reasons for non-response methods of analysis are based on differing assumptions about
and compare persons in the study to those not in the the data and how it will respond. Prevalence rates found in
study, particularly with regards to their socio-demographic studies only provide estimates of the true prevalence of a
characteristics? problem in the larger population. Since some
- Could the not-responders have led to an underestimate of subgroups are very small, 95% confidence intervals are
prevalence of the disease or condition under investigation? usually given.
- If reasons for non-response appear to be unrelated to the
outcome measured and the characteristics of non-responders 9. Are all important confounding factors/ subgroups/differences
are comparable to those in the study, the researchers may be identified and accounted for?
able to justify a more modest response rate. Incidence and prevalence studies often draw or report findings
- Did the means of assessment or measurement negatively regarding the differences between groups. It is important that
affect the response rate (measurement should be easily authors of these studies identify all important confounding
accessible, conveniently timed for participants, acceptable in factors, subgroups and differences and account for these.
length, and suitable in content).
10. Were subpopulations identified using objective criteria?
6. Were objective, standard criteria used for measurement of Objective criteria should also be used where possible to
the condition? identify subgroups (refer to question 6).
Here we are looking for measurement or classification
128 International Journal of Health Policy and Management, 2014, 3(3), 123–128