Professional Documents
Culture Documents
org
Published in IET Software
Received on 18th February 2013
Revised on 24th July 2013
Accepted on 23rd August 2013
doi: 10.1049/iet-sen.2013.0029
ISSN 1751-8806
Abstract: Systematic Reviews (SRs) have recently intensified in Software Engineering. However, there is a lack of work that
makes explicit how the process to perform SR is in practice. The goal of this paper is externalising the process that reflects
how SRs are currently performed, transforming tacit knowledge into explicit knowledge. We describe the phases and
activities that compose the process and the relationship among them, and explore the iterative characteristic of this process,
focusing on intra- and inter-phase iterations that are necessary to conduct the process appropriately. To achieve the proposed
goal, we devised the process based on practical experience acquired for several years by research groups in Software
Engineering, which include graduate students and researchers who have applied SR. The process has been applied in several
SRs and seems to be effective in keeping the focus of the review at all phases. Moreover, the externalisation of the process
has been useful to help researchers improving the process execution quality. As the process reflects the practice and is
explained in details, it can be used as a guide to better understand the SR process and its details. This shall contribute to
improve all SR process phases, and hence the quality of SR results.
Several authors have reported and discussed obstacles faced 3 Process overview
by them – or by their research groups – while performing
SRs in the SE field. The major challenges are next The incremental SR process, hereafter called I-SR, is
summarised. compound of four phases, namely: (i) planning, (ii) initial
selection, (iii) final selection and extraction and (4)
2.2.1 Rigour of the search process: Zhang and synthesis. Its overall structure is depicted in Fig. 1. The
Muhammad [15] and Zhang et al. [22] highlight that the process layout is inspired in the evolutionary models for
rigour of the search process is one critical factor that software development. Such models aggregate iteration and
distinguishes SR from traditional, ad hoc literature reviews. evolution in the development process, in which the
However, effective searching is far to be a trivial task. produced software artefacts evolve after each cycle of the
Indeed, many SE researchers [1, 11–14, 16] have lately model. In I-SR, the artefacts evolve after executing each
reported difficulties to search relevant studies. Effective phase similarly. In Fig. 1, phases are each represented as a
searching strongly relies on the quality of the search string; circle that comprises a set of activities that can be done
devising it is a time-consuming and error-prone activity iteratively. The phases are briefly described in the sequence.
[22]. The performance of automated search itself is highly Their details are provided in Section 4.
dependent on the quality of the string, which implies a The planning phase comprises the definition of the SR
continuous refinement process. However, defining a search protocol, a pilot search and the design of a preliminary
string may be a ‘double-edged sword’. As emphasised by version of the data extraction form. It is the first phase of
Boell and Cezec-Kecmanovic [23], on the one hand, the the process and has the ‘define/revise protocol’ activity as
more inclusive is the search string, the more irrelevant its starting point.
The initial selection phase comprises activities related to update the data sheets because of novel, up-to-date research
the pre-selection of primary studies. Basically, this phase results that make previous ones obsolete.
requires the customisation of the search string for each It is important to highlight that packaging is performed in
target repository, running the searches, pre-selecting the parallel with the whole process execution. Furthermore, the
primary studies and checking the occurrence of new protocol permeates the whole SR lifecycle. Every designed
candidate keywords to enhance the search string. or produced artefact, every decision taken and every
The final selection and extraction phase encompasses procedure performed by the researchers are driven by the
activities related to the final selection of primary studies and protocol. Reporting all these items creates a package that
data extraction. As in initial selection, during this phase enables both process and results audit, as well as SR
some new candidate keywords may emerge, and hence must replication, update and extension.
be checked. Besides that, additional data extraction fields
may be necessary and thus must be considered in a revised 3.1 Supported execution scenarios
version of the data extraction form.
Finally, the synthesis phase addresses the analysis of the The I-SR process is devised to support SR in four different
extracted data, and reporting and publishing the results. scenarios: (i) original SR, (ii) SR replication, (iii) SR
Filtering is required, for example, when it is necessary to extension and (iv) SR update (the last three are called
4.2 Initial selection phase 3. Perform initial selection: all retrieved studies, from all
repositories, must be selected taking into account their titles
This phase aims to fetch studies in all repositories specified in and abstracts (and possibly other excerpts). If a study can
the protocol. Besides, it includes a first (or initial) selection of be useful for answering the research question, it must be
studies based on the reading of title and abstract (or even other selected; if not, it must be discarded. This decision is based
study excerpts, depending on the protocol definition). This on the inclusion and exclusion criteria defined in the protocol.
phase requires the application of the inclusion and
exclusion criteria defined in the protocol. During the † ‘Filtering’: accomplishing this activity may require a
selection, filters are applied to discard retrieved documents filter, which may be applied while running any SR
not of interest (e.g. studies that are retrieved from more execution scenario. For example, in an SR extension,
than one repository). Besides, if new keywords are after the primary studies are selected, one (or some) of
them may be identified as a new, up-to-date version of
studies which were previously selected. In this case,
according to Kitchenham’s guidelines [17], only the
up-to-date study will populate the dataset. Note that such
situation is likely to occur in any execution scenario, be
it an original SR or a re-execution.
Fig. 3 Checklist for protocol approval Fig. 4 Checklist for initial selection approval
Fig. 5 Checklist for final selection approval Fig. 6 Checklist for extraction approval