You are on page 1of 3

2020 IEEE World Congress on Services (SERVICES)

ViewBovine: A Microservices-Powered Web Application to Support Interactive


Investigation of Bovine Tuberculosis Infection Pathways

Fan Yang-Turner, Denis Volk, Tim Peto Tony Roberts, Juan Herreros, Richard Ellis
Nuffield Department of Clinical Medicine Animal and Plant Health Agency
University of Oxford Addlestone, United Kingdom
Oxford, United Kingdom

Abstract—Bovine tuberculosis (bTB) is a chronic infectious Sequencing (WGS) has been applied to the study of bTB in
disease caused by Mycobacterium bovis (M. bovis) which affects GB and approximately 10,000 M. bovis isolates from cattle
cattle and a broad range of other mammalian hosts, including have already been sequenced and analysed by the Animal and
humans. Animal health authorities face the challenge of Plant Health Agency (APHA).
identifying epidemiologically linked cases with scattered or Single Nucleotide Polymorphism (SNP) analysis is a
incomplete information. Here, we report a novel microservices- common initial step to rapidly detect the relationships between
powered web application, ViewBovine, to support investigations sequences of bacteria species. Genetic relatedness can be
into bTB incidents by government veterinary inspectors and
defined by the SNP distance, i.e. the number of base-pair
epidemiologists. ViewBovine is powered by microservices that
differences between the genomes of two isolates of the same
integrate genetically related isolates of M. bovis and cattle
movement information, with web-based visualisation and
bacterial species. In some cases, the SNP analysis is sufficient
interactive features. This enables users to quickly investigate, to generate epidemiologically important information. In other
evaluate and gain insights into the relatedness of different bTB cases, phylogenetic trees based on evolutionary inference are
cases. We present an evaluation of this tool using three cases also needed to adjust estimates of divergence [3].
which demonstrate the strengths of the tool and showcase its The main route of transmission of bTB among cattle is
usage in practice. Although ViewBovine is designed for bTB, it through respiratory aerosols, which can result in within-herd
demonstrates the successful design and implementation of an spread of infection. It is also known that a large proportion of
interactive web-based application for routine monitoring of new herd infections result from direct movement of infected
pathogen infections and outbreak investigation in a timely animals from endemic bTB areas [4]. Three policy control
manner. zones have been delineated in England according to bTB
incidence risk: a high risk area (HRA) in the south-west, the
Keywords-genome analysis; bovine TB; interactive design; Low bTB Risk Area (LRA) and a buffer zone Edge Area (EA)
microservice; epidemiological study. in between [5].
Epidemiological data such as herd location and historical
I. INTRODUCTION movement records of the infected animals are essential
Bovine tuberculosis (bTB) is an infectious bacterial resources for bTB outbreak investigation. Integration of
disease caused by Mycobacterium bovis (M. bovis) that genetic and epidemiological data to determine transmission
threatens cattle industry and presents risks to other livestock, pathways can be an effective approach in traditional
wildlife and domestic pets. The bovine epidemic in cattle and algorithm-statistical analysis [6]. This methodology has
badgers in the UK, with occasional spill-over into other proved that more accurate estimation of transmission
domestic species also represents a low but ongoing public pathways from epidemiological data could be achieved by
health risk. For example, almost 4,400 herds experienced new incorporating information about the possible movement of
cases (incidents) of bTB in Great Britain (GB) in 2018 [1]. animals between farms [7][8]. In the public health domain,
Investigating a bTB case or groups of cases is a complex visual analytics have been recognized as useful tools for
activity that requires a grasp of epidemiological, laboratory molecular epidemiological studies [9]. A few applications
have been developed to explore this approach [10][11][12].
and environmental assessments.
However, the visualization applications have no interactive
Animal health authorities often face the challenge of
features, setting different parameters for dataset queries and
identifying epidemiologically linked cases over a short period
customizing the views etc. To the best of our knowledge,
of time in a specific geographical area, or clusters of cases
ViewBovine is the first web-based application that integrates
with a presumptive common source not necessarily clustered
epidemiological and genetic data analysis in an interactive
geographically. In this regard, identifying epidemiologically
way to investigate bTB transmission.
linked cases and the infection source are essential for risk
assessment, outbreak investigations and to prevent further III. MICROSERVICES APPROACH
bTB transmission.
To enable our users to explore genetic analytics results and
II. BACKGROUND epidemiological data in an integral view anywhere and
anytime on the web, we took a microservices approach and
Genome analysis of disease-causing microorganisms, likc
designed a substantial API (Application Programming
bacteria or virus has the potential to radically improve the
Interfaces) layer to support data aggregation. We used web
detection of disease transmission [2]. Whole Genome

2642-939X/20/$31.00 ©2020 IEEE 4


DOI 10.1109/SERVICES48979.2020.00014

Authorized licensed use limited to: Newcastle University. Downloaded on May 19,2021 at 00:30:07 UTC from IEEE Xplore. Restrictions apply.
service standard RESTful API [13] for integrating the data from test negative but infected (undisclosed infection) cattle.
from different sources. Our implementation approach has Analysis of the WGS using ViewBovine indicated that this
three advantages. Firstly, an API server has more particular M. bovis isolate was genetically highly similar to
computational resources than a web server, managing large isolates derived from a cluster of cattle cases in Oxfordshire
datasets using multiprocessing CPUs to build aggregated which included a farm where this animal had been previously
results from different sources. Secondly, APIs cache the result resident. Using Cohabitation Records (an additional output of
of searching query and enable repeated queries to respond the application), we identified that at this farm, the
quickly. Thirdly, APIs use background processing to update Bedfordshire case animal was resident with two infected
the new dataset, allowing the current version of data to be
animals which also only tested positive after being moved to
accessed by the end user without any interruption.
other farms (Figure 2). The M. bovis bacteria isolated from
ViewBovine APIs aggregate both structured and
unstructured data. To consistently consume the data, it has all three cases were part of the same genetic cluster. The
been serialized to a JSON format. The APIs are grouped into Oxfordshire farm had no bTB history and therefore was not
three categories to support our three interactive features: 1) registered as an infected farm. ViewBovine was able to
herds with genetically-related samples; 2) genetically related identify this farm as the likely source of infection through
samples with cattle movement data; and 3) genetic clustering combining cohabitation and WGS similarity records. Using
of isolates with herd location. this visualisation allowed the very rapid identification of the
As shown in Figure 1, ViewBovine has a microservices Oxfordshire holding as the source of infection. Without this
architecture: (i) Data layer that consists of three data sources, information, case investigation would have taken many more
epidemiological data including cattle data (herd location and resources and remained inconclusive, or worse, incorrect.
cattle movements); Genetic relatedness from WGS data
analytics via a genetic relatedness server [14]; and
Phylogenetic tree data from a tree building server [15] (ii)
Microservices layer that takes epidemiological data, genetic
analytics data and phylogenetic tree from data layer and
Figure 2. ViewBovine Cohabitation Records allows visualisation of case
aggregates them for a user interface; (iii) UI layer that displays cohabitation. Three cases were resident on the same farm in different
data retrieved from a microservices layer and provides periods.
interactive visualisation of bTB related cases.
B. Case Study B: Differentiating Infection Sources
In the Edge Area (EA), incidence of bTB has been
increasing for the past few years in contrast to a slight decrease
observed in the HRA [16]. ViewBovine was used to
investigate the underlying cause for the formation of a new
cluster of twenty cases in the EA of the Midlands and in
particular to determine whether it was an area of potential
emerging endemicity for bTB.
A matrix produced from WGS comparative analysis
identified degrees of relatedness between the isolates derived
from cases in the cluster identifying some cases as unrelated.
Use of ViewBovine allowed rapid identification of the closest
Figure 1. Microservices Architecture of ViewBovine related isolates in the database and geographic locations. This
visual matching of movement history, and locations of related
bTB isolates had provided clear evidence that infection had
IV. CASE STUDIES originated in another location, outside the area under study
(Figure 3).
A. Case Study A: Establishing the Most Likely Source of
The remainder of the cases in the cluster which were
Infection closely related, were concluded as likely to have originated by
contact, direct or indirect, with endemic infection (i.e. local
In the LRA, the majority of animals found to be infected have infection involving cattle and/or wildlife).
been purchased from an area of the country with relatively ViewBovine was subsequently used for every new local
high prevalence of bTB and were infected before they were case, facilitating rapid comparative analysis of the closest
moved [4]. It can be difficult and time consuming to find the genetic isolates in the database and immediate spatial
evidence for this. This case was an animal that had tested visualisation. Critically, this informed the epidemiological
positive for bTB on a farm in Bedfordshire located in the investigation identifying probable infection pathways faster
LRA, but had been resident on three farms previously, none and with a higher degree of certainty than standard methods
of which had a history of bTB. The initial hypothesis was that using movement records, and genotype databases (see Case
infection had occurred on the farm where it was disclosed Study C: Local Infection).

Authorized licensed use limited to: Newcastle University. Downloaded on May 19,2021 at 00:30:07 UTC from IEEE Xplore. Restrictions apply.
opportunities to include data from wildlife reservoirs of M.
bovis, and to provide scientific evidence on the transmission
of bTB between wildlife animals and cattle. Finally, we
believe that ViewBovine has set a benchmark for
investigating pathogen transmission using a visualisation and
data aggregation approach powered by microservices. We
suggest that the time is ripe to start building interactive
visualisation tools with genetic analytics data augmented with
epidemiological information in the routine monitoring and
surveillance in public and animal health domain.
REFERENCES
Figure 3. ViewBovine facilitates visualization of case movement history. An
[1] Bovine TB statistics: Great Britain, available at:
isolate from new bTB breakdown (in red) is displayed against the closest
https://researchbriefings.parliament.uk/ResearchBriefing/Summary/S
related –within 2 SNP distance- isolates (in blue). The red line indicates the
N06081
movement of the cattle from which the isolate originated, illustrating that the
animal had been moved from the area with clustered genetic related cases. [2] Walker TM, Monk P, Smith EG, Peto TE. (2013). Contact
investigations for outbreaks of mycobacterium tuberculosis: advances
C. Case Study C: Local Infection through whole genome sequencing. Clin Microbiol Infect.
2013;19(9):796–802
For twenty years, the APHA has classified M. bovis
[3] Eyre DW, Cule ML, Wilson DJ, Griffiths D, Vaughan A, O'Connor L,
isolates into genotypes based on spoligotype and VNTR Ip CL, Golubchik T, Batty EM, Finney JM, et al. (2013). Diverse
(Variable Number Tandem Repeat) type which provides less sources of C. Difficile infection identified on whole-genome
exact information about genetic relatedness than WGS. This sequencing. New England Journal of Medicine. 2013; 369(13):1195–
is currently used in parallel to WGS. A cluster of four bTB 1205
incidents occurred within a relatively small area of [4] Godfray, H.C., Donnelly, C.A., Kao, R.R., Macdonald, D.W.,
Oxfordshire in 2018. M. bovis isolates from each affected McDonald, R.A., Petrokofsky, G., Wood, J.L., Woodroffe, R., Young,
D.B., McLean, A.R., 2013. A restatement of the natural science
cattle farm were all identified with different genotypes, which evidence base relevant to the control of bovine tuberculosis in Great
in the absence of whole genome sequence information Britain. Proc Biol Sci 280, 20131634.
appeared to be unrelated. Apart from one genotype which was [5] Defra, 2014. The Strategy for achieving Officially Bovine
endemic to this area, the remaining three were concluded as Tuberculosis Free status for England, April 2014
likely caused by movement of infected cattle from the HRA. [6] Samuel AR, Knowles NJ. Foot-and-mouth disease type O viruses
ViewBovine demonstrated the close geographical clustering exhibit genetically and geographically distinct evolutionary lineages
using the map interface, and that the endemic genotype was (topotypes) J Gen Virol. 2001;82:609–621.
only one SNP different to each of the other three. This [7] Cottam EM, Thébaud G, Wadsworth J, Gloster J, Mansley L, Paton DJ,
King DP, Haydon DT. Integrating genetic and epidemiological data to
information dispelled the confusion created by the presence of determine transmission pathways of foot-and-mouth disease virus.
four different genotypes and instead suggested that the M. Proc R Soc B. 2008;275:887–895.
bovis isolates from the four farms were closely related [8] Jamal SM, Ferrari G, Ahmed S, Normann P, Curry S, Belsham GJ.
genetically, and all part of local endemic infection. . Evolutionary analysis of serotype a foot-and-mouth disease viruses
circulating in Pakistan and Afghanistan during 2002–2009. J Gen irol.
V. CONCLUSIONS AND FUTURE WORK 2011; 92: 2849–2864.
The ability to control and conduct epidemiological [9] Huber, W. et al. (2015). Orchestrating high-throughput genomic
analysis with Bioconductor. Nat. Methods 12, 115–121.
investigations of bTB incidents is a major task of animal
[10] Ola O, Sedig K. The challenge of big data in public health: An
health authorities. Establishing the likely source and route of opportunity for visual analytics. Online J Public Health Inform.
transmission of bTB cases can be quite challenging and time- 2014;5(3):223
consuming. With advances in whole genome sequencing of [11] Smith CM, Hayward AC. Dotmapper: an open source tool for creating
pathogens and modern bioinformatics pipelines, visualising interactive disease point maps. BMC Infect Dis. 2016;16(1):145.
both genetic analytics and epidemiological data could play a [12] Hadfield J, et al. (2018) Nextstrain: real-time tracking of pathogen
critical role in understanding disease behaviour. evolution Bioinformatics, 34 (2018), pp. 4121-4123.
Algorithmic-statistical analysis is the dominant method in [13] Richardson, Leonard; Ruby, Sam (2007). RESTful Web Services.
genomics data analysis. As more and more bacterial genomes Sebastopol, California: O'Reilly Media.
are sequenced and analysed using high performance [14] Mazariegos-Canellas O, Do T, Peto T, Eyre DW, Underwood A, Crook
computing in time spans of a few hours, novel data D, Wyllie DH. (2017). BugMat and FindNeighbour: command line and
server applications for investigating bacterial relatedness. BMC
visualisation tools promise to support scientific discovery to Bioinformatics 18:477
public and animal health. We introduce a microservices- [15] Nguyen, L.-T.,Schmidt, H.A., von Haeseler A., Minh, B.Q. (2015). IQ-
powered web application, ViewBovine, to support TREE: A fast and effective stochastic algorithm for estimating
investigating bTB cases in a few minutes. The cases described maximum likelihood phylogenies. Mol. Biol. Evol., 32:268-2
herein are examples of how ViewBovine has helped [16] APHA, 2018. Bovine tuberculosis in England in 2018:
epidemiologists to rapidly investigate and gain insights into Epidemiological analysis of the 2018 data and historical trends.
the likely infection sources. In the future, we will seek

Authorized licensed use limited to: Newcastle University. Downloaded on May 19,2021 at 00:30:07 UTC from IEEE Xplore. Restrictions apply.

You might also like