Professional Documents
Culture Documents
trajectories comes in different forms such as destination and future route prediction
from personal movement data [30], real-time monitoring of water quality using
trajectories of live fish [71], analyzing the trajectories of migrating birds [111],
searching for similar trajectories in spatial networks [118], or understanding the
traffic flow using trajectories in road networks [86].
Discovering interesting, but implicit patterns from spatiotemporal datasets is
crucial for many scientific domains such as astronomy [57, 74], ecology [130],
meteorology [52], geophysics [105], and criminology [127]. The ever-growing
nature of data being generated and collected from various scientific sources makes
the data-driven knowledge discovery process very challenging to the researchers in
these fields. The manually performed data analysis tasks are no longer feasible with
the volume and velocity of massive spatiotemporal datasets. Thus, automated dis-
covery of relevant information from spatiotemporal datasets is important for many
organizations that employs these datasets in their decision making processes [109].
The spatiotemporal frequent pattern mining can be useful for the verification and
prediction of scientific phenomena in a broad range of scientific fields including
meteorology, geophysics, epidemiology, and astronomy [37]. The discovered spa-
tiotemporal patterns can be used for modeling various scientific phenomena (e.g.,
4 1 A Gentle Introduction to Spatiotemporal Data Mining
Fig. 1.1 Polygon-based representations of two coronal holes reported to Heliophysics Event
Knowledgebase (HEK) [69] between ‘23 January 2012 07:00’ and ‘25 January 2012 07:00’
1.2 Motivation and Challenges 5
events such as coronal mass ejections and solar flares. Coronal mass ejections
and solar flares impact radiation in space, can reduce the safety of space and air
travel, disrupt intercontinental communication and GPS, and even damage power
grids [67].
Fig. 1.2 In [132], Wong et al. illustrate their cell tracking results, and compare its accuracy with
manual image analysis performed by human experts. They argue that these two methods have
excellent agreement. The tracking software models the embryos as a collection of ellipses with
position, orientation, and overlap indices. Images in top row show the frames from original time-
lapse sequence. Images in bottom row show the overlaid ellipses found after tracking. Wong et
al. claims that with these models, the duration of cytokinesis and time between mitoses can be
identified. (Image is copied from [132]—See Figure 3.a)
Fig. 1.3 In [32], Conaghan et al. present the results of the tracking software they used. The
primary features tracked by the software are the cell membranes. By using a data-driven
probabilistic framework, the software generates an embryo model that includes an estimate of
the number of blastomeres, as well as spatiotemporal attributes such as size, location, and shape,
as a function of time. (Image is copied from [32]—See Figure 2.a)
1.2.3 Epidemiology
1.3 Challenges