A Survey and Quantitative Study On Map Inference Algorithms From GPS Trajectories

This article has been accepted for publication in a future issue of this journal, but has not been
fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2020.2977034, IEEE
Transactions on Knowledge and Data Engineering
1
A Survey and Quantitative Study on Map

Inference Algorithms from GPS Trajectories
Pingfu Chao, Wen Hua, Rui Mao, Jiajie Xu, and Xiaofang Zhou, Fellow, IEEE
Abstract—Map inference algorithm aims to construct a digital map trajectory. In general, the map inference from aerial im-
from other data sources automatically. Due to the labour intensity of agery extracts the map layout from static satellite images
traditional map creation and the frequent road change nowadays, map through image processing, whereas the map inference from
inference is deemed to be a promising solution to automatic map GPS trajectories identify the road network by clustering
construction and update. However, existing map inference from GPS
the users’ vehicular traces. Despite their common objective,
trajectories suffers from low GPS data quality, which makes the quality
of the constructed map unsatisfactory. In this paper, we study the ex-
these two types of methods rely on completely different
isting map inference algorithms using GPS trajectories. Different from data sources and methodologies, and they show unique
previous surveys, we (1) include the most recent solutions and propose characteristics, respectively. Therefore, several recent works
a new categorisation of method; (2) study how different types of GPS are also proposed which integrate both data sources for
errors affect the quality of inference results; (3) evaluate the existing better performance [5].
map inference quality measures regarding their ability to identify map
Considering the equal popularity of both map inference
quality issues. To achieve these goals, we conduct a comprehensive
experimental study on several representative algorithms using both real-
types and their significant technical differences, in this pa-
world datasets and synthetic datasets, which are generated from our per, we mainly focus on the study of trajectory map infer-
proposed synthetic trajectory generator and artificial map generator. ence. Hitherto, several survey papers [6]–[8] and a book [1]
Overall, our study provides insightful observations regarding (1) which have been published summarising existing trajectory map
inference method performs better in each working scenario, (2) the inference algorithms from different aspects. In particular,
general data quality requirements for map inference, (3) the direction Ahmed et al. [1], [7] present a well-rounded introduction
of future works for quantitative map quality measures. to previous map inference algorithms, together with evalu-
ation measures, experiments and comparison. As the most
Index Terms—Map inference, experimental study, GPS trajectory, GPS representative surveys, these works help new researchers
error, metrics/measurement quickly dive into this area with some available codes [1].
However, as they mainly focus on delivering the idea of map
inference, most of their experiments are conducted through
1 I NTRODUCTION visualisation, aiming to showcase the algorithms’ properties
rather than performance. Although some surveys compare
Nowadays, digital maps have been used as the foundation the solutions via quantitative measures, they all lack enough
of various map services, such as navigation, vehicle tracking candidates [1], [6], [8] and proper reasoning for their mea-
and location-based advertising. Previously, digital maps sures. Overall, none of the existing work clearly illustrates
were constructed by labour-intensive field survey which the advantages of each type of solutions, and there is no
requires long processing time and huge labour investment. research on how GPS data quality affects their performance.
Hence, most of the map data vendors (e.g., TomTom and As plenty of new solutions [9]–[17] have emerged since the
Google) only focus on constructing and updating maps of last comprehensive survey [1] and some of them [15], [17]
urban areas for maximum commercial profit. The recent violate the previous way of categorisation [7], in this paper,
emergence of Volunteered Geographic Information (VGI) we briefly review the existing map inference methods and
complements maps with the contribution from voluntary propose a new categorisation. More importantly, we discuss
users. As a successful example, OpenStreetMap has been the the main factors to the map inference quality and address
largest free editable map created by VGI. However, since the issues in the current quantitative evaluation system
volunteered information may contain inaccurate input or through our experimental comparison on representative
even intended errors, dedicated editors are still necessary algorithms. In general, our contributions over the existing
to manually validate the map. Moreover, as the road net- surveys are as follows:
work changes frequently over time, neither field survey
nor volunteered information can update the map timely, • We conduct a comprehensive experimental study on the
which motivates the research of automatic map inference existing trajectory map inference algorithms. The chosen
(also known as map construction [1]) and map update [2]. candidates, including both up-to-date and classical solu-
Currently, two major resources are being used for auto- tions, are more diverse methodologically. Moreover, by
matic map inference: the aerial imagery [3], [4] and GPS introducing datasets with different scales and characteris-
1041-4347 (c) 2019 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Authorized licensed use limited to: University of Melbourne. Downloaded on September 28,2021 at 15:41:28 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI 10.1109/TKDE.2020.2977034, IEEE
2
tics and deploy the solutions under the same platform, we the number and the stability of satellite connections oscillate
are able to evaluate both their effectiveness and efficiency from time to time, the sampled trajectory point usually lands
under different input features, which has not been done to an arbitrary place nearby the actual location, which is
by previous works. We share our code online1 for future assumed to follow the Gaussian distribution [19] in most
research in this field. cases.
• We study the influence of GPS data quality issues on Sampling Error: Although an object is moving continuously,
the performance of various inference algorithms. To this its position can only be sampled periodically by the GPS
end, we propose a synthetic dataset generator to simulate device. Hence, the frequency of GPS position sampling,
different types of GPS errors. The candidate algorithms which is called the sampling rate, is an influential factor to
are tested on synthetic datasets to reveal the influence of the accuracy of GPS trajectory representation. For example,
noisy data to the constructed map quality, and we draw considering a vehicle runs at a speed less than 50km/h
conclusions on what types of noisy trajectories should be and is sampled every 30 seconds, the maximum travelled
filtered beforehand to achieve better inference quality. distance between two consecutive samples p1 and p2 is
• We discuss the difficulty of map inference evaluation and already 417 meters. Therefore, instead of the straight-line
the weaknesses of current evaluation system. To address representation p1 p2 , the actual trajectory over the past 30
the issues of existing quality measures, we first elabo- seconds is missing. Moreover, the sampling rate may vary
rate on the possible map quality issues of constructed along a trajectory due to missing samples caused by the loss
maps and propose an artificial map generator, which of the GPS signal, which makes the trajectory quality even
produces maps with a certain type of errors, to evaluate worse.
the effectiveness of the current quantitative measures in In addition to the GPS errors, another problem of tra-
identifying different map problems. jectory data, which is trajectory disparity, also affects the
The rest of the paper is organised as follows: we in- performance of map inference process. Note that the roads
troduce some fundamental concepts in Section 2. We then on the map are travelled in different frequencies. Regarding
summarise existing inference algorithms and discuss quan- the map inference problem, a road that is never travelled
titative map evaluation in Sections 3 and 4 respectively. is impossible to be inferred as it does not appear in the
Our experiments and observations are reported in Section trajectory dataset. Hence, the trajectory disparity makes the
5, followed by a brief conclusion in Section 6. rarely travelled road harder to be inferred.
In fact, none of the aforementioned errors occurs solely
in practice. Moreover, due to various GPS settings, real-
2 P RELIMINARIES
world trajectory datasets usually show different levels of
In this section, we formally define the input and output measurement errors, sampling errors and disparity, which
of map inference, namely GPS trajectory and map. We also makes it hard to evaluate the influence of single error type
introduce our synthetic trajectory generator to simulate on the map inference performance, separately. Therefore,
various GPS trajectory errors. we design a synthetic trajectory generator to simulate each
type of errors with multiple severity levels to measure its
2.1 GPS Trajectory influence, respectively.
Nowadays, Global Positioning Systems (GPS) have been
vastly integrated into various types of portable devices, 2.2 Synthetic Trajectory Generation
including vehicles, mobile phones and other equipment.
These devices keep generating a wealthy amount of trajec- To make sure the synthetic dataset follows the same pat-
tory data that record vehicle and pedestrian’s movement. In tern of the actual vehicle movement, instead of generating
general, a trajectory is defined as follows: trajectories randomly, our generator takes the actual road
sequence that a vehicle travelled as input to generate a
Definition 1 (Trajectory). A trajectory T r is a sequence of synthetic trajectory. To achieve this, we first conduct a map-
spatial points T r : p1 → p2 → ... → pn sampled from matching [20] on a real trajectory dataset to obtain the actual
a continuously moving object. Each point pi consists of a 2- travel route of each trajectory (this step can be skipped if
dimensional coordinate hxi , yi i ∈ R2 and a timestamp ti . Points the ground-truth map-matching result is available). Then,
in T r are chronologically ordered and every two consecutive for each trajectory T r and its matching result M (T r), we
points pi pi+1 are connected and form a trajectory segment. perform the following process:
Note that, in some of the map inference solutions [16], (1) For each point pi ∈ T r, we find its matching point M (pi )
[18], the speed (spd) and heading (heading ) information on M (T r). M (pi ) is regarded as the estimated location of
are included as point attributes and utilised in the inference the vehicle at time ti .
process. (2) For each pair of matching points M (pi ), M (pi+1 ) of
In fact, due to the low precision of the GPS devices [19], two consecutive points pi , pi+1 , we find ∆t − 1 interme-
the trajectory points are usually sampled inaccurately, which diate points along the route between M (pi ) and M (pi+1 )
becomes the main challenge for trajectory-based applica- (∆t = ti+1 − ti ) which evenly divide the route into ∆t
tions. Overall, GPS errors mainly consist of two types: pieces. Therefore, each intermediate point represents the
Measurement Error: In a GPS, the location of a device is estimated location of the vehicle at a certain timestamp.
calculated by its relative distance to multiple satellites. As (3) We collect all points of M (pi ) and intermediate points to
form a point sequence. The sequence is sorted chronologi-
1. https://github.com/Hellisk/map-service cally and form a trajectory, called primitive trajectory.
3
Here, the primitive trajectory is the estimated vehicle and the GPS trajectory. The map inference from aerial im-
locations at every moment during the trip. According to the agery tries to identify the roads in the input aerial images
above analysis, map inference algorithms could encounter using the techniques of image processing and pattern recog-
following types of data quality issues: (1) Inaccurate trajectory nition. It has been an active research topic and recent trend
point caused by measurement errors. (2) Low sampling rate refers to deep learning solutions, like Convolutional Neural
which makes the shape of a trajectory much different from Network (CNN) and its variations, to generate pixel-level la-
the actual vehicle movement trace. (3) Trajectory disparity belling of roads [4], [24] or detecting the road skeletons [25].
which causes unpopular roads harder to be inferred. To The main challenge for the image-based map inference is
evaluate the impact of these issues on the performance that many of the roads are partially or completely occluded
of map inference, our generator produces corresponding by tree canopies, the shade of the high buildings, bridges
synthetic trajectory datasets by manipulating primitive tra- and interchanges. In contrast, the map inference from GPS
jectories in certain ways: trajectories does not have the continuity issue, but it faces
• To simulate the GPS measurement errors, we follow a severer data quality issues. Therefore, some recent works
bivariate normal distribution with N (0, σ 2 ) on both axes. combine both aerial imagery and GPS trajectory data for
By varying the standard deviation σ , we are able to map inference [5]. One simple but effective solution is to
generate trajectories with different measurement accuracy. add trajectory points as a feature layer to the existing deep
Hence, for each point in primitive trajectory, we randomly learning model for aerial imagery inference [5]. Meanwhile,
choose a new location according to the distribution and due to the real-time feature, the GPS trajectories are also
finally form a synthetic trajectory. used for map update. However, most of the state-of-the-
• For sampling errors, we re-sample the primitive trajectory art map update solutions follow the same pattern [2], [26],
by sampling periodically based on the sampling rate ∆t. which consists of map-matching (align trajectories to map),
• As each primitive trajectory is derived from a set of roads, map inference (infer new roads from trajectories that fail to
to achieve a certain percentage pct% of road coverage, be matched) and map merge (merge new roads with the
we incrementally add primitive trajectory that passes new previous map). As the current map-matching algorithms
roads until pct% of the total map roads are travelled. [20] have achieved high accuracy, the quality of map update
These synthetic datasets will be used in our experiments results heavily rely on the performance of map inference al-
to evaluate the robustness of the inference algorithms when gorithm. Hence, the map inference is still the core problem.
facing different types of errors. On the other hand, the map inference on GPS trajectory
has been studied for 20 years and is still a hot topic [13],
[15], [27]. To the best of our knowledge, three papers [6]–
2.3 Road Network [8] and a book [1] are published surveying the existing
In general, in the map inference problem, as we only focus methods. Regarding the surveys, Biagioni et al. [6] first
on the road layout on the map, we regard the map as a road reviewed the map inference methods in 2012, the author
network defined below. classifies the existing methods into three categories, namely
k-means clustering [28], [29], Kernel Density Estimation (KDE)
Definition 2. (Road Network) A road network (also known as [21], [30] and trace merging [31], [32], the paper chooses
road map) is a directed graph G = (V, E), in which a vertex three representative solutions [21], [28], [32] from the cat-
v = (x, y) ∈ V represents an intersection or a road end, and an egories and compare their inference results through both
edge e = (u, v, l) is a directed road connecting vertices u and v visualisation and a new quantitative measure proposed in
with a polyline represented by a sequence of spatial points, termed the paper. Results show that the KDE-based methods have
as mini nodes, i.e.: l : u → n1 → n2 → ... → nk → v . overall better accuracy and much lower running time, which
The road network is a highly geographical-based graph, is one of the motivations of their follow-up work on KDE
storing actual locations of intersections, shapes of road method [22]. [7] and [1] combine the k-means and KDE
segments, the length of paths, etc. Apart from the general into clustering-based method category. They add another
definition, in map inference, the road network generated by recent work [33] to the trace merging and propose the
different algorithms may contain various features. Most of intersection linking [34] category. They also summarise the
the algorithms [21], [22] and evaluation measures [23] are existing quantitative measures, including the graph sam-
only able to construct/measure planar maps, which require pling proposed in Biagioni’s survey [6], direct Hausdorff
that any two roads traversing each other must generate distance and path [23]/shortest-path-based [34] distances.
a vertex. The issue of planar maps is that they fail to Seven representative methods are introduced in detail and
represent bridges and tunnels where two roads overlap compared visually, which is by far the most comprehensive
without connection. In addition, some of the map inference works in surveying the map inference methods. Also, as
algorithms can only generate undirected maps and are they share their implementation publicly, it enables the
unable to infer features like turning restriction and speed subsequent authors to easily compare their solution with
limit, which should be remedied by a post-processing step the representatives. However, since they focus more on
such as trajectory map-matching [20]. the visual comparison and the quantitative measures [6],
[23], [34] are only conducted to three of the algorithms
[22], [33], [34] on one small dataset. There is a lack of
2.4 Map Inference Problem evidence to draw any conclusion on those candidates and
The current automatic map inference algorithms are mainly methodologies. Instead of conducting extensive evaluations,
built on top of two types of data sources: the aerial imagery the [8] briefly summarises the main features of existing
4
methods and publishes plenty of datasets, with detailed of the work from k-means clustering [9], [14], [28], [29], [40]
specifications, for future map inference evaluation. As a and KDE-based clustering [6], [21], [22], [42] mentioned by
great number of new methods [11], [13]–[18], [27], [35]– previous surveys fall into this category, together with some
[37] and new directions, like parallel processing [9], [36], new solutions [15], [36], [39]. The existing road abstraction
online map inference [16] and certain map type [38], have methods can be further classified based on their input data
been proposed after the last comprehensive survey, it is types (trajectory point/segment) and clustering methods (k-
worth revisiting the existing categorisation and conducting means/KDE/mean-shift/etc.). In general, the road abstrac-
a more comprehensive experimental study on both new tion method mainly contains the following three steps:
and classical methods to comprehensively contrast their Data preparation: An input trajectory can be regarded as
performance in more dimensions. either a set of points or a set of trajectory segments. For point
clustering, besides the point coordinates, the heading of the
point is also utilised in most clustering algorithms [14], [16],
3 I NFERENCE A LGORITHM S URVEY [36], [39] to distinguish roads with adverse directions and
The recent upsurge of map inference solutions introduces the intersections. For segment clustering, one of the major
new techniques to this field, which forces a review of concerns is that the trajectory segment cannot correctly
the current categorisation. We observe that some solutions depict the object’s movement trace when the sampling rate
improve the existing methods by providing new alterna- is low, which gets even worse when the object makes a turn
tive sub-modules [37], [39], while others belong to neither around intersections. Hence, some line-based methods [43]
of the existing categories [17]. Therefore, we propose a remove the segments around intersections by their heading
new categorisation, which consists of three classes: road changes while others, especially the KDE-based methods
abstraction, intersection linking and incremental branching. The [6], [21], [42], filter out such errors when generating the
major difference between these classes is their map inference skeleton map.
procedures. Both road abstraction and intersection linking take Clustering: The purpose of the clustering is to find
the entire trajectory dataset as input and construct the map the representative points/segments from the input
in one shot, whereas the incremental branching starts from an points/segments so that the final map can be obtained by
empty map and incrementally expand the map by inserting linking them. K-means is the most widely used algorithm
one or multiple trajectories at a time. Meanwhile, the road ab- for point-based clustering [9], [14], [28], [29] due to its
straction aims to directly extract the map skeleton by finding simplicity and effectiveness. Starting from a set of random
representative points/edges from the trajectories, while the seeds on the map, the algorithm iteratively adds GPS points
intersection linking first identifies the intersections and then to ”closest” clusters, relocates cluster centres, and remove
connects them based on the knowledge from trajectories. In distant points until no more adding or removing happens.
summary, we present most of the existing map inference al- In fact, the performance of the k-means clustering is sen-
gorithms in Table 1 based on our classification. In this table, sitive to the number of initial seeds and their locations, so
we summarise the solutions and their details in terms of [14] first identify the straight lines in the map by clustering
input trajectory feature (location only/heading/speed), tra- the trajectory points using DBSCAN algorithm and contin-
jectory sampling rate, output map type, evaluation method uously sample points as seeds alongside the straight lines,
(V=visual, GM=Graph Item Matching, GS=Graph Sampling, which ensures the k-means clusters to be close to the road
PD=Path/shortest-path-based Distance; details in Section 4) centreline. Mean-shift clustering works in a similar fashion
and if they are evaluated or compared in other works. Note as k-means but is claimed to be less vulnerable to low
that, the input trajectory feature does not include those used sampling trajectories [36], [39]. Most of the line-based clus-
in pre-processing or post-processing. tering utilises the idea of image processing. It first rasterises
In the following, we will summarise the major differ- the map region into a grid whose grid cells correspond to
ences between these categories, and introduce some repre- pixels in an image. The trajectory segments are then drawn
sentative methods which will be compared in our experi- on the map and each cell receives a count of trajectories
ments. passing it. As the trajectory segments may distort around
intersections due to low sampling rate, a smoothing process,
Kernel Density Estimator (KDE) [11], [21], [22], [37], [42], is
3.1 Road Abstraction usually applied to reduce the significance of noisy cells and
The road abstraction regards the regions where more trajec- emphasise the road centreline. Finally, the dense area on the
tories passing through to be more likely roads or intersec- map is regarded as roads and intersections. The simplest
tions. Given that trajectory measurement errors are usually way of extracting the road areas is to binarise each cell
modelled by 2D Gaussian distribution, there is a higher using a global threshold [21]. However, it does not work
probability that the trajectory sample is closer to the road well as the trajectories are not evenly distributed on the
centreline it travels. Therefore, the main process of road map. Hence, other works improve the thresholding by pro-
abstraction is to find the densest areas on the map and viding multiple thresholds for different road popularities
extract a road network from them. Clustering technique, [22] and road widths [42]. Alternatively, the discrete Morse
which divides the input points/segments into groups and theory is introduced for better centreline extraction [11], [37]
finds the representative point/segment for each group, is especially for less popular roads. Note that although the
the major technique used in this category. Due to its simple KDE-based methods are proved to be effective [13]–[15],
idea, plenty of solutions have been proposed under this [40], their output map does not have direction and other
category with various types of clustering algorithms. Most map features apart from the road skeleton since only the
5
TABLE 1
Summary of map inference methods
Category Name Input Features Sampling Rate Map Feature Evaluation Compared in
Road Abstraction
Edelkamp03 [28] high(1-15s) directed V [16], [22], [40]
Elleuch15 [9] undirected V
Point-based, k-means
Qiu16 [14] heading low(45s) directed V, GM
Stanojevic2018 [16] heading high(1-5s) directed,planar V, GS [17]
Chen16 [39] heading,speed high(1-5s) directed,non-planar V, GS, PD [16]
Point-based, mean-shift
Dorum17 [36] heading directed V, GM
Li16 [13] high(15s) directed V, GM
Point-based, others
Agamennoni11 [30] heading high(<3s) directed V
Davies06 [21] high(1s) undirected V [13], [22], [41]
Biagioni12 [22] high(2-6s) undirected V, GS, GM [7], [13]–[17], [40]
Line-based, KDE Ahmed15 [42] low(60-180s) undirected,planar V
Wang15&Dey18 [11], [37] low(40s) undirected,planar V
Line-based, topic Zheng17 [15] high(1-15s) undirected V, GS
Line-based, others Liu12 [40], [43] heading low(16-60s) directed V, GS
Incremental Branching
Cao09 [32] directed V [16], [22], [40], [41]
Niehoefer09 [31] high(1s) undirected V
Li12 [44] low(30s) directed V
Trace merging
Ahmed12 [33] low(30s) undirected V, GM [15]
Buchin [27] undirected V
Zheng18 [35] low(10-300s) undirected V
Map expanding He18 [17] heading directed,non-planar V, GS, PD
Intersection Linking
Fathi10 [45] high(1-5s) directed V
Karagiorgou17 [18], [34] heading,speed low(15-90s) directed V, GM, PD [15]
Wu13 [46] heading low(30-120s) directed V
Xie141516 [12], [41] heading,speed high(1-5s) directed V, GM
Wang15Novel [10] heading high(1s) directed V
trajectory locations are preserved after converting the map

region into an image.
Road generation: After the representative points/segments
are found, the final step is to connect them and form the
output map. For point-based clusters, one simple solution
utilises the trajectory continuity and connects clusters with
consecutive GPS points from one trajectory [28]. Li et al.
[13] further improve this idea by calculating a representative
point sequence from trajectories that connect the same sets
of clusters as a road. Other works directly create roads Fig. 1. Example of a wrong connection
without the help of trajectory information. The principal
curve method, which creates a representative curve from
a set of points, is perfect for this scenario [30], [36]. Qiu et category for our experimental study:
al. [14] assumes that most of the real-world roads are near
straight, so it connects the nearby clusters with a similar 3.1.1 Stanojevic and Abbar 2018 (RA-K-MEANS)
direction and extends the road until the direction changes.
The RA-K-MEANS [16] is the most recent k-means solution
For line-based clustering, the density-based image obtained
with a bunch of optimisations. It regards map inference
from KDE methods can be further thinned to extract road
as a multiple network alignment problem (MNAP), which
centrelines using the techniques like spline fitting [21] and
aims to infer the underlying graph given a set of observable
skeletonisation [22]. After the map skeleton is obtained, a
sub-graphs (GPS trajectories). For the k-means clustering,
scan of the image is needed to identify intersections based
the pair-wise point distance considers both geographical
on the value of nearby pixels and edges to be obtained
distance and heading difference, and each initial seed is
subsequently.
selected along the trajectories with no less than a certain
Overall, the road abstraction algorithms regard trajectory
distance away from existing seeds. Moreover, to further
as a set of points or segments. One advantage is its ability
reduce the inference of spurious roads, a graph spanner is
for parallel processing [9] as most of the operations are
applied during the road generation to remove roads that
localised. However, one major problem is the potential con-
are potentially generated by noisy or low-sampling-rate
nectivity issue. As exemplified in Fig. 1, two parallel roads
trajectories. Besides, it also provides an online model for
can be incorrectly connected if the trajectories are broken
stream-based map inference and map update.
down into points/segments. Although some effort has been
devoted to rectifying such errors using the knowledge from
typical intersection design [39] or additional map-matching 3.1.2 Biagioni and Eriksson 2012 (RA-KDE)
in post-processing [22], it is still challenging for such meth- The RA-KDE [22] is a typical KDE-based method with a
ods to infer non-planar features in the map. focus to tackle trajectory disparity problem. Instead of set-
Three representative methods are selected under this ting a global threshold when binarizing the grid density, it
6
proposes a grey-scale multilayer solution so that grids with incremental solutions, since all the trajectories passing the
different densities fall into different density layers. By doing same node are processed once, it is much easier to infer the
so, roads with fewer visits can be preserved in low-density outgoing directions correctly. Therefore, it performs better
layers. After smoothing the map by kernel density estimator, when inferring complex map elements, such as complex
the algorithm generates centrelines from each layer using intersections, non-planar tunnels and interchanges.
skeletonisation method, and merge them to form a final Overall, different from road abstraction which treats
map. The algorithm also further post-processes the map to each trajectory point/segment separately, the incremental
merge unnecessary intersections, remove unmatched roads, branching methods fully utilise the continuity of an entire
simplify road shapes and assign road directions. Currently, trajectory so that neighbouring roads that lead to different
the RA-KDE [22] is regarded as the representative method destinations can be better separated. However, it is hard
in this category. As shown in Table 1, it has been compared for such incremental methods to run in parallel. In addition,
the most times with various methods in different categories the scalability of these iterative methods, especially the trace
and performs well in both effectiveness and efficiency. merging, can be really poor as the size of the input dataset
increases. In our experimental study, we choose the follow-
3.1.3 Zheng and Liu 2017 (RA-TOPIC) ing two methods as the representative of this category.
Although being rasterised into grids, the RA-TOPIC [15]
does not leverage the KDE method for smoothing and road 3.2.1 Ahmed and Wenk 2012 (IB-TM)
generation. Instead, it converts map inference into a topic The IB-TM [33] is the only trace merging solution whose per-
extraction problem in Natural Language Processing (NLP), formance has been quantitatively evaluated [1]. The algo-
where a document (trajectory) is built up by a set of words rithm utilises the Fréchet distance to merge a new trajectory
(grid cell) and each word is related to some topics (road with existing roads. It can efficiently extract matched tra-
segment). It then utilises LDA and pLDA to extract topics jectory parts via a maximum distance threshold , and gen-
(road segments) from documents (trajectories). As a new erate new representative roads using minimum-link paths
method inspired by a different research area, we also include algorithm. Unmatched portions are inserted into the map
this algorithm in our experimental study. as new roads and connected to the matched road to form a
new branch and intersection. This method has a theoretical
3.2 Incremental Branching quality guarantee that the inferred roads are well-separated.
The idea of incremental branching is to incrementally insert
new roads to an empty map until all trajectories are exam- 3.2.2 He and Bastani 2018 (IB-ME)
ined. Overall, a map can be constructed in two incremental The IB-ME [17] is the only candidate for map expanding
ways: solutions, and it mainly focuses on high inference precision
Trace merging: The trace merging method incrementally and non-planar map features. The algorithm first infers
inserts trajectories into the map. For a new trajectory, the partial map correctly and then merges the constructed map
algorithm first tries to merge it to existing roads by map- portion to another map inferred by other high recall meth-
matching or distance measure calculation, such as simple ods [6], [16]. As introduced above, the method starts from
pairwise distance [32], [44] or Fréchet distance [27], [33], an initial map node and grows until covering the entire map
[35]. Successfully matched trajectory segments are utilised region. It can also infer complex regions as it fully utilises
to adjust road shape by introducing physical attractions the long-term travel information from the trajectories.
[32]. Meanwhile, the unmatched portion either forms a new
road from the existing map (partially matched) or creates
3.3 Intersection Linking
a completely new road (completely isolated). As one of the
early works in this field, Cao et al. [32] merges new trajec- The intersection linking approaches emphasise the correct
tories by only considering their segment-based distance and detection of intersections. Once the intersections are in-
direction difference. Once the merge process is completed, ferred, the remaining step is to link intersections using the
the algorithm defines the physical attraction force between information from trajectories. Different from the points on
pair-wise points so that the matched roads run towards the roads, trajectories passing intersections usually have differ-
newly merged trajectory whereas the unmatched roads run ent heading or speed, and more importantly, their heading
against it. Considering that roads have different lengths in or speed may change significantly during a short period.
the real world, Buchin et al. [27] proposes a method that Therefore, the intersections can be identified based on either
predefines multiple road lengths and generate roads with trajectory movement characteristics (speed, direction) [18],
different scales accordingly. This helps to infer small roads [34], [41] or point density [46]. The main steps include:
that are dominated by nearby main roads. (1) Scan the input trajectory points and extract the point
Map expanding: The map expanding is a new type of sequences around intersections, which satisfy the heading
methods initiated by He et al. [17]. Instead of iterating on and speed requirements (for example, heading changes
the trajectories, it starts from a map node. The major process more than 15◦ and average speed under 40km/h [34]).
of map expanding is to incrementally infer new roads from (2) Cluster the intersection points based on their proximity
the starting node based on the trajectories passing through using k-means [10]/random sampling [41] or based on turn
it. After the current node is visited, the map has branched similarity [34], [45] to extract the intersection region.
out and generated multiple new starting nodes, which are (3) Link intersections connected directly by trajectories. In
the starting points of new iterations. Different from other addition, since each intersection covers a region instead
7
of a point, the actual type of the intersection (crossing, T-

junction, roundabout, etc.) is also considered when connect-
ing the edges inside the intersection region.
As the intersections are usually very complex in real
life, a correct inference of intersection, especially its turning
pattern, can significantly improve the usefulness of the con-
structed map. Moreover, this can be conducted concurrently,
and the machine learning technique, which has been used
in road abstraction methods [30], [39], is also applicable. We
consider the following algorithm in our experiments.
Fig. 2. Visually compare two constructed maps, no clear winner
3.3.1 Karagiorgou and Pfoser 2017 (IL-TURN)
The IL-TURN [34] is a typical intersection linking approach.
Cheong et al. [48] proved that the problem is still NP-hard
Intuitively, a vehicle is making a turn at an intersection
even with the geometric information embedded. Therefore,
when its trajectory sees a significant direction change at a
all the existing map quality measures evaluate the map
low speed. Hence, this method first calculates the direction
similarity heuristically based on various intuitions, such as
change for each trajectory point and finds all points likely
the vertex/edge closeness [6] and the navigation correctness
to be near an intersection (significant direction change and
[23], [34]. However, since each measure only focuses on
low speed). The turning points are then clustered accord-
one or few map features, there is a lack of discussion that
ing to their proximity and a given maximum intersection
treats the map quality as a compounded problem which is
radius. One intersection is generated from one cluster, and
determined by multiple factors. In this section, we will first
the intersections are finally linked based on the trajectories
elaborate on the current map quality issues, introduce the
connecting them. To avoid large intersections, the algorithm
state-of-the-art measures, and then propose our design to
is further improved [18] by sorting turning points according
experimentally evaluate the existing measures.
to proximity and adaptively deciding the intersection size.
Moreover, in the preprocessing step, it categorises the input
trajectories according to their speed to infer three maps for 4.1 Map Quality Issues
different speed level, ranging from highways to low-level Note that in this section, the map quality issues we refer
lanes, which are conflated to form a final map. to are not the general issues in real-world map, instead, we
focus on the map errors that (1) can possibly be produced
by inference algorithms and (2) may cause quality issues
4 Q UANTITATIVE M EASURE E VALUATION when the map is used in applications such as navigation
As the performance of map inference algorithms is eval- and location-based services. Hence, we categorise map quality
uated by the quality of its constructed map, measuring issues as follows:
the map quality is crucial especially when comparing the Topological Error: Topological correctness is crucial for
performance of multiple inference methods. Note that most navigation systems. However, it happens quite often that
map inference datasets come with a ground-truth map (usu- two roads connected in the ground-truth map happen to
ally obtained from public map providers like Google Map be disconnected in the constructed map, or vice versa, as
and OpenStreetMap) for validation, so the core of the evalu- exemplified in Fig. 1. Moreover, most inference methods
ation is to quantify the similarity between the constructed [21], [22]/map quality measures [23] are only able to gen-
map and the ground-truth. Previously, most of the map erate/evaluate planar map, which does not match the real-
inference algorithms demonstrate their performance by vi- world scenario where transverse roads do not necessarily
sually overlaying their maps with the ground-truth. Despite generate intersections, such as underpass tunnels and inter-
its simplicity, intuition and usefulness in troubleshooting, changes.
visual comparison cannot quantify the map difference, as Geographical Error: Since the input trajectories suffer from
illustrated in Figure 2, making it hard to compare the GPS errors, it is common that the inferred road nodes/edges
performance of different algorithms. Moreover, many road deviate from their actual locations. These errors can affect
features, such as road direction, turning restriction and road the correctness of both navigation applications and location-
connectivity, are not displayable. Therefore, there is a strong based queries. Besides, the severity of a geographical error
need for quantitative measures in the map inference field. is not only determined by its absolute deviation, but also
However, the design of the quantitative measure is chal- relative distance which considers the map density of the
lenging. Intuitively, since the ground-truth map is usually surrounding area.
obtainable, the quality of a constructed map should be mea- Road Loss and Spurious Road: Due to the trajectory dis-
sured based on its differences to the ground-truth. It is obvi- parity [22], many of the roads in ground-truth map are not
ous that a map with better quality should be more similar to travelled by any vehicle. Hence, there is no chance for them
the ground-truth. However, the definition of similarity is am- to be constructed. Also, since the input trajectories contain
biguous. Note that a map is usually represented as a graph, outliers, many inference algorithms remove the roads con-
this problem is related to the graph matching problem [47] structed from fewer trajectories to avoid generating spuri-
in graph theory, which is associated with some difficult ous roads. Therefore, the constructed map usually contains
problems, like subgraph isomorphism (NP-complete), graph only a subset of roads in the ground-truth map, while
edit distance (NP-hard) and network alignment. Moreover, the road coverage is an important indicator of algorithm
8
effectiveness. Meanwhile, even with the noise reduction, it consists of four steps: (1) Randomly select a set of points on
is still common to have spurious roads constructed in the the constructed map as seeds and find the corresponding
map especially around the intersections. Those roads are seed points on the ground-truth map, each of which is
usually caused by incorrect linking between intersections or the closest point on road to a seed location. (2) Traverse
inexistent shortcuts. Both road loss and spurious road in- the constructed map starting from each seed point and
sertion can affect the navigation and location-based services generate a new point, named as marble, each time a certain
significantly. distance is passed. Stop the traversal until a given maxi-
Other Errors: These errors may not affect the quality of map mum distance is reached. (3) Do the same process for seed
navigation and location-based services considerably, but it points in the ground-truth map, and the generated points
can still cause confusion and misunderstanding to the map are named as holes. (4) Match the marbles with holes to
users. For example, many inference algorithms generate generate the matching ratio. The graph sampling method
roads and intersections with odd shapes, like the zigzag is able to detect topological errors, especially its variation
road pattern occurred frequently in KDE-based method [21] (TOPO method [16]), as the disconnect edges are not tra-
or wrong intersection layout. Besides, it is challenging to versed when generating the marbles/holes. Meanwhile, it
infer/measure parallel roads as their distance is usually is also good at evaluating geographical precision through
similar to the GPS error radius. Overall, these errors should the matching ratio. However, as pointed by Hashemi [8],
also be measured. the performance relies on the number of samples and the
traverse distance. It may overestimate the connectivity and
4.2 Quantitative Measures incur a huge computational cost if the extracted subgraphs
overlap a lot.
Several quantitative measures have been proposed recently
for map comparison, which falls into three categories ac- 4.2.3 Path-based Distance (PD)
cording to their methodologies: graph item matching, graph
sampling and path-based distance. This category [17], [23], [34] assesses the map correctness by
its performance of navigation. In the path-based evaluation,
4.2.1 Graph Item Matching (GM) the algorithm first picks up a set of source and destination
location pairs, which are randomly selected on map [34]
In this category, the constructed map G is regarded as a set
or from existing trajectories [17], and projects them on the
of nodes V and a set of edges E , and the ground-truth map
closest road on both constructed and ground-truth maps.
is G∗ = (V ∗ , E ∗ ). Hence, the quantitative measure can be
For each pair, the shortest paths on both maps are then
converted to a set similarity problem and answered using
obtained and their pair-wise distance is calculated using
the well-known measures of precision/recall/F-score as long
Discrete Fréchet distance [17], [18], [34] or average vertical
as the match of node (match(v, v ∗ )) and edge (match(e, e∗ ))
distance [18], which measures the largest distance between
is properly defined. In fact, match(v, v ∗ ) and match(e, e∗ )
two polylines. Overall, the map which has less gross pair-
are defined differently in various papers. We present a basic
wise distance is deemed to be more similar to the ground-
definition [18] as follows: For match(v, v ∗ ), two points are
truth. This type of methods mainly targets at the topology
matched only when their distance is less than a threshold
correctness of the map. The map with high correctness
. Meanwhile, the match(e, e∗ ) defines that two edges are
in path-based evaluation can better serve the navigation
matched only when both endpoints are matched, respec-
purpose. However, since the pair-wise distance is usually
tively. Based on these definitions, the precision/recall/F-score
dominated by the most significant difference along the path,
can be evaluated separately for nodes and edges [18] or
it has weak power to detect the geographical error, spurious
combined.
roads and other errors. Moreover, since it requires a large
Despite its simplicity, the GM also faces several issues:
number of sampled pairs to evaluate the road coverage,
(1) Nodes are not one-to-one matched, which means multi-
it can be very expensive computationally considering the
ple nodes can be mapped to the same ground-truth node.
complexity of the shortest path calculation.
It can potentially lead to some undetectable connectivity
issues. (2) The value of threshold used for defining node
matching may significantly affect the evaluation results. 4.3 Measure Evaluation Design
(3) Since the edges are defined by polylines, it is possible Overall, we think the current study on the quantitative
that two edges with extremely different shapes may share measure is still insufficient: (1) Given that the existing
the same endpoints, which is undetectable in this metric. measures are designed for different purposes, no research
Although some distance metrics, like Fréchet distance [33] has been done studying what types of map quality issues
and Hausdorff distance [43], were introduced to further each measure can actually identify. The measures are always
restrict the edge matching to solve the issue (3), it is still hard used by previous works blindly. (2) Some of the map errors,
to measure both the topological and geographical features like the road shape error and wrong intersection layout,
of the map simultaneously. are not mentioned by previous works. Hence, they may not
be detected by any of the existing measures. (3) Evaluation
4.2.2 Graph Sampling (GS) results from some measures, like the path-based evaluation,
This method was first proposed in [6] and adopted in [1], are not quite informative as the significance of the distance
[7], [15]–[17], [40]. The main idea is to randomly extract a value does not justify the quality of the map without proper
set of subgraphs from both the constructed and ground- reasoning. In this subsection, we will introduce our design
truth maps and compare their similarity. The whole process of experimental evaluation on existing measures to test
9
their ability in identifying different types of map errors spurious road inserted. Regarding the road loss (RL) error,
while finding the potential in developing new quantitative we remove a pct% of the roads according to their total length
measures. Our design mainly relies on a synthetic data tool: over the total map length.
the artificial map generator.
The idea of the map generator is to generate a set of 4.3.4 Other Errors
maps, each of which contains only one specific type of We simulate the zigzag road shape error (RSE) by interpo-
errors mentioned above that may occur in a constructed lating mini nodes to the roads. For each road in ground-
map. Note that the map constructed by map inference truth map, we interpolate a new point p0 for every θ meters
algorithms usually contains multiple types of map errors, along the road. In fact, the actual interpolated mini point
these synthetic maps can help evaluate how the quantitative p is not at p0 . Instead, it is chosen randomly with two
measures react to a certain type of errors. Given a ground- constraints: (1) pp0 is perpendicular to the road direction
truth map G∗ = (V ∗ , E ∗ ) and a set of trajectories running at p0 ; (2) pp0 .length = θ where θ is a distance threshold that
on it, the map generator produces an artificial map for each is very small to avoid introducing too many geographical
error by modifying the ground-truth map respectively as errors. Therefore, the severity of the road shape problem is
follows: determined by the distance θ, since more interpolated mini
points lead to more serrated road shape. For the intersection
4.3.1 Topological Error (TE)
layout error (ILE), we create the spurious intersections by
The topological error implies the case that an edge exists splitting the existing intersections. Specifically, we select
at the right location without connecting to the right node. a pct% of vertices Vil from V ∗ whose degree is no less
To simulate such error, we randomly select a pct% of roads than four. For each vi ∈ Vil , we split it into two points
Ete = e1 , e2 , ..., en from E ∗ to be the candidates. For each vi1 and vi2 which are symmetric, centred at vi , and satisfy
candidate ei ∈ Ete , we disconnect one of its endpoints, that dist(vi1 , vi ) = dist(vi2 , vi ) ≤ θ. Lastly, the previously
va for example, and find a new endpoint va0 on ei whose connected edges are redirected to the vertex that is closer
distance to va is: and vi1 and vi2 are connected to each other eventually.
dist(va0 , va ) = min(, µei .length) (1)

where refers to the GPS measurement error threshold,
which is used in various map inference and quantitative
measures. µ ∈ (0, 1) is a weight that ensures the distance
not to be too long compared with the original road length.
Then, we set va0 as the new endpoint, add va0 to V ∗ , remove
the adjacency information in va and eventually remove va
from V ∗ if it is no longer an intersection. By doing so, the (a) Original intersection (b) Topological error (pct = 30%)
map is barely changed geographically, while the roads are
disconnected topologically.
4.3.2 Geographical Error (GE)

We generate such error by introducing randomness to the
intersections. Here, we first set an error radius r and select a
pct% percent of vertices from V ∗ as Vge . For each vertex vi ∈
Vge , we randomly choose a location within error radius as (c) Geographical error (pct = 30%) (d) Spurious road (pct = 30%)
its new position. Then for each edge in E ∗ whose endpoint
has been drifted, we offset its mini nodes proportionate to
the drifted amount to maximally preserve the road shape.
Without changing the road connectivity, it is clear that the
modified map Gge is topologically isomorphic to G∗ .
4.3.3 Road Loss (RL) and Spurious Road (SR)

The introduction of the spurious road (SR) requires the use (e) Intersection error (pct = 30%) (f) Zigzag road shape (θ = 2m)
of trajectories. Following the idea that most of the spurious Fig. 3. Various map errors generated by synthetic map generator
roads are inferred from some noisy trajectories, we aim to
find the noisy trajectories by performing the following two In Fig. 3, we visualise different types of map errors
steps: (1) We perform a map-matching algorithm [20] on the generated by our artificial map generator. Besides, all the
trajectory dataset and select the trajectories whose match- pct% used in aforementioned road-related cases (TE, RL)
ing result contains breakpoints. (2) For each consecutive requires a random selection, which can be achieved in two
breakpoint sequence, we extract the matching roads of its modes: complete random (CR) and weighted random (WR).
preceding and succeeding points and create a spurious road Different from the complete randomness, the weighted ran-
to connect them directly. We then randomly select a set of dom assigns a weight to each edge/vertex so that higher
spurious roads and insert them to the ground-truth map. weight items are more likely to be selected. Here, the weight
The rate of outliers is controlled by the total length of the for each edge/vertex is calculated by the following steps:
10
(1) We perform map-matching on the trajectory dataset 5.2 Evaluation of Measures

to the ground-truth map, and each trajectory is converted
to a sequence of road edges. (2) For each road edge, we Before comparing the map inference algorithms, we first
count the total number of appearance in the map-matching evaluate the existing quantitative measures in identifying
results as its weight. (3) The weight of each vertex is the map errors discussed in Section 4. Our candidate measures
sum of all the weights of its adjacent edges. Therefore, the includes graph item matching [34] (GM), graph sampling
weighted random considers the popularity of road. Such [6] (GS) and path-based distance [23] (PD). We test them on
randomness can help test if the measures are sensitive to six types of generated maps (GE, TE, RL, SR, RSE, ILE) with
road importance. two of them using both complete-random and weighted-
random sampling strategies (TE-CR, TE-WR, RL-CR, RL-
WR). Tables 3 and 4 show the evaluation results under
5 E XPERIMENTS different error types. Due to the space limit, we only list
Our experimental study consists of two main components: the F-score for vertex graph item matching (GMV), edge
(1) Evaluate the ability of the current quantitative measures graph item matching (GME) and graph sampling (GS), and
in identifying various types of map errors; (2) Conduct we calculate the average Fréchet distance for path-based
comprehensive experiments on the representative map in- distance (PD). In general, it is clear that each measure has its
ference algorithms and compare their performance under weakness in identifying certain types of errors. For instance,
both synthetic and real GPS data. it is expected that the vertex-based graph matching is not
able to identify the topological error, road loss, spurious
5.1 Experimental Settings roads and road shape errors as the intersections remain
5.1.1 Datasets unchanged in those maps. However, it is also less sensitive
There are a few public datasets available, like the Chicago [6], to the cases where the intersections shift slightly (GE), or are
[7], [11], [14]–[16], [18] and Berlin [11], [37] which are widely duplicated (ILE) since most of those noise does not escape
adopted in existing map inference algorithms. However, the radius of vertex match and duplicate intersections can
they both suffer from severe trajectory disparity issue, as be matched to the same ground-truth without penalty. On
shown in Table 2. They are viable for visualisation but are the other hand, the edge-based graph matching and graph
not suitable in our quantitative experiments. Therefore, the sampling are capable of detecting the topological changes.
experiments are conducted on the Beijing dataset, which is In particular, both measures decrease considerably when
a commercial dataset collected by our industry collaborator. more topological errors (TE) or road removal (RL) takes
It contains trajectories by 5,000 taxis in Beijing for 5 days. place. Interestingly, the decline becomes less significant
From the original map Beijing-L, we further extract two sub- when the errors are introduced based on road popularity.
areas, namely Beijing-S and Beijing-M, which represent two The reason is that in the weighted random, we tend to
urban areas with different scales. Table 2 compares these remove less number of roads but with higher importance.
datasets and shows their specifications. In addition to basic However, both GME and GS scores only reflect the reduced
information, we also provide three additional statistics for number of removals, in other words, these measures do not
each dataset: (1) map density: the average length of roads take into account the importance of different roads.
(km) in a unit of map area (km2 ), (2) trajectory density: the Overall, the path-based measure is underwhelming in
average number of trajectory points in a unit of map area most scenarios, except SR and GE, due to multiple reasons:
(km2 ), (3) trajectory disparity: the percentage of roads that (1) Different from a percentage, an absolute value of dis-
are never or rarely (≤ 5 times) visited by trajectories. Over- tance sometimes cannot measure the difference between two
all, compared with Chicago and Berlin, the Beijing datasets maps, especially when comparing the results between maps
contain multiple advantages: (1) The variety of map scale with different scale and road density. (2) Since the measure
and map/trajectory density enables the test of algorithm’s tries to find the matching ground-truth to each path in a
efficiency and scalability. (2) Most of the roads (> 63%) in constructed map, it can hardly detect the road coverage and
the urban area (Beijing-S and Beijing-M) are travelled by connectivity problems, like RL and TE. Instead, it can better
at least one trajectory, which is much higher than Chicago detect spurious roads (SR) rather than other measures as
(3.2%) and Berlin (21.4%). It means that the generated map those roads are usually hard to be matched to ground-truth.
will have significant less road loss problem. (3) It provides (3) As it compares each generated link to all possible links on
heading and speed information for each trajectory point, the map to find the one with minimum Fréchet/Hausdorff
which is required by some of our candidate algorithms. distance, the complexity is at the level of O(V 3 ) [23]. Due
to the high complexity of Fréchet distance calculation and
5.1.2 Environment candidate generation, the measure runs extremely slow even
We perform our experiment on a single server, which con- with medium map size.
sists of two Intel(R) Xeon(R) CPU E5-2630 with 10 cores/20 In general, the graph sampling and edge-based graph
threads at 2.2GHz each, 378GB memory and Ubuntu 16.04. item matching have better overall performance on most
The server has large enough memory to ensure the al- error types. Compared with graph item matching, the graph
gorithms can be fully processed in memory. Algorithms sampling is slightly more sensitive to the noise and is very
run on three languages, namely Python-2.7 (RA-K-MEANS, sensitive to low road coverage. However, none of these
RA-KDE, RA-TOPIC and IB-ME), Java-1.11 (IB-TM and IL- measures takes into consideration road importance. As an
TURN) and Go-1.12.5 (IB-ME)) due to their reliant on differ- incorrect inference of an arterial road can lead to more
ent libraries. serious problems than a missing rural road, the lack of
11
TABLE 2
Comparison of public and private datasets
Name Input Trajectory Road Network Statistics

Trajectory Trajectory Sampling # of vertices # of Area Map Density Trajectory Visit Count Visit Count
Count Point Count rate(sec) + mini nodes edges Size(km2 ) (km/km2 ) Density(pts/km2 ) x = 0(%) 0 < x ≤ 5(%)
Chicago 889 118,360 3.62 9,391 23,602 31.48 38.47 3759.34 94.63% 2.18%
Berlin 27,185 192,194 41.65 5,894 13,678 31.16 22.96 6,168.74 39.70% 38.90%
Beijing-S 55,215 1,329,606 9.67 7,672 4,484 9.92 27.26 134,058.86 9.34% 27.55%
Beijing-M 268,492 8,837,029 11.20 41,353 22,580 56.99 24.18 155,075.62 8.11% 21.50%
Beijing-L 1,994,770 126,929,547 7.66 2,459,768 602,455 33404.56 2.58 3799.77 39.37% 23.69%
sensitivity to road importance can be a critical problem to the map quality gradually decreases when the sampling rate
map quality evaluation. Moreover, they find a hard time drops. This happens more seriously to line-based methods,
identifying intersection layout errors (ILE) and are unable to namely RA-KDE and RA-TOPIC, as the lines can no longer
detect road shape changes. Since both errors happen quite represent the road shape when the sampling rate is very low.
often in various inference algorithms [6], [21], it is clear In terms of the trajectory disparity, we test the algorithms by
that the current measurement system still requires more providing various percentages of the road coverage which
attention and further development. contains neither measurement nor sampling error, shown in
Fig. 4b. Note that although the road coverage reaches up to
60% in our experiment, most of the roads are travelled by
5.3 Evaluation of Inference Algorithms
only several times since the input trajectory set is chosen by
We evaluate our candidate inference algorithms on Beijing maximising the road coverage while minimising the input
dataset using the above quantitative measures. Note that the size. As shown in the figure, although we can clearly see
parameters used in the algorithms are mostly set to default the improvement of map recall, only less than half of the
value unless it is related to the expected GPS measurement roads can be inferred, which means that finding the rarely
error range. As all the algorithms assumes the input is travelled roads is still a challenging task. Many of them are
always inaccurate, they always set a threshold for tolerable either removed as noise or merged with neighbouring roads.
error radius which is usually between 0 50 meter. In this Among all the candidates, the trace merging method has
case, we set it as 20 meter according to our dataset. We con- the best detection rate. Since it processes every trajectory
duct the experiments from three perspectives: robustness, individually, it is more likely to find new road segments as
scalability and overall performance comparison. long as the trajectory is partially different from the existing
roads.
5.3.1 Robustness
The goal of robustness test is to reveal which algorithm is 5.3.2 Scalability
good at handling certain data errors and how they react The main objective of the scalability test is to answer
to those errors in general. Therefore, we generate synthetic three questions: (1) How the quality of the constructed
trajectory datasets through our generator on Beijing-S map map improves as the size of the input trajectory increases.
which only contains measurement error, sampling error (2) How the efficiency of algorithms scales with the size
and trajectory disparity issue, respectively. Note that we do of trajectory dataset. (3) How is the map size affects the
not evaluate RA-K-MEANS and IL-TURN as they requires efficiency of algorithms. Fig. 4c, 4d and 4e answer these
speed and heading as input, which is not available in questions, respectively. In Fig. 4c, we use graph sampling
synthetic dataset. Fig. 4a shows the inference quality of F-score to evaluate the map quality. Interestingly, not all
the candidate algorithms under different GPS measurement algorithms’ performance benefits from the increase of the
accuracy. It is shown that the increase of measurement error trajectory dataset, in particular, the KDE, k-means and inter-
does not affect the overall performance significantly until section linking methods. It is because although the increase
the error exceeds the preset error radius (20m). The reason is of trajectory size can slightly improve the road coverage,
that when a GPS sample exceeds the tolerable error range, it however, due to the poor denoise mechanism, more spu-
is likely to be treated as a new road element, so its precision rious roads are also introduced, which affect the precision
drops quickly as the graph sampling measure is sensitive significantly. For example, in KDE method, the increase of
to spurious roads. However, in another experiment we’ve input size can lead to more ambiguous road intersections,
done, we set the error radius to a higher value (50m) and which can potentially create more incorrect branches af-
found that the overall F-score of most algorithms dropped ter the skeletonisation process. Therefore, increasing input
by a visible amount due to a significant decrease in recall. dataset is not always a good idea unless the quality of input
The combined results show that a proper preset error radius is ensured by data preprocessing or a more robust algorithm
that matches the input is crucial. A strict preset value on a is chosen. The quality of intersection linking suffers from
low-quality input can cause low inference accuracy, while a low input size, especially when the trajectory sampling rate
loose preset range guarantees an incomplete map, even with is low where the intersections are hard to be detected.
high-quality input. Nowadays, as the accuracy of GPS po- Besides, although there are two algorithms that have very
sitioning devices keeps improving, the performance of map poor F-score, namely RA-TOPIC and IB-ME, their actual
inference algorithm can also benefit from it in the future. performance is not as bad as indicated. The main reason
On the other hand, we also varies the sampling rate while for their poor performance is low recall. Plus, the result
removing the measurement errors entirely. It is clear that from graph sampling measures usually further decrease the
12
TABLE 3
Measure results under different topological error (TE) and road loss (RL) ratio
Pct TE-CR TE-WR RL-CR RL-WR

(%) GMV (%) GME (%) GS(%) PD (m) GMV GME GS PD GMV GME GS PD GMV GME GS PD
10 99.35 96.94 91.54 1.07 99.49 97.29 94.88 1.03 98.93 95.76 93.13 1.02 99.54 96.72 96.69 1
20 98.77 94.16 83.31 1.16 99.27 95.20 90.86 1.05 97.97 91.01 86.71 1.01 99.40 93.78 93.78 1
30 98.39 90.89 73.58 1.39 99.22 92.47 85.69 1.09 97.27 86.19 81.88 1.01 99.06 88.56 91.23 1
40 97.91 88.40 65.41 1.60 99.04 89.74 80.20 1.17 96.61 81.25 77.94 1.01 98.64 82.38 88.14 1
50 97.71 85.98 60.92 1.81 98.85 87.03 74.66 1.32 95.92 74.55 76.32 1.01 98.21 74.77 85.27 1
60 97.39 82.66 57.11 2.00 98.64 84.14 68.34 1.54 95.78 68.37 75.06 1.01 97.96 65.35 81.61 1
TABLE 4
Measure results under other error types
Pct SR ILE σ GE θ RSE

(%) GMV GME GS PD GMV GME GS PD (m) GMV GME GS PD (m) GMV GME GS PD
5 99.09 97.43 98.42 1.41 1.0 99.77 97.84 1.45 5 1.0 1.0 95.01 6.80 2 1.0 1.0 86.95 13.91
10 98.16 94.95 97.02 2.32 1.0 99.46 95.11 1.88 10 1.0 1.0 91.84 8.12 4 1.0 1.0 87.74 13.37
15 97.75 93.20 95.29 3.33 1.0 99.20 92.27 2.31 15 1.0 1.0 90.74 9.37 6 1.0 1.0 87.75 13.03
20 97.51 91.81 93.80 3.82 1.0 99.14 90.38 2.63 20 1.0 1.0 89.08 10.74 8 1.0 1.0 88.20 12.92
25 97.24 90.49 92.04 4.61 1.0 98.94 87.87 2.88 30 1.0 1.0 85.71 13.48 10 1.0 1.0 88.47 12.93
30 97.00 89.57 90.38 5.50 1.0 98.80 84.53 3.11 40 1.0 1.0 83.09 16.33 12 1.0 1.0 88.70 13.06
(a) Synthetic measurement error (b) Synthetic road coverage (c) Map quality when varying input size
(d) Running time when varying input size (e) Running time on different data scales (f) Map quality comparison (Beijing-S)
Fig. 4. Summary of experimental results of map inference algorithms
score for lower recall solution since most of the seed points with the same or larger trajectory size. Most of the solutions
from ground-truth cannot find their correspondents on the struggle to finish the task within a reasonable time (< 5
inferred map. In fact, these two methods can achieve the hours) on even Beijing-M dataset but we can see the K-
best accuracy among all solutions, making them the perfect MEANS methods finish the task on Beijing-L efficiently, not
choices for larger input dataset thanks to strict noise control. to mention its ability for parallel processing which further
In terms of efficiency, Fig. 4d shows the scalability of those accelerates the process. From the results in Fig. 4e, we can
methods with fixed map size and increasing trajectory input. clearly see that even with the same input trajectory size,
To our surprise, the trace merging algorithm has the second- the map size can significantly affect the performance of all
best scalability among all candidates, which shows that the algorithms.
cost of the Fréchet distance calculation between every new
trajectory and the map has been fully optimised. On the 5.3.3 Overall performance
other hand, the topic model method has the worst scalability We evaluate the overall performance on 10,000 trajectories
since the topic extraction algorithms (LDA and pLDA) are in Beijing-S map, shown in Fig. 4f. In general, the classi-
all computational-extensive operations which heavily rely cal methods, like KDE and K-means methods, has been
on iterations. In contrary, the k-means-based method is very largely outperformed by recent solutions. However, there
efficient and also scales perfectly. The same applies to Fig. is nothing wrong with the clustering methods. Although
4e where the map size increases to Beijing-M and Beijing-L using the same rasterization technique, the topic model-
13
based methods achieve much better performance than the [4] F. Bastani, S. He, S. Abbar, M. Alizadeh, H. Balakrishnan,
KDE method in terms of both accuracy and road coverage, S. Chawla, S. Madden, and D. DeWitt, “Roadtracer: Automatic
extraction of road networks from aerial images,” in Proceedings
which indicates that the current way of road abstraction of the IEEE Conference on Computer Vision and Pattern Recognition,
from the cluster is still under development. Despite the 2018, pp. 4720–4728.
outstanding efficiency, the accuracy of the K-means method [5] T. Sun, Z. Di, and Y. Wang, “Combining satellite imagery and
is unsatisfactory. Moreover, since its accuracy does not al- gps data for road extraction,” in Proceedings of the 2nd ACM
SIGSPATIAL International Workshop on AI for Geographic Knowledge
ways improve as the input data grows, its potential for large Discovery, 2018, pp. 29–32.
scale map construction cannot be fully utilised unless some [6] J. Biagioni and J. Eriksson, “Inferring road maps from global
pruning step is introduced. The map expanding method positioning system traces: Survey and comparative evaluation,”
has decent inference accuracy, however, its road coverage Transportation research record, vol. 2291, no. 1, pp. 61–71, 2012.
[7] M. Ahmed, S. Karagiorgou, D. Pfoser, and C. Wenk, “A compari-
is underwhelming. Considering its ability of road feature son and evaluation of map construction algorithms using vehicle
inference (non-planar map, more accurate road shape), it tracking data,” GeoInformatica, vol. 19, no. 3, pp. 601–632, 2015.
is an ideal method for map update. Overall, despite its [8] M. Hashemi, “A testbed for evaluating network construction
relatively low recall, the topic model-based method has algorithms from gps traces,” Computers, Environment and Urban
Systems, vol. 66, pp. 96–109, 2017.
the best accuracy and overall performance among all other
[9] W. Elleuch, A. Wali, and A. M. Alimi, “An investigation of parallel
methods. Since the idea and solutions of topic model come road map inference from big gps traces data,” Procedia Computer
from another research field, it shows the potential of map in- Science, vol. 53, pp. 131–140, 2015.
ference problem to be solved by the idea from other research [10] J. Wang, X. Rui, X. Song, X. Tan, C. Wang, and V. Raghavan, “A
novel approach for generating routable road maps from vehicle
areas or through new techniques like machine learning. gps traces,” International Journal of Geographical Information Science,
vol. 29, no. 1, pp. 69–91, 2015.
[11] S. Wang, Y. Wang, and Y. Li, “Efficient map reconstruction and
6 C ONCLUSIONS augmentation via topological methods,” in Proceedings of the 23rd
In this paper, we present a comprehensive survey and SIGSPATIAL International Conference on Advances in Geographic In-
formation Systems. ACM, 2015, p. 25.
experimental study of existing map inference algorithms.
[12] X. Xie, K. B.-Y. Wong, H. Aghajan, P. Veelaert, and W. Philips,
Specifically, we propose a new categorisation method, com- “Road network inference through multiple track alignment,”
pare the representative algorithms experimentally and eval- Transportation Research Part C: Emerging Technologies, vol. 72, pp.
uate the existing quantitative measures. Besides, to test the 93–108, 2016.
robustness of algorithms and quantitative measures, we [13] H. Li, L. Kulik, and K. Ramamohanarao, “Automatic generation
and validation of road maps from gps trajectory data sets,” in Pro-
introduce a synthetic trajectory generator and an artificial ceedings of the 25th ACM International on Conference on Information
map generator to simulate different trajectory errors and and Knowledge Management. ACM, 2016, pp. 1523–1532.
map quality issues, respectively. According to our experi- [14] J. Qiu and R. Wang, “Inferring road maps from sparsely sampled
ments, we observe that besides their respective weakness, gps traces,” Journal of Location Based Services, vol. 10, no. 2, pp.
111–124, 2016.
the existing quantitative measures are unable to identify [15] R. Zheng, Q. Liu, W. Rao, M. Yuan, J. Zeng, and Z. Jin, “Topic
several map issues and do not consider road importance. model-based road network inference from massive trajectories,”
Regarding the candidate algorithms, the existing algorithms in Mobile Data Management (MDM), 2017 18th IEEE International
Conference on. IEEE, 2017, pp. 246–255.
struggle to guarantee performance when the GPS errors
[16] R. Stanojevic, S. Abbar, S. Thirumuruganathan, S. Chawla, F. Filali,
exceed their expected threshold, and they still find a hard and A. Aleimat, “Robust road map inference through network
time identifying roads that are rarely travelled. Moreover, alignment of trajectories,” in Proceedings of the 2018 SIAM Interna-
more input trajectories do not always lead to better inference tional Conference on Data Mining. SIAM, 2018, pp. 135–143.
result. Overall, we identify the method that has the best [17] S. He, F. Bastani, S. Abbar, M. Alizadeh, H. Balakrishnan,
S. Chawla, and S. Madden, “Roadrunner: improving the precision
scalability (RA-K-MEANS), the best accuracy (RA-TOPIC), of road network inference from gps trajectories,” in Proceedings of
and the best suitability for map update (IB-ME), respec- the 26th ACM SIGSPATIAL International Conference on Advances in
tively, and meanwhile point out the potential future research Geographic Information Systems. ACM, 2018, pp. 3–12.
directions. [18] S. Karagiorgou, D. Pfoser, and D. Skoutas, “A layered approach
for more robust generation of road network maps from vehicle
tracking data,” ACM Transactions on Spatial Algorithms and Systems
(TSAS), vol. 3, no. 1, p. 3, 2017.
7 ACKNOWLEDGEMENTS [19] A. Leick, L. Rapoport, and D. Tatarnikov, GPS satellite surveying.
The work was supported by the CCF-Huawei John Wiley & Sons, 2015.
DBIR2019001A and the Guangdong Key Laboratory [20] P. Newson and J. Krumm, “Hidden markov map matching
through noise and sparseness,” in Proceedings of the 17th ACM
Project (2017B030314073). SIGSPATIAL international conference on advances in geographic infor-
mation systems. ACM, 2009, pp. 336–343.
[21] J. J. Davies, A. R. Beresford, and A. Hopper, “Scalable, distributed,
R EFERENCES real-time map generation,” IEEE Pervasive Computing, vol. 5, no. 4,
[1] M. Ahmed, S. Karagiorgou, D. Pfoser, and C. Wenk, “Map con- pp. 47–54, 2006.
struction algorithms,” in Map Construction Algorithms. Springer, [22] J. Biagioni and J. Eriksson, “Map inference in the face of noise
2015, pp. 1–14. and disparity,” in Proceedings of the 20th International Conference on
[2] Y. Wang, X. Liu, H. Wei, G. Forman, C. Chen, and Y. Zhu, Advances in Geographic Information Systems. ACM, 2012, pp. 79–88.
“Crowdatlas: Self-updating maps for cloud and personal use,” [23] M. Ahmed, B. T. Fasy, K. S. Hickmann, and C. Wenk, “A path-
in Proceeding of the 11th annual international conference on Mobile based distance for street map comparison,” ACM Transactions on
systems, applications, and services. ACM, 2013, pp. 27–40. Spatial Algorithms and Systems, vol. 1, no. 1, p. 3, 2015.
[3] G. Cheng, Y. Wang, S. Xu, H. Wang, S. Xiang, and C. Pan, [24] L. Zhou, C. Zhang, and M. Wu, “D-linknet: Linknet with pre-
“Automatic road detection and centerline extraction via cascaded trained encoder and dilated convolution for high resolution satel-
end-to-end convolutional neural network,” IEEE Transactions on lite imagery road extraction.” in CVPR Workshops, 2018, pp. 182–
Geoscience and Remote Sensing, vol. 55, no. 6, pp. 3322–3337, 2017. 186.
14
[25] G. Máttyus, W. Luo, and R. Urtasun, “Deeproadmapper: Extract- [45] A. Fathi and J. Krumm, “Detecting road intersections from gps
ing road topology from aerial images,” in Proceedings of the IEEE traces,” in International Conference on Geographic Information Science.
International Conference on Computer Vision, 2017, pp. 3438–3446. Springer, 2010, pp. 56–69.
[26] H. Wu, C. Tu, W. Sun, B. Zheng, H. Su, and W. Wang, “Glue: a [46] J. Wu, Y. Zhu, T. Ku, and L. Wang, “Detecting road intersections
parameter-tuning-free map updating system,” in Proceedings of the from coarse-gained gps traces based on clustering.” JCP, vol. 8,
24th ACM International on Conference on Information and Knowledge no. 11, pp. 2959–2965, 2013.
Management. ACM, 2015, pp. 683–692. [47] F. Emmert-Streib, M. Dehmer, and Y. Shi, “Fifty years of graph
[27] K. Buchin, M. Buchin, D. Duran, B. T. Fasy, R. Jacobs, V. Sacristan, matching, network alignment and network comparison,” Informa-
R. I. Silveira, F. Staals, and C. Wenk, “Clustering trajectories for tion Sciences, vol. 346, pp. 180–197, 2016.
map construction,” in Proceedings of the 25th ACM SIGSPATIAL In- [48] O. Cheong, J. Gudmundsson, H.-S. Kim, D. Schymura, and
ternational Conference on Advances in Geographic Information Systems. F. Stehn, “Measuring the similarity of geometric graphs,” in In-
ACM, 2017, p. 14. ternational Symposium on Experimental Algorithms. Springer, 2009,
[28] S. Edelkamp and S. Schrödl, “Route planning and map inference pp. 101–112.
with global positioning traces,” in Computer science in perspective.
Springer, 2003, pp. 128–151. Pingfu Chao received the BE degree in au-
[29] T. Guo, K. Iwamura, and M. Koga, “Towards high accuracy road tomation from Tianjin University, China, in 2012
maps generation from massive gps traces data,” in Geoscience and and the ME degree in software engineering from
Remote Sensing Symposium, 2007. IGARSS 2007. IEEE International. East China Normal University, Shanghai, China
IEEE, 2007, pp. 667–670. in 2015. Currently, he is working towards the
[30] G. Agamennoni, J. I. Nieto, and E. M. Nebot, “Robust inference of PhD degree in the Data Science group of ITEE
principal road paths for intelligent transportation systems,” IEEE in University of Queensland, Australia, His re-
Transactions on Intelligent Transportation Systems, vol. 12, no. 1, pp. search interests include spatial-temporal data
298–308, 2011. management and trajectory data mining.
[31] B. Niehoefer, R. Burda, C. Wietfeld, F. Bauer, and O. Lueert, “Gps
community map generation for enhanced routing methods based
on trace-collection by mobile phones,” in 2009 First International Wen Hua is a Lecturer at the School of ITEE,
Conference on Advances in Satellite and Space Communications. IEEE, the University of Queensland. She received her
2009, pp. 156–161. PhD and Bachelor degrees in computer science
[32] L. Cao and J. Krumm, “From gps traces to a routable road map,” from Renmin University of China in 2015 and
in Proceedings of the 17th ACM SIGSPATIAL international conference 2010, respectively. Her main research interests
on advances in geographic information systems. ACM, 2009, pp. 3–12. include information extraction, knowledge graph,
[33] M. Ahmed and C. Wenk, “Constructing street networks from gps and spatio-temporal data management. She has
trajectories,” in European Symposium on Algorithms. Springer, 2012, published actively in reputed journals and in-
pp. 60–71. ternational conferences, and won several Best
[34] S. Karagiorgou and D. Pfoser, “On vehicle tracking data-based Paper Awards.
road network generation,” in Proceedings of the 20th International
Conference on Advances in Geographic Information Systems. ACM,
2012, pp. 89–98. Rui Mao received BS(1997) and MS(2000) in
computer science from the University of Sci-
[35] K. Zheng and D. Zhu, “A novel clustering algorithm of extract-
ence and Technology of China, and MS(2006) in
ing road network from low-frequency floating car data,” Cluster
Statistics and PhD (2007) in Computer Science
Computing, pp. 1–10, 2018.
from the University of Texas at Austin. After three
[36] O. H. Dørum, “Deriving double-digitized road network geometry years work at the Oracle USA Corporation, he
from probe data,” in Proceedings of the 25th ACM SIGSPATIAL In- joined Shenzhen University in 2010. He is now
ternational Conference on Advances in Geographic Information Systems. Associate Dean of the College of Computer Sci-
ACM, 2017, p. 15. ence and Software Engineering, Shenzhen Uni-
[37] T. K. Dey, J. Wang, and Y. Wang, “Graph reconstruction by discrete versity, and Executive Director of the Shenzhen
morse theory,” arXiv preprint arXiv:1803.05093, 2018. Institute of Computing Sciences. His research
[38] U. Blanke, R. Guldener, S. Feese, and G. Tröster, “Crowdsourced mainly focuses on universal data management and analysis.
pedestrian map construction for short-term city-scale events,” in
Proceedings of the First International Conference on IoT in Urban Space. Jiajie Xu received the MS degree from the Uni-
ICST (Institute for Computer Sciences, Social-Informatics and . . . , versity of Queensland in 2006 and the PhD
2014, pp. 25–31. degree from the Swinburne University of Tech-
[39] C. Chen, C. Lu, Q. Huang, Q. Yang, D. Gunopulos, and L. Guibas, nology in 2011. He is currently an Associate
“City-scale map creation and updating using gps collections,” in Professor with the School of Computer Science
Proceedings of the 22nd ACM SIGKDD International Conference on and Technology, Soochow University, China.
Knowledge Discovery and Data Mining. ACM, 2016, pp. 1465–1474. His research interests include spatio-temporal
[40] X. Liu, J. Biagioni, J. Eriksson, Y. Wang, G. Forman, and Y. Zhu, database systems, big data analytics, mobile
“Mining large-scale, sparse gps traces for map inference: com- computing and recommendation systems.
parison of approaches,” in Proceedings of the 18th ACM SIGKDD
international conference on Knowledge discovery and data mining.
ACM, 2012, pp. 669–677. Xiaofang Zhou received the Bachelor and Mas-
[41] X. Xie, W. Philips, P. Veelaert, and H. Aghajan, “Road network ter degrees in computer science from Nanjing
inference from gps traces using dtw algorithm,” in Intelligent University in 1984 and 1987 respectively, and
Transportation Systems (ITSC), 2014 IEEE 17th International Confer- the PhD degree in computer science from the
ence on. IEEE, 2014, pp. 906–911. University of Queensland in 1994. He is a Pro-
[42] M. Ahmed, B. T. Fasy, M. Gibson, and C. Wenk, “Choosing fessor of the University of Queensland. He is
thresholds for density-based map construction algorithms,” in Pro- the head of the Data Science Research Division,
ceedings of the 23rd SIGSPATIAL International Conference on Advances School of Information Technology and Electrical
in Geographic Information Systems. ACM, 2015, p. 24. Engineering. He is also a specially appointed
[43] X. Liu, Y. Zhu, Y. Wang, G. Forman, L. M. Ni, Y. Fang, and M. Li, adjunct professor at Soochow University, China.
“Road recognition using coarse-grained vehicular traces,” Hp Labs, His research is focused on finding effective and
2012. efficient solutions to managing integrating, and analysing very large
[44] J. Li, Q. Qin, C. Xie, and Y. Zhao, “Integrated use of spatial and amounts of complex data for business and scientific applications. His
semantic relationships for extracting road networks from floating research interests include spatial and multimedia databases, high-
car data,” International Journal of Applied Earth Observation and performance query processing, web information systems, data mining,
Geoinformation, vol. 19, pp. 238–247, 2012. and data quality management. He is a Fellow of IEEE.

A Survey and Quantitative Study On Map Inference Algorithms From GPS Trajectories

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

A Survey and Quantitative Study On Map Inference Algorithms From GPS Trajectories

Uploaded by

Copyright:

Available Formats

This article has been accepted for publication in a future issue of this journal, but has not been