Professional Documents
Culture Documents
Science
Article views: 7
Zhe Zeng a, Tong Zhangb, Qingquan Lic, Zhongheng Wud, Haixiang Zoue
and Chunxian Gaof
a
China University of Petroleum, School of Geosciences, Qingdao, China; bLIESMARS, Wuhan University,
Wuhan, China; cShenzhen Key Laboratory of Spatial Smart Sensing and Services, Shenzhen University,
Shenzhen, China; dNavInfo Co., Ltd., Beijing, China; eShenzhen Urban Planning & Land Resource Research
Center, Shenzhen, China; fDepartment of Communication Engineering, Xiamen University, Xiamen, China
ABSTRACT
ARTICLE HISTORY
1. Introduction
Global Positioning System (GPS) probe vehicles have become an important means to
gather real-time trac information in the eld of intelligent transportation systems. With
the advent of big data, the demand of geographic information services for transportation has also increased dramatically. In addition to real-time trac information, human
mobility patterns (Giannotti et al. 2011, Liang et al. 2012), driver experience (Li et al.
2011b), and many other factors can be obtained by mining massive GPS trajectories.
Map matching is an indispensable technology for probe vehicle data mining. Probe
vehicle GPS receivers are based on pseudo range code and their accuracy is too coarse
to identify all correct road segments on which the probe vehicle is driving. The position
zhangt@whu.edu.cn
Z. ZENG ET AL.
imprecision of the probe vehicle data requires map matching to provide accurate
location points. On the other hand, the volume of probe vehicle data is too large to
be processed if the vehicles have high sampling rates. Therefore, the GPS sampling
interval of probe vehicles is commonly approximately one minute (e.g., 20100 s) to
avoid collecting excessive positioning data. The imprecision and deciency of position
data produces GPS data consisting of discrete points, which deviate from the centerline
of the actual driving routes and which need to be recovered using map matching
approaches.
Map matching methods stem from car navigation and can provide matching
results with high accuracy and reliability by exploiting geometrical, topological,
probabilistic, and other advanced data/road characteristics (White et al. 2000, Quddus
et al. 2007, Li et al. 2013), especially when the sampling intervals of GPS signals are short
(e.g., approximately 1 s). Map matching methods for low frequency GPS data still need
to employ geometrical, topological, and probabilistic features of trajectories or roads.
However, the lost position information between the two adjacent track points becomes
more important since the sampling interval is longer. Thus, the implicit features between
the two adjacent track points should be found by searching a matched subpath, which
is a path between the two matched road segments corresponding to the two adjacent
track points, then a strategy can be determined for sequentially searching the global
optimal matched path to which all possible matched subpaths are connected.
Some matching features have been extensively employed for map matching, including distance, direction, speed, and road topology. Marchal et al. (2005) used Euclidean
distance as the primary feature for measuring proximity between GPS track points and
matched subpaths, and adopted an heuristic strategy to search the global matched
paths. Brakatsoulas et al. (2005) employed the Frchet distance for spatial proximity and
then implemented an incremental search strategy to recover global matched paths. In
addition to distance, which was employed as a basic matching feature, other matching
features have been synthesized into various map matching methods. Wu et al. (2007)
chose distance and direction of the vector of two neighboring track points to match
track points onto road segments. Newson and Krumm (2009) introduced a hidden
Markov model (HMM) map matching method, which adopted the distance of neighboring track points and the topology of their corresponding candidate road segments as
matching features for searching matched subpaths and determining the global optimal
matched paths. Lou et al. (2009) proposed an ST matching (ST) method that considers
the spatial geometric and topological structure of road networks, and the temporal/
speed constraints of the trajectories as the matching features for searching matched
subpaths and then exploited candidate graphs for determining global optimal matched
paths. Wang et al. (2011) synthesized distance and direction for matching single track
points and identied matched subpaths through travel time shortest paths between
consecutive points. Miwa et al. (2012) introduced the concept of a driver route choice
model into the process of map matching. The approach still exploited the direction and
distance as principal map matching features. Rahmani and Koutsopoulos (2013)
developed a variant of the ST matching method, which exploited global criteria for
identifying matched paths and employed features of the ST matching method. She et al.
(2012) and He et al. (2013) also employed distance, direction, and road topology features
and then introduced a matching strategy that self-adaptively chose the appropriate
Z. ZENG ET AL.
vk
pn
v4
p3
p2
v3
v2
p1
Vertex
Arc (road segment)
v1
GPS trajecotry
Realistic driving path
Realistic driving trajectory
p0
v0
Figure 1. An example of map matching for probe vehicle data. Red points show track points
sampled by the GPS receiver of the probe vehicle, and red dash lines connect these track points
into a GPS trajectory. The realistic driving trajectory is a moving curve of the probe vehicle in the
realistic driving path of a road network. The GPS trajectory is obtained by sampling the realistic
driving trajectory with GPS positioning noise. The map matching problem is to nd the realistic
driving path in the road network according to the GPS trajectory of the probe vehicle.
sponding features of a possible path and . Then the map matching problem is to nd
a path that satises
arg max
fSF0 ; F g
0
Algorithm 1:
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
Z. ZENG ET AL.
foreach aij in Ci do
Calculate one-to-one similarity Sone between the track point pi and candidate
road segment aij .
end
seqc .push_back(Ci );
end
/ * Step 2
*/
Sequence seqcp
;;
/ * sequence seqcp stores a sequence of two-dimensional arrays for pairwise
similarities (see Figure 2(b)).
*/
for i = 0 to n 1 do
Calculate pairwise feature FT of two adjacent track points pi and pi1
As
;;
/ * As is a two-dimensional array that stores pairwise similarities between the
two adjacent track points and possible matched subpaths of two corresponding candidate road segments (see Figure 2(b)).
*/
j
foreach ai in Ci do
0
foreach aji1 in Ci1 do
Search for the optimal subpath sub between candidate road segments
0
aji ; aji1
Calculate pairwise feature FP of the sub
Calculate pairwise similarity Spair between pairwise feature FT and FP
As [j][j]
Spair ;
end
end
seqcp .push_back(As )
end
/ * step 3
*/
Search an optimal path, , with maximal sum of one-to-one and pairwise
similarities from all possible combination of subpaths, sub .
The features retained in the GPS trajectories include geometric features of single
track points and associated features of two adjacent track points. The features in
possible matched paths also contain two parts, geometric features of single candidate road segments and associated features of two corresponding candidate road
segments for two adjacent track points. The geometric features of single track points
or single candidate road segment are called unary features, while the associated
features of two adjacent track points or two corresponding candidate road segments
are called pairwise features. The similarity between a possible matched path and the
GPS trajectory also includes one-to-one similarity and pairwise similarity. The one-toone similarity (Sone ) is a similarity of unary features between the track point and the
corresponding candidate road segment. The pairwise similarity (Spair ) is a similarity
(a)
Trajectory
pi
Sequence seqc
a i1
...
...
ai2
a i3
Ci
(b)
Sequence seqc
Ci
ai1
...
Ci+1
a1i+1
...
ai2
a2i+1
ai3
ai1
ai2
a1i+1 a2i+1 A
s
ai3
Sequence seqcp
(c)
ai1
...
a1i+1
...
ai2
ai3
a2i+1
Figure 2. Our proposed map matching approach. (a) Step 1: one-to-one point matching. Each
candidate road segment set, Ci, including candidates, a1i ; a2i ; a3i , is selected into the sequence
seqc by computing one-to-one similarity to track point pi. There is one-to-one correspondence
between the sequence seqc and the trajectory . (b) Step 2: pairwise matching. The lines between
the two adjacent candidate sets Ci and Ci+1 shows the matched subpaths between the two road
segment candidates. Pairwise similarities between the two adjacent track points and possible
matched subpaths of two corresponding candidate road segments are computed by the geometric
and topological features of the GPS trajectory. Pairwise similarities are stored in the two-dimensional
array As, which is orderly inserted into the sequence seqcp. (c) Step 3: optimal searching. The gray
lines are all possible matched subpaths. The optimal combination of all possible matched subpaths
are searched. The black lines shows the optimal combination of all possible matched subpaths.
Z. ZENG ET AL.
between pairwise features of two adjacent track points and those of two corresponding candidate road segments (i.e., those of possible matched paths of the two
candidate road segments). Algorithm 1 and Figure 2 give a detailed description of
the proposed framework.
3.1. Prerequisite
Denition 3.1: Let ~
a be a vector in a plane, P. An azimuth angle, ~
a of ~
a, is the angle
rotating from north to ~
a in the clockwise direction. And its range is 0; 2 .
Let ~
a and ~
b be two vectors in a plane, P. The turning angle, delt~
a; ~
b,
is the angle rotating from ~
a to ~
b. Its range is ; , and is positive if the angle rotates in
Denition 3.2:
Proposition 3.3:
Proof.
(1)
ds
:
ds
We also dene
T Ts _ s:
Remark 1: Since s is parameterized by the arc length s, T s _ s is a unit tangent
vector, and so
T s cos s; sin s;
s 2 ; :
Since we can always move a tangent vector to the origin by parallel translation, as a
convention, we regard each tangent vector as an element in the vector space R 2 (rather
than the ane space R 2 ). In what follows, when we speak of rotation of a tangent
vector, it means that we rst regard the tangent vector as a vector at the origin and then
rotate.
Denition 3.5:
Given s0 2 ; ; 2 R satisfying
Ts0 cos; sin;
: ; ! R
~ s0 ;
s ! s s;
~ s0 measures the total angular changes of tangent vectors Tt from s0
where s;
to s along the curve .
Remark 2:
j;
s
(2)
where s is the variation of the arc length, and is the corresponding variation of
s (see Figure 3).
Remark 3: The curvature describes the rate of change in angle of the tangent vector
T s with respect to s. In other words, it reects the shape of the curve.
s)
T(
P1
P
T(s+s)
Z. ZENG ET AL.
T(
s)
ks < 0
N(
N(
T(
s)
s)
T(
s)
s)
T(
N(
s)
N(
s)
s)
10
ks > 0
ks > 0
ks < 0
Figure 4. Sign of signed curvature (left-handed coordinates). The sign of the signed curvature is
positive if the tangent vector rotates clockwise along the curve in the direction of increasing s or
negative if counterclockwise.
d
:
ds
(3)
Remark 4: In this article, the map projection plane is a left-handed coordinate system
in which positive rotation is clockwise about the axis of rotation. Therefore, the signed
curvature is positive if the tangent vector rotates clockwise along the curve in the
direction of increasing s or negative if counterclockwise (see Figure 4).
Denition 3.8: Let s : ; ! R 2 be a smooth plane curve parameterized by
the arc length s. Given a; b 2 ; and a < b, we dene the signed curvature integral
b
s ds:
a; b
(4)
Remark 5: The curvature integral a; b is the total angle change of tangent vectors
Ts from a to b, along the curve .
Let s : ; ! R 2 be a smooth plane curve parameterized by
the arc length s, and a; b 2 ; , a < b. Assume that ~
a and ~
b are two tangent vectors at
a and b respectively. If a; b 2 2; 2, then
Proposition 3.9:
8
b ~
a
< ~
a; b s ds ~
b ~
a 2
:
a
~
b ~
a 2
where ~
a and ~
b are the azimuth angles of tangent vectors ~
a and ~
b, respectively.
Proof.
b
s ds
d
ds b a;
a ds
(5)
11
where a; b are the scalar angles of tangent vectors at the point a and b,
respectively, and are related to the azimuth,
a ~
a 2nn 2 Z; b ~
b 2mm 2 Z;
let
b a; ~
b ~
a:
Thus,
2m n:
(6)
Since 2 < < 2; 2 < < 2, then 2 < m n < 2. There are four
cases to consider: if > 0; < 0 then m n 1; if > 0; 0 then
m n 0; if < 0; 0 then m n 1; if < 0; < 0 then m n 0.
Proposition 3.10:
8
a; ~
b
< delt ~
a; b s ds delt ~
a; ~
b 2
:
a
delt ~
a; ~
b 2
b
(7)
where delt~
a; ~
b is the turning angle between the two tangent vectors ~
a and ~
b.
Proof.
Remark 6: Propositions 3.9 and 3.10 introduce the computational method for the
curvature integral over the range 2; 2. The integral can be calculated by either
azimuth angles or turning angle of two tangent vectors at starting and ending points.
The curvature integral of the GPS trajectory between the two adjacent track points rarely
exceeds the range of 2; 2. Therefore, the map matching method can exploit the
angles to compute the curvedness feature of GPS trajectories.
12
Z. ZENG ET AL.
(9)
C
B
B
X(North)
(a)
(b)
Figure 5. Orientation of signed curvature (left-handed coordinates). (a) det(O) > 0. (b) det(O) < 0.
13
pn
p1
p0
!
pi1 pi 1 i n is composed of two neighboring points (see Figure 6), and can be
approximately considered as the tangent vector at the point pi1 . The rotation angle of
the tangent vector between the two points is far less than 2. Then, from Proposition
3.10, the curvature integral of can be discretely computed as
pn
n1
X
0 n
p ; p
s ds
pi ; pi1 ;
(10)
p0
i0
i1
p0 ; pn
n1
X
pi ; pi1 p0 ; q1 q1 ; q2 ql ; pn ;
(11)
i0
k
X
delt ai1 ; ai ai ;
(12)
i2
where ai is curvature integral between the rst shape point (i.e., a discrete point on a
planar curve) and the last shape point within the arc, which can be computed using
14
Z. ZENG ET AL.
Equation (11), deltai1 ; ai is the turning angle between the arcs ai1 and ai , which can
be computed as
alast
afirst
delt ai1 ; ai delt ~
i1 ; ~
i ;
(13)
afirst
is
where ~
alast
i1 is the vector formed by the last two shape points of arc ai1 , and ~
i
the vector formed by the rst two shape points of arc ai (see Figure 7).
(14)
a6
pi
a8
a7
a2
a1
pi-1
a4
pi+1
a9
a5
Vertex
Arc (road segment)
a3
pi-2
Figure 7. Curvedness computation for matched path and GPS trajectory. The GPS trajectory contains
track points pi2, pi1, pi, pi+1, and a possible matched path includes arcs a3, a4, a7, and a8. For
adjacent track points, pi1 and pi, their candidate road segments are respectively a4 and a7, and their
matched subpath consists of a4 and a7. The curvature integral of the matched subpath includes the
curvature integral within each arc (i.e., a4 and a7), and the change value of the tangent directions
between the two connected arcs. The change value between the two arcs (i.e., a4 and a7) can be
afirst
The curvature integral
computed using the turning angle between the vectors ~
alast
i1 and ~
i
between the two adjacent track points pi1 and pi can be computed by the turning angle of the
!
vectors p!
i2 pi1 and pi pi1 , which are considered as approximately tangent vectors.
15
the sign of pi1 ; pi does not change and the value in Equation (14) can be directly
calculated by Equation (5) or (7). If the signs of s at the two adjacent track points are the
opposite, the sign of pi1 ; pi does change once and the value is equal to
! !
deltp
i2 pi1 ; pi pi1 because the is limited to the range ; .
(15)
where dpi ; ai is the distance from track point pi to candidate road segment ai . In this
article, the parameters 1 ; 2 ; nd were set to 10, 0.03, 1.2, respectively. In our tests, we
found this combination of parameters provided the best results. The radius of searching
for candidate road segments is 25 meters.
(2) Implementation of pairwise matching (Lines 919 of Algorithm 1).
The process of pairwise matching consists of four stages.
(a) Computing pairwise features between the two adjacent track points (Line 11).
The pairwise features of track points include range, evaluated by the
Euclidean distance ltraj between the two adjacent track points, and curvedness, evaluated by the curvature integral traj between the two adjacent track
points, computed using Equation (14).
(b) Searching matched subpaths between the two candidate road segments
(Line 15).
The matched subpaths are considered as shortest paths between the two
road segments, which are candidates for the two adjacent track points.
Therefore, they can be obtained by label setting algorithms for shortest
16
Z. ZENG ET AL.
path (Dijkstra 1959). The weight of road segments may be either the
distance or the travel time when computing shortest paths. In general,
detours do not happen between the two adjacent track points because
the driving distances of probe vehicles within sampling intervals are very
short (Lou et al. 2009). However, the RDP can contain detours for avoiding
congested roads when the probe vehicle drive a long distance. Thus, the
RDP of probe vehicles should be shortest paths with minimum distance for a
small search scope, whereas they should be shortest paths with minimum
time for a large-area search. Therefore, we selected the distance as the
weight for computing shortest path between the two candidate road
segments.
(c) Computing pairwise features of matched subpaths (Line 16).
The pairwise features of matched subpaths also include range and curvedness. The ranges are the distances of subpaths, lsp , which are the results of the
label setting algorithm. The curvedness, sp , can be measured by the curvature
integrals of the subpaths, Equation (12).
(d) Computing pairwise similarity between matched subpaths and adjacent track
points (Line 17).
The information, which is partially retained between the two adjacent track points of
the GPS trajectory after sampling, primarily contains the distance and the curvedness.
For example, see Figure 1, the features of both the distance and the curvedness in
consecutive road segments v1 ; v2 ; v2 ; v3 ; v3 ; v4 of the RDP are similarly remained in
sequential track points p1 ; p2 ; p3 . The two aspects of information can be respectively
measured by corresponding pairwise features in GPS trajectories and candidate matched
paths. The pairwise matching criteria to identify the best RDP can be identied from
many candidate paths between the two adjacent points using similarity in both distance
and curvedness. Thus, the pairwise similarity Spair mainly contains the range, Srange , and
curvedness, Scurve , similarities. The pairwise similarity is
Spair Scurve Srange :
(16)
b
if
x
2
x
;
x
>
N
N
N1
N
>
:
0
if xx0 ; xN
where x presents the dierence values, and Sx is the similarity metric. The range of
similarity metric is 0; SMax . The slope and intercept of piecewise linear functions for
Srange and Scurve are listed in Table 1.
17
k1
k2
k3
k4
k5
k6
Slope
0.0502
0.025
0.0125
0.0062
0.0023
0.0008
Srange
Intercept
b1
10
b2
7.47
b3
4.98
b4
3.09
b5
1.54
b6
0.63
Scurve
Dierence
x0
x1
x2
x3
x4
x5
x6
valueb
0
100
200
300
400
600
800
k1
k2
k3
k4
k5
k6
Slope
0.0889
0.0444
0.0444
0.0222
0.0111
0.0037
Intercept
b1
10.0
b2
8
b3
8
b4
5
b5
3
b6
1.34
Dierence
x0
x1
x2
x3
x4
x5
x6
valuec
0
45
90
135
180
225
359
Notes: a The linear piecewise function for Srange is obtained by tting the exponent function for range dierence as
described by Newson and Krumm (2009).
b
Units = meters.
c
Units = degrees.
18
Z. ZENG ET AL.
kilometres
Dataset 2 contained trajectories along three bus routes. For each bus route, a
trajectory was recorded by one bus within one day. The sampling interval was
20 s. The trajectory of each bus route included enough track points to cover all
road segments of the route. These selected bus routes included a regular route,
denoted as Type AB, travelling directly from A to B, and a loop route, denoted as
Type Loop (see Figure 8). Detailed information of the three bus routes is listed in
Table 2.
In Dataset 3, three roads along bus route 707 were selected as complicated situations
of map matching. For each complicated road, the trajectories were obtained by
67 buses, which passed through the selected road within weekdays. These
selected roads contained typical complex structures, such as interchange ramps,
roundabouts, and local/express lane systems (see Figure 9). The ramps and roundabouts not only possess curvature but also are neighbors to many similar roads.
The local/express lane system consisted of parallel roads that are very confusing
for map matching. The detailed contents can be found in Table 3.
Table 2. Three bus routes for testing map matching methods.
Route ID
Total points of the original trajectory
Route type
Route length (meter)
Number of road segments
Cumulative angle changea
707
800
Type AB
28,302
333
3717
B829
1115
Type Loop
8739
104
2073
B633
459
Type AB
16703
236
3051
Note: a The cumulative angle change can be calculated from Equation (20), in units of degrees.
(a)
19
(b)
local
express
express
local
(c)
Figure 9. Complex road structures. (a) Left directional map, (b) roundabout, and (c) local-express
lane system.
Table 3. Three road segments with complicated matching conditions.
Road segment ID
Road type
Length (meter)
Cumulative angle
changea
Number of similar
roads
Number of testing
trajectories
Location
1078808
Left directional ramps
269.87
269
1078847
Roundabout
170
195
1439160
Local/express lane system
174
0
11
196
196
196
Roundabout at
Road Xixiang
20
Z. ZENG ET AL.
PWM
(18)
Clen
path
(19)
illustrated in Figure 10 (d1 is the length of incorrectly matched paths and d0 that
of the correct path). This criterion reects the length deviation of the wrongly
matched path from the correct path at the location where error matches occur.
Deviation values larger than one indicate a relatively worse degree of t of the
matched results.
(3) Angle closeness (AC). A path, Pt , consists of many consecutive arcs, which are
formed by a sequence of shape points p1 ; p2 :::pn . The cumulative angle change
along the path is
Acum Pt
n1
X
! !
jdeltpi1 pi ; pi pi1 j:
(20)
i2
The angle closeness is the dierence of the cumulative angle change between an
incorrectly matched path and the correct path,
Cangle Acum Pterr Acum Ptright ;
(21)
where Pterr is the wrongly matched path, and Ptright is the correct path at the
location where the incorrect match occurs. For example, in Figure 10, Pterr consists
of arcs a4 ; a5 ; a6 and Ptright only includes arc a2 . Thus, Acum Pterr is determined by
shape points of a4 ; a5 ; a6 in sequence and Acum Ptright is, likewise, computed by shape
d1
a5
a6
a4
a2
a1
a3
d0
track point
correctly matched point
vertex
21
points of a2 . Cangle shows the curvedness error between the incorrectly matched path
and the correct path.
Total number
of track points
118
48
79
48
56
70
60
57
63
32
33
70
114
93
78
135
96
94
80
79
Average
PMW (%)
CURF
5.93
4.17
0.00
0.00
7.14
2.86
5.00
0.00
1.59
0.00
0.00
1.43
0.00
3.23
0.00
2.22
1.04
4.26
3.75
0.00
2.13
CURF_T
5.93
6.25
1.27
0.00
8.93
5.8
6.7
0.00
3.17
3.13
0.00
2.86
0.88
3.23
1.28
2.22
2.08
4.26
3.75
0.99
3.13
HMM
6.78
6.25
2.53
2.08
8.93
4.29
8.33
0.00
6.35
0.00
6.06
2.86
1.75
4.30
3.85
1.48
4.17
5.32
5.00
1.98
4.12
ST
6.78
6.25
1.27
6.25
7.14
4.29
8.33
1.75
6.35
9.38
6.06
5.71
2.63
8.60
1.28
3.70
4.17
2.13
5.00
0.99
4.90
22
Z. ZENG ET AL.
707
(a)
20%
15%
10%
5%
0%
B633
20%
Method
CURF
15%
HMM
10%
ST
5%
0%
B829
20%
15%
10%
5%
0%
20
40
60
Interval (second)
80
100
(b)
PWMs generally increase as the sampling interval increases, but the ST and HMM
methods always produce larger PWM than our proposed CURF method. Due to the
information loss of trajectory with large sampling interval, the PWMs of the CURF
method still tend to increase, but this increase is slower than for the other methods.
Incorporating the curvedness can curb error growth with increasing sampling
intervals.
23
707
10
1
B633
Method
CURF
10
HMM
ST
1
B829
10
1
20
40
60
Interval (second)
80
100
Figure 11(b) directly compares the PWMs from the three methods, and the shadow
buer shows PWM uctuations. The PWM range from the CURF is signicantly less than
the other two methods, indicating better stability under dierent situations.
Consequently, our proposed method incorporating curvedness maintains more stable
performance under dierent situations.
Figure 12 compares LC for the three methods. The box plot represents three quartiles
of LC for an individual matching method. In contrast to PWM, LC is not aected by the
sampling interval. The quartile ranges for the CURF method are always less than for the
ST or HMM methods. Thus, the proposed CURF method is signicantly better than the
other methods at choosing a closer optimum path.
Matching using the CURF, incorporating curvedness, rarely produces a dramatically
incorrect path. In other words, including curvedness prevents the matching path from
seriously deviating from the RDP in length under dierent conditions, such as dierent
routes or sampling intervals.
Figure 13 presents bar plots for the AC on the three bus routes for the three
matching methods. The vertical axis units are a multiple of 360 degrees, while the
horizontal axis is the sampling interval. Figure 13(a) shows the sum of AC along a bus
route and Figure 13(b) the average. The total AC from the CURF method is much less
24
Z. ZENG ET AL.
(a)
707
30
10
0
B633
30
Method
CURF
HMM
ST
20
10
0
B829
30
20
10
0
30
60
90
Interval (second)
(b)
707
1.00
0.75
20
0.50
0.25
0.00
B633
1.00
Method
CURF
HMM
ST
0.75
0.50
0.25
0.00
B829
1.00
0.75
0.50
0.25
0.00
30
60
Interval (second)
90
Figure 13. Dataset 2: Angle closeness of matching on three dierent routes. (a) Sum of angle
closeness and (b) average of angle closeness.
than that of the other two methods, and the average AC is also less than or equal to
that of the others. Thus our proposed CURF method consistently outperforms the two
other methods.
25
0%
Ramp of interchanges
Method
CURF
40%
HMM
20%
ST
0%
Roundabout
40%
20%
0%
20
40
60
Interval (second)
80
100
26
Z. ZENG ET AL.
Map matching for probe vehicle data has been extensively studied by exploiting
GPS trajectory features. These features, which are adopted into map matching,
primarily include geometrical and topological properties of a GPS trajectory. Two
factors aect matching quality: information loss between the two consecutive
sampling points, and confusion amongst several similar matched candidates of
track points. Larger time intervals can increase information loss under complicated
road structures. The HMM and ST matching methods, which incorporate Euclidean
distance, direction, network distance, etc., were compared with the proposed CURF
matching method, which incorporates those features but also adds the curvedness
of GPS trajectories.
40000
30000
Running time(ms)
4.3. Discussion
Matching Methods
CURF
HMM
20000
ST
10000
0
100
200
300
400
Number of track points
500
27
The CURF method performs more eectively than the ST and HMM methods with
respect to increasing sampling intervals and under dierent circumstances, including various types of routes and complicated roads.
PWM variation of the CURF method is more stable than the others on dierent
routes.
The deviation of both LC and AC are signicantly less for the CURF method than
the others.
The CURF matching method performs reliably under various disadvantageous situations, which shows that including the curvedness in matching can relieve the negative
impact of complicating factors. Especially, in fast-growing cities of developing countries,
the roads update so fast that the structures of some road networks have become very
complicated. Traditional map matching may not select the correct road segments from
similar candidates in such complex situations. However, our outcomes show that including the curvedness can help the matching method choose correct driving paths.
Figure 16 shows an example of map matching using curvedness feature in a complex
situation.
Two further interesting aspects were illuminated in the experimental results.
The rst is a compromise between the two factors restricting the matching quality.
Comparing PWMs on dierent planned routes, the CURF method achieves the best
accuracy for 40 s sampling interval (Figure 11), producing PWMs on the three routes
almost equal to zero. Mismatching arises largely from information loss and/or candidate confusion. The best matching at 40 s sampling interval is the optimal trade-o
between these two eects. Less information is lost in the GPS trajectories than for
60 s or 100 s intervals, while more matched road segments can be correctly
determined by the CURF method than at 20 s interval, because the closer distance
between the two adjacent sampling points results in more concentrated candidate
segments than at 40 s interval. Thus, the 40 s interval is a trade-o between
information loss and complex matching conditions. This does not occur for the
conventional methods because elementary features are unable to achieve a tradeo situation.
The second is the loss of information at relative long intervals. Lost information
between the two adjacent sampling points can be recovered by assuming that the
actual driving path is the shortest path between the matched road segments. This
performs well when the sampling interval is relatively small. However, for example in
Figure 11, the sharp rise of PWMs from 80 s to 100 s intervals is partially because the
shortest path assumption can no longer satisfy the context of every pair of adjacent
track points with the increase of sampling intervals. The incorrect matches of the three
methods at below the 80 s interval do not include signicant error caused by the shortpath assumption. This error occurs approximately 30% of the time for the PWMs of the
CURF method at 100 s interval, given that the PWMs are not large. Thus, the actual path
probe vehicle drove becomes harder to completely recover when the sampling interval
increases above a certain level.
The CURF method shows little or no extra workload than the (computationally similar)
ST method. The CURF method still employs conventional techniques for optimal
28
Z. ZENG ET AL.
leftdirectional lane
local lane
3
2
express lane
Legends
Track points of GPS trajectory
Matched points by HMM method
Matched path by HMM method
150
meters
300
Figure 16. An example of map matching using curvedness in complicated local/express road
systems. The matched paths by three methods diverge at track point 3 and 4. The left-directional
lane is selected by the ST and HMM method, and the local lane is selected by the CURF method. The
ST and HMM methods are incorrectly matched at track points 3 and 4. However, the CURF method
gives the correct matched path. Without constraint by curvedness, the ST and HMM methods select
road segments with the best spatial approximation only to these track points. From the constraints
introduced by curvedness features, the CURF method can exclude the subpath matched by the ST or
HMM methods between the track points 4 and 5 because its curvature integral is 2~, which does not
match the curvature integral between the track points 4 and 5. The curvature integral of both the
matched subpath and the two adjacent track points can be evaluated by Equations (12) and (14),
respectively. After global optimization, the CURF method can also exclude the subpath matched by
the ST or HMM methods between the track points 3 and 4. In this case, the curvedness plays a key
role in discriminating the correct matched path from all possible matched paths.
searching and shortest path calculations. Strategies, which tailor the searching space can
be adopted into the CURF method for improving time performance, such as He et al.
(2013), Chen et al. (2014), etc. All three matching methods employ the Dijkstra algorithm
for shortest path calculation, and better than 90% of computing time consuming is
consumed in this step. The Dijkstra algorithm is a standard framework for computing
shortest paths. Many technique can improve the short-path computation eciency, for
example, ALT (Goldberg and Harrelson 2005), or Arc Flags (Khler et al. 2006).
High performance computing, which could also be integrated into our proposed
method, could be another choice for improving the time performance of map matching
(Li et al. 2011a).
29
5. Conclusions
The CURF matching method was inspired by the observation that curvedness is largely
retained in the sampling points of realistic driving trajectories. We employ the curvature
integral to measure the curvedness. This technique transforms the inner geometric
property, which otherwise might be hidden within the track points, into a matching
feature. The matching features provide the tags that help to select the right matched
paths from candidates. Curvedness is such a prominent tag that the correct driving road
can be eectively distinguished from candidates. This is supported by our experimental
results. Using curvedness, the CURF method achieves not only high eciency but also
superior stability.
A problem of current map matching methods for probe vehicles is that exceedingly
long sampling intervals, larger than 120 s, cause matching accuracy to rapidly decline.
The interval is large enough that assuming the shortest path between the two adjacent
track points becomes invalid. Therefore, future work may focus on techniques to
eectively recover the driving path between the two track points with large sampling
intervals. Another issue for future work is to improve the computation performance
using parallel computing.
Acknowledgment
The authors would also like to thank the anonymous reviewers for their helpful suggestions that
improved the article.
Disclosure statement
No potential conict of interest was reported by the authors.
Funding
This study was jointly supported by the National Natural Science Foundation of China [grant
number 41101355], [grant number 41271400, [grant number 41371377, [grant number 41471322];
the Fundamental Research Funds for the Central Universities [grant number 13CX02034A];
Shenzhen Scientic Research and Development Funding Program [grant number
ZDSY20121019111146499]; Shenzhen Dedicated Funding of Strategic Emerging Industry
Development Program [grant number JCYJ20121019111128765]; and Open Research Fund
Program of Shenzhen Key Laboratory of Spatial Smart Sensing and Services (Shenzhen University).
ORCID
Zhe Zeng
http://orcid.org/0000-0002-0528-7069
References
Brakatsoulas, S., et al., 2005. On map-matching vehicle tracking data. In: Proceedings of the 31st
international conference on very large data bases, VLDB 05, 30 August2 September
Trondheim. VLDB Endowment, 853864.
30
Z. ZENG ET AL.
Chen, B.Y., et al., 2014. Map-matching algorithm for large-scale low-frequency oating car data.
International Journal of Geographical Information Science, 28 (1), 2238. doi:10.1080/
13658816.2013.816427
Dijkstra, E.W., 1959. A note on two problems in connexion with graphs. Numerische Mathematik, 1
(1), 269271. doi:10.1007/BF01386390
Giannotti, F., et al., 2011. Unveiling the complexity of human mobility by querying and
mining massive trajectory data. The VLDB Journal, 20 (5), 695719. doi:10.1007/s00778011-0244-8
Goldberg, A.V. and Harrelson, C., 2005. Computing the shortest path: a search meets graph theory.
In: Proceedings of the sixteenth annual ACM-SIAM symposium on discrete algorithms, SODA 05,
Vancouver, BC. Philadelphia, PA: Society for Industrial and Applied Mathematics, 156165.
Greenfeld, J.S., 2002. Matching GPS observations to locations on a digital map. In: Transportation
research board 81st annual meeting. Washington, DC: Transportation Research Board of the
National Academies.
He, Z., et al., 2013. On-line map-matching framework for oating car data with low sampling rate
in urban road networks. IET Intelligent Transport Systems, 7 (4), 404414. doi:10.1049/ietits.2011.0226
Hunter, T., Abbeel, P., and Bayen, A., 2014. The path inference lter: model-based low-latency map
matching of probe vehicle data. IEEE Transactions on Intelligent Transportation Systems, 15 (2),
507529.
Khler, E., Mhring, R.H., and Schilling, H., 2006. Fast point-to-point shortest path computations with arc-ags. In: 9th Dimacs implementation challenge, 1314 November Piscataway,
NJ. American Mathematical Society.
Li, L., Quddus, M., and Zhao, L., 2013. High accuracy tightly-coupled integrity monitoring algorithm
for map-matching. Transportation Research Part C: Emerging Technologies, 36, 1326.
doi:10.1016/j.trc.2013.07.009
Li, Q., Zhang, T., and Yu, Y., 2011a. Using cloud computing to process intensive oating car data for
urban trac surveillance. International Journal of Geographical Information Science, 25 (8), 1303
1322. doi:10.1080/13658816.2011.577746
Li, Q., et al., 2011b. Path-nding through exible hierarchical road networks: an experiential
approach using taxi trajectory data. International Journal of Applied Earth Observation and
Geoinformation, 13 (1), 110119. doi:10.1016/j.jag.2010.07.003
Liang, X., et al., 2012. The scaling of human mobility by taxis is exponential. Physica A: Statistical
Mechanics and Its Applications, 391 (5), 21352144.
Lou, Y., et al., 2009. Map-matching for low-sampling-rate GPS trajectories. In: Proceedings of the
17th ACM SIGSPATIAL international conference on advances in geographic information systems,
GIS 09, Seattle, Washington. New York, NY: ACM, 352361.
Marchal, F., Hackney, J., and Axhausen, K.W., 2005. Ecient map matching of large global
positioning system data sets: tests on speed-monitoring experiment in Zrich. Transportation
Research Record: Journal of the Transportation Research Board, 1935 (1935), 93100.
Miwa, T., et al., 2012. Development of map matching algorithm for low frequency probe data.
Transportation Research Part C: Emerging Technologies, 22, 132145. doi:10.1016/j.
trc.2012.01.005
Newson, P. and Krumm, J., 2009. Hidden Markov map matching through noise and sparseness. In:
Proceedings of the 17th ACM SIGSPATIAL international conference on advances in geographic
information systems, GIS 09, Seattle, Washington. New York, NY: ACM, 336343.
Pressley, A., 2010. Elementary dierential geometry. 2nd ed. London: Springer.
Quddus, M.A., Ochieng, W.Y., and Noland, R.B., 2007. Current map-matching algorithms for transport applications: state-of-the art and future research directions. Transportation Research Part C:
Emerging Technologies, 15 (5), 312328.
Rahmani, M. and Koutsopoulos, H.N., 2013. Path inference from sparse oating car data for urban
networks. Transportation Research Part C: Emerging Technologies, 30, 4154. doi:10.1016/j.
trc.2013.02.002
31
She, X., et al., 2012. Online map-matching framework for oating-car data with low sampling
rate in urban road network. In: Transportation research board 91st annual meeting, 2226
January Washington, DC.
Wang, W., et al., 2011. Large-scale freeway network trac monitoring: a map-matching algorithm
based on low-logging frequency GPS probe data. Journal of Intelligent Transportation Systems,
15 (2), 6374. doi:10.1080/15472450.2011.570103
White, C.E., Bernstein, D., and Kornhauser, A.L., 2000. Some map matching algorithms for personal
navigation assistants. Transportation Research Part C: Emerging Technologies, 8 (16), 91108.
doi:10.1016/S0968-090X(00)00026-7
Wu, D., et al., 2007. A heuristic map-matching algorithm by using vector-based recognition. In:
International multi-conference on computing in the global information technology, ICCGI 2007,
49 March Guadeloupe. IEEE Computer Society Press.