Professional Documents
Culture Documents
Abstract - Detecting and tracking moving objects is a key the partially occluded or inexplicit objects or to generate error
technology for autonomous driving vehicle in dynamic urban results [5]. Azim et al. [6] represents the environment using an
environment. In this paper, we present a hybrid system of octree-based occupancy grid and determining inconsistencies
multiple moving target tracking (MMTT) for autonomous driving between the map, incoming scans to detect dynamic objects
vehicles using 3D-Lidar sensor. To detect targets, geometric
and classifying them into pedestrian, bicycle or vehicle. As
model-based method is adopted in the hybrid system. In further,
we propose a novel hypothesis trajectory model to detect moving trajectory is another feature of the moving object, in our
targets. The targets detected by the two methods are fused under system, we propose a hypothesis trajectory model by
Extend Kalman Filter. Besides, a bipartite graph model-based collecting all probable trajectories, and pick out moving
method with the Hungary algorithm is presented to optimize data targets according to the feature of them.
association matrix. Our tracking algorithm is tested on school The data association between detected targets and tracked
road and Beijing’s 3rd ring road using an experimental vehicle targets is another problem for MMTT, as the wrong matching
with velodyne-32-E. And, the experiment results illustrate the in the process will lead to a wrong tracking. Researchers have
hybrid method performs well in real time.
proposed many representative methods to solve this problem,
Keywords—autonomous driving vehicle, multiple moving e.g Global Nearest Neighbor (GNN) [7], Joint Probabilistic
target tacking, hypothesis trajectory model, data association, Data Association (JPDA) [8] and Multiple Hypothesis
Extend Kalman Filter. Tracking (MHT) [9]. Therefore, the GNN method is used in
our system to establish a relationship matrix for current
I. INTRODUCTION detection targets and tracked targets. Mathematical methods,
such as graph theory, are introduced to optimize the match
Reliable environment perception is fundamental for
assignment problem during data association. In [10], Yang et
autonomous driving vehicles, and the perception system for
al. propose a graph-based optimization to solve the association
autonomous vehicles includes static obstacle detection and
problem. Hungarian method, a combinational optimization
dynamic object detection. Besides, an autonomous vehicle
algorithm, is successfully used to process the multiple-step
should be able to make predictions about targets’ position and
association stage for multi-pedestrian tracking in [11].
behavior and prepare for next actions by tracking moving
As the measurement error exists in the data from sensors,
objects and analyzing the objects’ motion state. As the moving
researchers usually use filters to calculate estimation result as
objects interfere with each other in dynamic scene, tracking
the optimal outcome, such as Kalman filter (KF) [5], Extend
multiple moving targets in complex urban traffic roads is a
Kalman filter (EKF) [12] and Partical filter (PF) [4]. While KF
great challenge for the wide application of autonomous driving
is useful liner filter with wide scope application, EKF and PF
vehicles.
are suit for nonlinear system. Considering that vehicle’s
Different types of sensors have been employed to deal
moving state is non-linear and the system runs in real time,
with the challenge, and there have been numerous fruits in this
EKF performs better than KF for the moving vehicle motion
field [1] [2]. In recent years, 3D-Lidar sensors have become
state estimate.
widely applied in area, because they provide rich and accurate
data of spatial information around the vehicle. Most methods II. SYSTEM CONFIGURATION
of multiple moving target tracking based 3D-Lidar can be
divided into two main steps [3]: detection and tracking, and This paper presents a method of multiple moving target
the tracking part includes data association and state updating. tracking for autonomous driving vehicles with 3D-Lidar The
In the detection step, 2.5D grid map is a popular and framework is shown in Fig.1, the method contains three steps:
efficient representation of the information of 3D point cloud 1) Firstly, the cloud points from 3D-Lidar are projected into
[4]. In [3], geometric model-based method can effectively the 2.5D grid map, and all candidate targets are extracted from
detect the vehicle driving in the road, according to different the grid map through the preprocessing part. 2) In detection
geometry features of objects in grid map. However, as the part, to detect the target from the candidates, geometric
laser scanner is dispersed in variance distance around 3D- model- based method is adopted. In further, a novel
Lidar, the geometric model-based method is unable to detect hypothesis trajectory method is proposed to detect the moving
3D cloud
points Tracked
Targets list
Tracked
trajectories list
(a)
j fj kt Ci , j =
a
+
b
+
c
+
d
(1)
t-1
kt
fkktt+1 x −x
t
l
t y −y l
t W −W
t
m
t L −L
t
l
t t
k fkt−1 fit+1
kt i We build a correlation matrix C to restore the result of
Time t GNN, as in (2). The size of matrix C is (n_detect × n_track),
n_detect is the total number of detected targets and n_track is
Targets the total number of tracked targets. The matrix element Ci,j is
. ..
. ..
list
. ..
i
fitk−t1 fktt+1 k
fktt+1 value of Ci,j into 1 or 0 according to the similar value of Ci,j : if
. ..
Mt (k) k Nt (k) the similar value of Ci,j goes beyond a certain threshold, Ci,j
(b)
fktk−t1 f kt
jt−1 f jt+1 will be set as 1, which means relationship exists between the
kt
j targets; otherwise, Ci,j will be set as 0, which means there is no
. ..
. ..
. ..
P(t)
ª C1,1 " C1, j " C1,n _ track º
Fig. 3 Hypothesis trajectories model. The colourful lines in (a) represent the « »
possible trajectory of k th target from time t-1 to t+1. In (b), the rings « # % " % # » (2)
represent the target, the connection lines mean the hypothesis trajectory as in C = « Ci ,1 " Ci, j " Ci,n _ track »
(a). There may exist M(k) targets in t-1 th moment and N(k) target in t+1 th « »
moment connecting with the k th target in t th moment. The connection « # % # % # »
side f k jt +1 is the hypothesis trajectory between k th target in and the j th target «C " C " C »
t ¬ n_detect ,1 n_detect , j n_detect , n _ track ¼
of t+1 th moment.
2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC)
Updating
function, Vt is the Noise of the measurement system with the
matching state
known covariance, and Wt −1 is the Noise of the process
Empty Seeking augment path
Matching from every vertices 1 system with the known covariance.
G(V,E) < The kinematics formulation shown in (4) describes the
motion state relationship between the previous time step and
Go through
1 Exit augmented the current, and, the state transition matrix f can be written as
all path path
(5).
< Á xt 3 xt #1 ! vt #1 :t cos șt #1
Ä
HQG ÄÄ yt 3 yt #1 ! vt #1 :t sin șt
 (4)
Fig. 5 Seeking augment path flow.
Ävt 3 vt #1
Ä
ÄÃșt 3 șt #1
The ideal matrix C should present only one-to-one
correspondence relationship between the detection targets and ¾1 0 vt #1:t cos șt #1 :t sin șt #1 Î
¿ Ï
the tracked targets. However, in fact, there will be situations of ¿0 1 # vt #1:t sin șt #1 :t cos șt #1 Ï
the one to many, many to one or many to many due to the large At,t #1 = ¿ Ï (5)
¿0 0 1 0 Ï
number of targets, as well as the empty match. To solve the ¿ Ï
problem, the GNN method is usually implemeted with Greedy À¿0 0 0 1 ÐÏ
algorithem [13], however, we prsent a bipartite graph model-
based method with Hungary algorithm to optimaze the data Where, ( x, y ) is the position of tracked target, center
association relationship matrix C. position of the box, in the 2.5D grid map, v is relative
The data association relationshiop bipartite graph model velocity of the target, and ș is heading of the box with the y
G=(V,E) is established to represent the matrix C. In G=(V,E), axis of grid map coordinate system and is relative value as
V is a set of vertices with the size of (n_detect + n_track) well.
consisting of the detected targets and the tracked targets, and The state estimated for the moving target is represented
E is the relationship edges between the vertices, consisting of T
by ( x, y, v, ș ) . These states are independent of each other in
one detection target and one tracked target, and the value of
the element according to them in C is 1. According to the same moment, so the error covariance between any two of
Hungarian algorithm, we keep seeking the augmenting path in them is 0. The states ( x, y ) is fixed and can be written in
the bipartite graph G until there is no augmenting path starting measure matrix as (6):
with a intial place(the seeking flow show as Fig.5). And when ª1 0 0 0 º (6)
Hi = « »
meeting the following requirements, the total number of 1 in C ¬0 1 0 0¼
can reach maximum:
z There exists the element Ci,j, whose value is 1. V. EXPERIMENTS
z There exists no element Ci,k or Cg,j, whose value is1, We have evaluated the performance of our algorithm in
in the same row or column as Ci,j. real time in the following two experiments, which are typical
Thus, the matrix C is transformed into the matrix, where urban environment and the results will be discussed. The
only one-one matching relationship exists and the one-to-one autonomous driving vehicle used in the experiments is our
matched relationship achieves the maximum. self-developed Intelligent Vehicle (as shown in Fig.6), which
took part in the China Future Challenge of Intelligent Vehicle
B. Extended Kalman Filter for State Estimation from 2013 to 2016. A Velodyne-HDL-32E LIDAR is mounted
The Kalman Filter offers an efficient, iterative method to on the top the autonomous vehicle, with the rotation frequency
collect information from sensors for estimating the state of set as 10Hz. We implement the algorithm in C++ as well as the
the vehicle. Compared with standard Kalman filter, EKF is
widely used in non-linear system, such as the vehicle motion.
Considering that motion state of vehicle is non-linear model at
the situation with acceleration and turning, in this paper, EKF
is used to estimate motion state of the target. The filtering
model we use is as following:
ÄÁ X t 3 f ( X t #1 , t # 1) ! wt #1
 (3)
ÄÃ Z t 3 h( X t , t) ! Vt
point cloud library (PCL) .This algorithm runs on a system (b), where is crowded with a vehicle stream. The dataset was
equipped with 4GB RAM and Intel (R) Core(TM) i7 2.60 Hz collected by the intelligent vehicle, with a speed between
CPU. Besides, we equip the intelligent vehicle with sensors 20km/h and 45km/h.
such as GPS/IMU, which can receive the ego vehicle’s In Fig.10 (a), the number of tracked targets is compared
position. between the geometric model method and the hybrid method,
and the tracking performance with the data association
A. Experiment I (on Campus Road) procession proposed in the paper is compared with greedy
We evaluate the precision of target’s motion state algorithm. The truth target in Fig.10 (a) is labeled in the off-
estimation in our tracking system in the scene shown in Fig.7 line datasets. Fig.10 (b) shows the cost time of the proposed
(a). The intelligent vehicle drives on the campus road and multi-target tracking algorithm and illustrates the tracking
tracks the cars parked on the sides of road as shown in Fig.7 algorithm can meet the requirements of running in real time.
(b). As shown in Fig.8, the velocity estimation of static target Besides, Fig.11 shows the scenes of tracking result.
with EKF is compared with the estimation result of KF in 160
frame.
Fig. 11 Tracking results scenes in the frames when the intelligent vehicle drives through one ramp of Beijing third ring road. The tracking results are presented by
rectangles labeled with ID numbers, and the rectangle’s color represents different detection methods proposed in the paper. Blue rectangle is detected by
geometric model-based method, red rectangle is detected by hypothesis trajectory model-based method and yellow rectangle means the target is detected with
both methods. The green thin line is the trajectory of target.