Professional Documents
Culture Documents
Abstract—In the field of the unmanned automobile and the This article integrates a set of image processing
automobile auxiliary driving system, the real-time and accurate techniques and spiking neural network to improve accuracy of
detection of the lane is very important. Based on the previous the lane detection. Firstly, the ROI is set on the image captured
research on the lane detection, the paper introduces the spiking from the on-board camera. This can greatly reduce the running
neural network with the parallel mechanism to detect the lane. time of the algorithm and improving the detection speed;
Firstly, the region of interests (ROI) is set on the origin image Secondly, A set of image preprocessing techniques is applied to
that collected by a vehicle on-board camera. In order to reduce the ROI region, such as transform from RGB image to
processing time, areas outside the road are excluded in the ROI. grayscale, gray stretch and median filtering. Edge detection
Then the image preprocessing is applied to the ROI, including
based on the spiking neural network with biological
RGB to grayscale, gray stretch and median filtering to eliminate
mechanism is proposed to extract efficient edge lines. At last,
noise. Edge detection of the lane is the key to determine whether
the Hough transform can detect the lane. In this paper, the the Hough transform is used to detect the lane. In this paper the
spiking neural network is used to detect the edge of the lane. new ideas are that the ROI preprocessing is used to speed up
Finally, Hough transform is used to detect the lane. Experimental the detection time and spiking neural network is used to obtain
results show that this method is more accurate and robust than better edge lines to improve the detection accuracy.
other methods.
II. IMAGE PREPROCESSING
Keywords-lane detection; region of intrests; spiking neural
network; hough transform The origin image is captured from the on-board camera.
Because the color image contains a lot of noise and other
interference information, they will affect the final lane
I. INTRODUCTION detection. Therefore, the ROI and preprocessing techniques is
The detection of road lane line has a key role in the used to process the origin image so that the noise and other
unmanned automobile and the vehicle assistant driving system. interference information can be removed and then, the correct
When the vehicle appears to deviate from the lane or vehicle rate of lane detection can be improved [7].
safety distance is too small, it can timely alert the driver to
avoid dangerous [1-2]. Because of the low price of machine A. Select ROI in Origin Image
vision and the rich information of the image, the machine
The images collected by the on-board camera are color
vision has been widely used in the lane detection of intelligent
images. Each pixel in the image is made up of R, G, and B
vehicle. Hough transform is a widely used method in the lane
three color components. Each color contains 256 levels. So the
detection. It has very good suppression of noise performance
change range is about 16000000 (256×256×256) for each pixel
and is not sensitive to the target which is partially occluded and
in the image. As seen, the color image contains a lot of
covered in the image. But, because of the complexity of Hough
information. Processing these information needs to consume a
transform, the computation speed is very slow, and the false
lot of time, so that it is difficult to process in real-time because
detection rate is large. It cannot meet the requirements of
of time consuming of the color image processing. Through
accuracy and real-time requirements. The detection speed is
statistical analysis of the multi frame images acquired by the
improved by limiting the polar angle and the polar diameter in
on-board camera, the 1/3 area above the image is the sky and
Hough transform space in reference [3]. Reference [4] detected
the background area. It is call it invalid area. The remaining 2/3
the lane by finding the best threshold of the image, Random
region is an effective area, which is called the ROI, as shown in
sample consensus (RANSAC) algorithm is used to detect the
Figure 1. Processing on only the ROI can greatly reduce the
lane in paper [5], but it has high wrong rate. Paper [6] use the
time of algorithm and improve the running speed. The original
peak point inverse transform to extract line segments in Hough
image is shown in Figure 2 (a), the ROI region is shown in
transforms space. All the methods mentioned above are
Figure 2 (b).
modified in the process of Hough transform, do not take into
account the appropriate processing methods in the image
before Hough transform to improve the detection accuracy and
shorten the calculation time.
627
neuron to distinguish whether it is the edge pixel in the input where g ex
x , y (t ) and g x, y (t ) are the conductance for excitatory and
ih
image [15].
inhibitory synapses respectively, τ ex and τ ih are the time
constants for excitatory and inhibitory synapses respectively,
vN 1 (t ) is the membrane potential of neuron N1, Eex and Eih are
the reverse potential for excitatory and inhibitory synapses
respectively, cm represents a capacitance of the membrane,
gl represents the conductance of membrane, ex is short for
excitatory and ih for inhibitory, wup
x, y
_ ex represents the strength
inhibitory synapses, Aex and Aih are the membrane surface area
connected to the excitatory synapse and inhibitory synapse
respectively. According to the description of biological
receptive fields, values for wup x, y
_ ex and
wup
x, y
_ ih are expressed as
follows [16].
0 if (y-yc ) ≤ 0
Figure 5 Spiking neural network model for image edge detection up _ ex ° (7)
w( x, y ) =® −
( x − xc )2 ( y − yc ) 2
−
The first layer was photonic receptors layer, each pixel in °w e δx
2
δ y2
this layer corresponds to a receiver. The second layer was ¯ e max if (y-y c )>0
composed of four different neurons. Each of them has four
different receptive fields respectively. 'X' represent an 0 if (y-yc )>0
up _ ih ° (8)
excitatory synapses, 'ǻ' represent an inhibitory synapses. The w( x, y ) =® −
2
( x − xc ) ( y − yc )
−
2
third layer was the output layer. The neuron in the output layer °w e δ x
2
δ y2
¯ i max if (y-y c ) ≤ 0
integrated the output from the intermediate layer. In the middle
layer, there are four parallel arrays of neurons, N1, N2, N3 and where ( xc , yc ) is the centre of receptive field RFrcpt .
N4 and only one neuron in each array is shown in Figure 5 for
simplicity. Each of these layers perform the processing for up, ( x, y ) ∈ RFrcpt . δ x and δ y are constants, we max and wi max are
down, left and right edges respectively and connected with the the maximal weights for excitatory synapses and inhibitory
receptive field by different weights. G( x , y ) represent the gray synapses respectively. When the neuron's membrane voltage
reaches a threshold voltage, the neuron generates a pulse
scale. ( x, y ) ∈ RFrcpt . qxex, y and qxih, y were the peak conductance
signal, and then the neuron enters a period of dormancy. When
generated by the excitatory synapses and inhibitory synapse at the membrane voltage reaches the threshold voltage again at
the receiving point (x, y). Each receptor can transform a value the time of dormancy, the neuron generates a pulse signal
of gray scale to peak conductance by the following expressions. again. S N 1 (t ) represent the pulse sequence generated by the
qxex, y = α G( x, y ) qxih, y = β G( x, y ) (3) neurons of N1.
1 if neuron N1 fires at time t.
α and β are constants. According to the conductance based S N 1 (t ) = ® (9)
integrate-and-fire model, the potential changes of neuron N1 ¯0 if neuron N1 does not fire at time t.
can be expressed in the following expressions. Likewise, Neuron N2, N3, and N4 are governed by a set of
ex equations similar to that for neuron N1. The parameters of the
g ( x, y ) (t ) 1 spiking neural network are set as follows: vth = -60 mv, vreset = -
=− g (exx, y ) (t ) + α G( x , y ) (4)
dt τ ex 70 mv, Eex = 0 mv, Eih = -75 mv , El = -70 mv, gl =1.0
g(ihx , y ) (t ) 1 μs/mm2, cm =10 nF/mm2 ˈ τ ex =4 ms, τ ih =10 ms, τ ref =6
=− g(ihx , y ) (t ) + β G( x , y ) (5)
dt τ ih ms, Aih =0.028953 mm2, Aex =0.014103 mm2 , we max is set to
0.7093 for excitatory synapses, and wi max is set to 0.3455 for
dV N 1 (t ) w(upx,_y )ex g (exx, y ) (t )
cm = gl ( El − VN 1 (t )) + ¦ ( Eex − VN 1 (t )) inhibitory synapses [17]. Figure 6a is the result of edge
dt ( x , y )∈RFrcpt Aex detection of Sobel operator. Figure 6b is the canny edge
detection. Figure 6c is the edge detection based on the spiking
w(upx ,_y ih) g (ihx , y ) (t ) neural network. It is the firing rate map on the output layer,
+ ¦
( x , y )∈RFrcpt Aih
( Eih − VN 1 (t )) (6)
Bright lines show that the corresponding neurons fires with a
high frequency and indicate the edges with high contrast. Dark
lines show that the corresponding neurons fires with a low
628
frequency and indicate the edges with low contrast. Using the y
firing rates, different contrast edges can be separated.
( xi , yi )
ρ
(xj , y j )
θ ρi = x i cos θ + yi sin θ
Figure 6b Canny edge detection
( ρ0 , θo )
ρ j = x j cos θ + y j sin θ
o ρ
Figure 6c Edge detection based on the spiking neural network
Figure 8 Polar coordinates mapping to the parameter space
IV. LANE DETECTION BASED ON HOUGH TRANSFORM
Parameter space can be considered as discrete in practical
Firstly, we propose three assumptions of road model before application and a two-dimensional accumulator A (λ , θ ) is
Hough transform [18]. established. As a fixed point ș can take all possible values,
(1) The assumption of a linear model. According to the corresponding Ȝ can be calculated according to equation (10).
characteristics of the ROI and the characteristics of the Let A (λ , θ ) add 1 after calculate each (λ , θ ) . That
image captured by the on-board camera, the lane in the is A(λ ,θ ) = A(λ ,θ ) + 1 . Suppose that the size of image
camera field of view is supposed a straight is D × D , as for ȡ and ș, the range of value
line: y = kx + b .
are [ − 2 D / 2, 2 D / 2] and [− π , π ] respectively. Found the
(2) Suppose that the left lane and right lane are parallel. In 2 2
the camera field of view, whether it is a straight line, a maximum value Amax (λi , θi ) in the results of the voting
curve, or an S curve, it is assumed that the two lanes space A(λ , θ ) . λi and θi are the polar coordinate equation with
are parallel to each other, and they intersect in the the largest collinear points of the origin image ˄ collinear
vanishing point.
points is Amax (λi , θi ) ˅. Hough transform as shown in Figure 9.
(3) The direction of the lane is not perpendicular to the Lane detection results are shown in Figure 10.
driving direction.
Hough transform is the most widely used in the lane
detection. The basic idea of Hough transform is to convert the
image coordinate space to the parameter space to realize the
fitting line curve. In addition, it can detect the lane even the
image was partly occluded and damaged. As shown in Figure
7, the line equation can be expressed in polar coordinates.
λ = x cos θ + y sin θ (10)
Ȝ is the distance from the origin point to the straight line, ș is
the angle with the positive X axis, the range of ș is ±90 0 . Two
collinear points ( xi , yi ) and ( x j , y j ) in polar coordinates mapping
Figure 9 Hough transform
to the parameter space are two sine curve, and intersect at one
point [4]. As shown in Figure 8.
629
REFERENCES
630