Professional Documents
Culture Documents
Abstract— Intelligent Transportation System (ITS) has been accidents etc. in the cities. However, inefficient vehicle traffic
emerged an important component and widely adopted for the management system causes numerous issues such as long wait-
smart city as it overcomes the limitations of the traditional ing time, wastage of fuel, high carbon footprints and vehicular
transportation system. Existing fixed traffic light control systems
split the traffic light signal into fixed duration and run in an accidents [1], [2]. Current traffic light control system deploys
inefficient way, therefore, it suffers from many weaknesses such as a fixed duration traffic light adjustment without taking care of
long waiting time, waste of fuel and increase in carbon emission. real-time traffic or considering the traffic scenario to a limited
To tackle these issues and increase efficiency of the traffic extends [1]. Further, fixed duration traffic light control system
light control system, in this work, a Dynamic and Intelligent splits the traffic signal duration to equal time in every phase
Traffic Light Control System (DITLCS) is proposed which takes
real-time traffic information as the input and dynamically adjusts without considering real time traffic data. Some other traffic
the traffic light duration. Further, the proposed DITLCS runs in light control algorithms take the traffic information as the input
three modes namely Fair Mode (FM), Priority Mode (PM) and from sensors i.e., inductive loop detectors to capture the traffic
Emergency Mode (EM) where all the vehicles are considered with data. The input data is then processed to determine the phase
equal priority, vehicles of different categories are given different duration of the green light signal. However, such traffic light
level of priority and emergency vehicles are given at most priority
respectively. Furthermore, a deep reinforcement learning model is control systems will be paralyzed when the inflow rate of the
also proposed to switch the traffic lights in different phases (Red, traffic is relatively large at the certain point of times. Thus,
Green and Yellow), and fuzzy inference system selects one mode many times, traffic-policemen have to directly manage the
among three modes i.e., FM, PM and EM according to the traffic traffic light intersections by hand-waving signals.
information. We have evaluated DITLCS via realistic simulation Thus, it is very prominent to deploy an intelligent and
on Gwalior city map of India using an open-source simulator
i.e., Simulation of Urban MObility (SUMO). The simulation dynamic traffic light control system to manage vehicular
results prove the efficiency of DITLCS in comparison to other dynamics efficiently, so that traffic may flow smoothly despite
state of the art algorithms on various performance parameters. of high density in traffic and limited infrastructure. There-
Index Terms— Deep learning, intelligent transportation system fore, utilization of Intelligent Transportation System (ITS)
(ITS), fuzzy logic, dynamic traffic light control. has elevated the expectations of society and the individual
traveller based on promise of the technologies involved.
I. I NTRODUCTION Major objectives of ITS are to evaluate, develop, analyse
and integrate new sensor, information and communication
Authorized licensed use limited to: McMaster University. Downloaded on May 03,2020 at 05:04:08 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
can be captured with newer sensors, cameras which can lead recent decades. Author [7], [8] utilized fuzzy model to develop
to more accurate traffic light management [5], [26], [27]. the traffic light controller with the aim of minimizing Average
However, there are certain limitations of the existing systems Waiting Time (AWT) and Queue Length (QL) but they missed
are itemized as follow: heterogeneity. Further, Mir and Hassan [9] developed a traffic
• For existing traffic light control systems, traffic signals light controller utilizing neural network and fuzzy controller
splatted only into fixed intervals and green/red lights to minimize AWT and QL for car vehicles. To tackle the
duration can only be a manifold of the same interval. emergency vehicles, Younes and Boukerche [12] developed
• Existing studies were focused on only finding the accurate a heuristic based algorithm and verified via simulation in
traffic duration but not considered vehicle priority [1]. SUMO, however, they ignored the other priority vehicles.
As obvious, many priority and emergency vehicles such In order to account the priority vehicles, Khan et al. [11]
as fire-brigade, ambulance etc., need to be reached their developed a EVP-STC protocol utilizing micro-controller,
destinations as soon as possible. ZigBee, GPS and other IoT devices, and varied its performance
• Existing traffic light controllers are unable to understand effectiveness using PTV Vissim. However, authors considered
the traffic behaviour and operate in different modes line opening time but missed several important parameters in
i.e., emergency/priority/fair mode. vehicular dynamics.
• Traditional ITS solutions are not cost effective in terms Abadi et al. [2] parented a strategy to predict the traffic flow
of IT and IoT infrastructure (Rasberry pi, Cloud etc.) for on road transportation vehicular networks but with limited traf-
developing vehicular networks. Such type of systems are fic data. They utilized autoregressive model and Monte Carlo
commercially expensive for the developing countries. simulation technique, however, it fails to consider vehicle pri-
Traditional traffic light control systems are lacking in meet- ority and make traffic light control decisions. Quek et al. [3]
ing the current requirements of the smart city, therefore, proposed a novel fuzzy neural approach in order to analyse and
in order to overcome the above mentioned limitations of the predict road traffic utilizing pseudo outer-product fuzzy neural
existing systems, in this work, an efficient, Dynamic and Intel- network using the truth-value-restriction. This work accounts
ligent Traffic Light Control System (DITLCS) is proposed. the vehicle heterogeneity to analyse the traffic behaviour
The significant contributions of the proposed DITLCS are but not designed to flow traffic. Zhang et al. [4] suggested
listed as follow: a force-driven traffic simulation for the future connected
• In DITLCS, time duration for each phase of traffic light autonomous vehicle-enabled smart transportation system, how-
signal is calculated based on extracted information (traffic ever, they missed the requirement traffic control by under-
load, vehicles heterogeneity) from vehicular network. standing the vehicular dynamics. Kachroo and Sastry [10]
• Considering heterogeneous traffic load, an algorithm is developed a theoretical mathematical model to analyse the
proposed utilizing fuzzy inference system to choose one density based travel time for real time vehicular dynamics.
out of three modes: 1) Fair Mode (FM):-all vehicles have Authors show that it is important for intelligent transporta-
same weightage 2) Priority Mode (PM):- vehicle weights tion system applications where travel time is an important
are differentiated based on priority 3) Emergency Mode factor. Bekiaris-Liberis et al. [6] developed a methodology
(EM):-emergency vehicle routes are prioritized first. for highway traffic state estimation with mixed connected and
• An agent is trained utilizing deep reinforcement learning conventional vehicles. Authors validated the performance of
on traffic data which mimics the behaviour of an experi- the developed estimation schemes through simulations using a
enced human agent. This agent can run on light weighted well-known second-order traffic flow model. Belletti et al. [5]
computing device (Raspberry Pi) to predict traffic light utilized multi task deep reinforcement learning for expert
signal phase for the each mode of the DITLCS. level control of ramp metering they proved neural network
• To verify the effectiveness of DITLCS, a realistic sim- based deep reinforcement is effective to the problems whose
ulation is developed utilizing an open source simulator parameters are known. The domain experts [1] have used and
tool i.e., Simulation of Urban Mobility (SUMO) [16] on suggested the application of Convolutional Neural Network
Gwalior city, India. Results of DITLCS are compared (CNN) with reinforcement leaning referred deep reinforcement
with several state of the art algorithms on various testing leaning to control the dynamic traffic light but authors missed
parameters which proved the effectiveness of the model. the heterogeneity among the vehicles.
Rest of the paper is organized as follows. The literature
review is presented in Section II. The model and problem
statement are introduced in Section III. The background on III. M ODEL AND P ROBLEM S TATEMENT
reinforcement learning is introduced in Section IV. The tech- This work considers heterogeneous vehicular traffic network
nical details of the DITLCS is presented in Section V. The of the developing counties with the aim of developing a
Simulation study of the model is given in Section VI. Finally, dynamic and intelligent traffic light signal scheduling system.
the paper is concluded in Section VII. A pictorial view of the model is shown in Fig. 1. The proposed
model works in two steps; first, the traffic light control
II. L ITERATURE R EVIEW system gathers the vehicle traffic information via a vehicular
Due to immense growth in vehicles and limited transport network [14] with the help of cameras, Radio-Frequency Iden-
infrastructure, ITS has been widely focused as a prominent tification (RFID) sensors etc., that is represented by the red
area of research, thus, traffic optimization has been studied in dotted lines in the Fig.1. The traffic light controller (IoT device
Authorized licensed use limited to: McMaster University. Downloaded on May 03,2020 at 05:04:08 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
KUMAR et al.: FUZZY INFERENCE ENABLED DEEP REINFORCEMENT LEARNING-BASED TRAFFIC LIGHT CONTROL 3
TABLE I
L ITERATURE S UMMARY
Authorized licensed use limited to: McMaster University. Downloaded on May 03,2020 at 05:04:08 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Authorized licensed use limited to: McMaster University. Downloaded on May 03,2020 at 05:04:08 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
KUMAR et al.: FUZZY INFERENCE ENABLED DEEP REINFORCEMENT LEARNING-BASED TRAFFIC LIGHT CONTROL 5
Authorized licensed use limited to: McMaster University. Downloaded on May 03,2020 at 05:04:08 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
Authorized licensed use limited to: McMaster University. Downloaded on May 03,2020 at 05:04:08 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
KUMAR et al.: FUZZY INFERENCE ENABLED DEEP REINFORCEMENT LEARNING-BASED TRAFFIC LIGHT CONTROL 7
Authorized licensed use limited to: McMaster University. Downloaded on May 03,2020 at 05:04:08 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
time and other parameters. The cumulative reward is measured vehicle arrival time and the time at which the the parameters
in every cycle of the deep reinforcement learning module. are measured as shown in Eq. 13.
Fuzzy inference module is specifically used to pick a mode
N
of execution out of three (PM, FM and EM) by observing AW T = 1/N ∗ C T − ATi (13)
the current status of the vehicular traffic scenarios. In order to i=1
meet our requirements, the proposed DITLCS algorithms were
implemented utilizing Python and, the simulation was prepared where N = number of vehicles in the network and C T is the
in SUMO. The deep reinforcement learning component of current time and ATi is the arrival time of the vehicle at the
DITLCS was built with the help of Keras which is built on intersection.
the top of Tensorflow [1]. The simulation was conducted on 2) Throughput: Throughput (T p ) is the number of vehicles
a major Indian city Gwalior, by making use of the realistic passing through an intersection per cycle i.e., when all the
map which is converted into roads and traffic with the help of traffic light at an intersection have turned green once. (T p )
SUMO conversion tools such as netconvert. across all the intersection is measured by Eq. 14.
The length of the vehicle depends on the type of vehi-
N
cle (bus and truck length = 7 meter , car = 4.5 meter , Tp = q Li (14)
bike length = 1.5 meter ) vehicular network. Other SUMO i=1
configuration parameters are as: limit-turn-speed value = where, N = number of intersection and q L i is the queue
5.5 m/sec, max lane speed = 20.0 m/sec, truck and bus length in the i t h intersection.
max speed = 10 m/sec, car max speed = 15 m/sec, 3) Average Queue Length: Average Queue Length (AQL)
other vehicles max speed = 13.9m/sec, acceleration of truck is the average number of vehicle waiting at the traffic inter-
and bus = 0.8 m/sec2 , acceleration of car = 2.5 m/sec, section. The AQL across the intersections can be calculated
acceleration of other vehicles = 1.8 m/sec2 , de-acceleration by Eq. 15.
for all vehicles = 4.0 m/sec2 , min gap-between any two
vehicles = 2.0 meter , sigma(driving-proficiency) = 0.5,
N
Authorized licensed use limited to: McMaster University. Downloaded on May 03,2020 at 05:04:08 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
KUMAR et al.: FUZZY INFERENCE ENABLED DEEP REINFORCEMENT LEARNING-BASED TRAFFIC LIGHT CONTROL 9
VII. C ONCLUSION
This work explored the requirement of the heterogeneity
aware, Dynamic and Intelligent Traffic Light Control Sys-
tem (DITLCS) on vehicular traffic network. Heterogeneity
among the vehicles in terms of emergency and priority vehicles
Fig. 6. Comparative results.
is taken into account. The proposed DITLCS acts in three
among the traffic. This results in reduced waiting time. The phases namely Fair Mode (FM), Priority Mode (PM) and
Fig. 6(b) and Fig. 6(c) present the comparative results of Emergency Mode (EM) where all the vehicles are considered
various models on throughput and queue length respectively. with equal priority, vehicles of different categories are given
From the results, it is analysed that the DITLCS increases different level of priorities and emergency vehicles are given
Authorized licensed use limited to: McMaster University. Downloaded on May 03,2020 at 05:04:08 UTC from IEEE Xplore. Restrictions apply.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
at most priority. The DITLCS is the combination of deep rein- [15] S. Sajad Mousavi, M. Schukat, and E. Howley, “Traffic light
forcement learning and fuzzy inference system. Deep learning control using deep policy-gradient and value-function based rein-
forcement learning,” 2017, arXiv:1704.08883. [Online]. Available:
reinforcement is dedicated to dynamically generate the phase http://arxiv.org/abs/1704.08883
of the traffic lights, and fuzzy inference system opts the mode [16] W. Genders and S. Razavi, “Using a deep reinforcement learning agent
of the execution based on the traffic behaviour, heterogeneity for traffic signal control,” 2016, arXiv:1611.01142. [Online]. Available:
http://arxiv.org/abs/1611.01142
and dynamism in which deep reinforcement learning model [17] N. Casas, “Deep deterministic policy gradient for urban traf-
acts. To verify the effectiveness of the DITLCS, a realistic fic light control,” 2017, arXiv:1703.09035. [Online]. Available:
simulation is developed utilizing an open source simulator tool http://arxiv.org/abs/1703.09035
[18] S. El-Tantawy, B. Abdulhai, and H. Abdelgawad, “Design of reinforce-
i.e., Simulation of Urban MObility (SUMO) [1] on Gwalior ment learning parameters for seamless application of adaptive traffic
city of India. Results of DITLCS are compared with several signal control,” J. Intell. Transp. Syst., vol. 18, no. 3, pp. 227–245,
Fuzzy, neural network and priority based state of the art Jul. 2014.
[19] R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction,
algorithms which proved the effectiveness and high-efficiency vol. 1, no. 1. Cambridge, MA, USA: MIT Press, Mar. 1998.
of the DITLCS. [20] D. Silver et al., “Mastering the game of go with deep neural networks
and tree search,” Nature, vol. 529, no. 7587, pp. 484–489, Jan. 2016.
[21] D. Silver et al., “Mastering the game of go without human knowledge,”
ACKNOWLEDGMENT Nature, vol. 550, no. 7676, pp. 354–359, Oct. 2017.
Dr. Neetesh Kumar would like to put his sincere thanks to [22] M. Abdoos, N. Mozayani, and A. L. C. Bazzan, “Holonic multi-agent
system for traffic signals control,” Eng. Appl. Artif. Intell., vol. 26,
the all Co-PIs of the project, ABV-IIITM Gwalior, India, and nos. 5–6, pp. 1575–1587, May 2013.
DST, IDCPS Division, India, for kind of their support. [23] H. Hartenstein and L. Laberteaux, “A tutorial survey on vehicular ad hoc
networks,” IEEE Commun. Mag., vol. 46, no. 6, pp. 164–171, Jun. 2008.
R EFERENCES [24] V. Mnih et al., “Human-level control through deep reinforcement learn-
ing,” Nature, vol. 518, no. 7540, pp. 529–533, Feb. 2015.
[1] X. Du and X. Liang, “Deep reinforcement learning for traffic light [25] N. Kumar and D. P. Vidyarthi, “A green routing algorithm for IoT-
control in vehicular Networks,” IEEE Trans. Veh. Technol., early access, enabled software defined wireless sensor network,” IEEE Sensors J.,
Mar. 2018, doi: 10.1109/TVT.2018.2890726. vol. 18, no. 22, pp. 9449–9460, Nov. 2018.
[2] A. Abadi, T. Rajabioun, and P. A. Ioannou, “Traffic flow prediction [26] J. Zhang, F.-Y. Wang, K. Wang, W.-H. Lin, X. Xu, and C. Chen, “Data-
for road transportation networks with limited traffic data,” IEEE Trans. driven intelligent transportation systems: A survey,” IEEE Trans. Intell.
Intell. Transp. Syst., vol. 16, no. 2, pp. 653–662, Apr. 2015. Transp. Syst., vol. 12, no. 4, pp. 1624–1639, Dec. 2011.
[3] C. Quek, M. Pasquier, and B. L. B. Lim, “POP-TRAFFIC: A novel [27] S. Bouchelaghem and M. Omar, “Reliable and secure distributed smart
fuzzy neural approach to road traffic analysis and prediction,” IEEE road pricing system for smart cities,” IEEE Trans. Intell. Transp. Syst.,
Trans. Intell. Transp. Syst., vol. 7, no. 2, pp. 133–146, Jun. 2006. vol. 20, no. 5, pp. 1592–1603, May 2019.
[4] Y. Zhang, G. Zhang, R. Fierro, and Y. Yang, “Force-driven traffic [28] S. Chiu, “Adaptive traffic signal control using fuzzy logic,” in Proc.
simulation for a future connected autonomous vehicle-enabled smart Intell. Vehicles Symp., Jun. 1993, pp. 98–107.
transportation system,” IEEE Trans. Intell. Transp. Syst., vol. 19, no. 7,
pp. 2221–2233, Jul. 2018.
[5] F. Belletti, D. Haziza, G. Gomes, and A. M. Bayen, “Expert level Neetesh Kumar (Member, IEEE) received the
control of ramp metering based on multi-task deep reinforcement learn- M.Tech. and Ph.D. degrees from Jawaharlal Nehru
ing,” IEEE Trans. Intell. Transp. Syst., vol. 19, no. 4, pp. 1198–1207, University, New Delhi. He is currently a Fac-
Apr. 2018. ulty Member with the Atal Bihari Vajpayee Indian
[6] N. Bekiaris-Liberis, C. Roncoli, and M. Papageorgiou, “Highway traffic Institute of Information Technology and Manage-
state estimation with mixed connected and conventional vehicles,” IEEE ment (IIIT) Gwalior, India. He has published several
Trans. Intell. Transp. Syst., vol. 17, no. 12, pp. 3484–3497, Dec. 2016. research publications in worlds top tier publishers,
[7] M. Kafash, M. B. Menhaj, M. J. M. Sharif, and A. Maleki, “Designing such as IEEE journals and transactions, Elsevier,
fuzzy controller for traffic lights to reduce the length of queues in and springer journals. He is also acting as a Lead
according to minimize extension of green light time and reduce waiting PI of several sponsored projects in the domain of
time,” in Proc. 13th Iranian Conf. Fuzzy Syst. (IFSC), Aug. 2013, Intelligent Transportation System (ITS). Broadly, his
pp. 1–6. research interests include algorithm design, high performance computing,
[8] E. Azimirad, N. Pariz, and M. B. N. Sistani, “A novel fuzzy model and and ITS.
control of single intersection at urban traffic network,” IEEE Syst. J.,
vol. 4, no. 1, pp. 107–111, Mar. 2010.
Syed Shameerur Rahman is currently pursuing
[9] A. Mir and A. Hassan, “Fuzzy inference rule based neural traffic light
the Integrated Master’s degree with the Atal Bihari
controller,” in Proc. IEEE Int. Conf. Mechatronics Automat. (ICMA),
Vajpayee Indian Institute of Information Technol-
Changchun, China, Aug. 2018, pp. 816–820.
ogy and Management Gwalior, Gwalior, India. His
[10] P. Kachroo and S. Sastry, “Travel time dynamics for intelligent trans-
research interests include algorithm design, the IoT,
portation systems: Theory and applications,” IEEE Trans. Intell. Transp.
and vehicular ad hoc networks.
Syst., vol. 17, no. 2, pp. 385–394, Feb. 2016.
[11] A. Khan, F. Ullah, Z. Kaleem, S. U. Rahman, H. Anwar, and Y.-Z.
Cho, “EVP-STC: Emergency vehicle priority and self-organising traffic
control at intersections using Internet-of-things platform,” IEEE Access,
vol. 6, pp. 68242–68254, 2018, doi: 10.1109/ACCESS.2018.2879644.
[12] M. B. Younes and A. Boukerche, “An efficient dynamic traffic
light scheduling algorithm considering emergency vehicles for intelli- Navin Dhakad is currently pursuing the bache-
gent transportation systems,” Wireless Netw., vol. 24, pp. 2451–2463, lor’s degree with the Indian Institute of Information
Mar. 2018. Technology Pune, Pune, India. His research inter-
[13] Hua Wei, Guanjie Zheng, “Intellilight: A reinforcement learning ests include algorithms, and intelligent transportation
approach for intelligent traffic light control,” in Proc. 24th ACM system (ITS).
SIGKDD Int. Conf. Knowl. Discovery Data Mining, London, U.K.,
Aug. 2018, pp. 2496–2505.
[14] W. Farooq, M. A. Khan, and S. Rehman, “A cluster based multicast
routing protocol for autonomous unmanned military vehicles (AUMVs)
communication in VANET,” in Proc. Int. Conf. Comput., Electron.
Electr. Eng. (ICE Cube), Apr. 2016, pp. 42–48.
Authorized licensed use limited to: McMaster University. Downloaded on May 03,2020 at 05:04:08 UTC from IEEE Xplore. Restrictions apply.