Professional Documents
Culture Documents
Abstract— Commonly in Malaysia, the road surfaces deteriorate up in a hospital for recovery treatments [2]. Within a few days
over time which results to potholes. With the increasing number later, on the 3rd of January 2021, a 75-year-old motorcyclist
of potholes on the road, it has become a road hazard, potentially had met fatality after crashing into a pothole in Kuala Lumpur
harming the safety of the Malaysian citizens. Potholes generally where he flew off his motorcycle and landed 30m away from
will cause drivers to skew away from their original direction his motorcycle [3]. With these cases, a method would be
which may result in vehicle damages and/or road accidents. needed to assist drivers to quickly detect potholes on road
Computer vision has improved significantly in the past decade surfaces.
to improve the technologies that require image processing, such
as pothole detection. However, it has several limitations in Several research have been done for pothole detection.
pothole detection as potholes have inconsistent shapes, making Authors in [4] used popular semantic segmentation techniques
it difficult to obtain an accurate prediction. Another limitation such as Mask-RCNN and U-Net algorithms and they yielded
is the speed of the detection algorithm is not able to predict the an accuracy of 80.00% and 41.67% and respectively. This
pothole in real time. In this paper, the use of computer vision shows that Mask RCNN significantly outperformed U-Net,
technology on vehicles is presented as a form of solution to detect with especially poor performance by U-Net on images with
the potholes in real time. A deep learning model based on multiple potholes and long shot images. On the other hand,
Convolutional Neural Networks, YOLOv5 is found to improve authors from [5] have developed a stereo vision-based pothole
the accuracy of the prediction as compared to past results. The
detection system based on a novel disparity transformation
findings on the trained YOLOv5 model have a mAP@0.5 of 80.8
algorithm and disparity map modelling algorithm and yielded
%, 82.2 % and 82.5 % on the YOLOv5m6, YOLOv5s6 and
YOLOv5n6 respectively. The trained YOLOv5n6 model was an accuracy of 99.64%. Disparity map modelling prevents
chosen to be the main image processing model for its robust disparity errors of larger than one pixel which provides higher
performance to size ratio. The results were able to demonstrate accuracy than point cloud modelling However, this research
the actual deployment of the trained YOLOv5n6 model on the results in a further need for segmentation of road surfaces as
road in real time with distance estimation. not all surfaces of the road can be treated as quadratic. The
pothole detection model in paper [6] was constructed using
Keywords—computer vision, pothole detection, convolutional “you only look once” (YOLO) algorithm based on
neural network convolutional neural network (CNN). In this study, a pothole
image was first converted to a single channel using grey scale
I. INTRODUCTION to minimize loss in quality of the image and was then fed into
. Technologies have improved human lives by several fold the YOLO algorithm. Overall, the model showed a mean
over the past decade until humans are almost capable to average precision (mAP) of 77.86%. On the other hand,
developing an autonomous navigation system. Although authors in [7] proposed a pothole detection system using a
multiple advancements in technology had been invented, there black-box camera. The proposed algorithm consists of 3 steps:
is one flaw in technology that still bothers people till now: pre-processing (cropping, greyscaling, and thresholding),
pothole on road surfaces. Potholes vary in size and shape candidate extraction (detecting lanes to remove unwanted
caused by the ground water expanding and contracting. Thus, noise) and cascaded detection (distinguish true or false
potholes that have severe depression can cause vehicles to detected potholes) was able to obtain a mAP of 71%.
skew away from the original direction, causing drivers to lose Although extensive research has been conducted on various
control, resulting in vehicle damages and/or road accidents. methodologies as well as the accuracy of the pothole
This problem is more prevalent in lesser developed countries detection, these literatures were lacking in terms of the speed
and can be found quite commonly in Malaysia. With the help of the detection in real time.
from Waze, an application for Malaysian road traffic In our paper, the pothole detection system will be constructed
navigation, reported a whopping 52,295 potholes in the state using the YOLOv5 model to build a pothole recognition
Selangor and city Kuala Lumpur alone. Besides that, the methodology. After training the model using the existing
statistics show that the total deaths caused from pothole datasets, the optimal model of the YOLOv5 is selected to be
accidents from the year 2000 to 2011 is 840 and amounts to deployed on the Nvidia Jetson Xavier X which is located on
11.2% of all the road deaths based on road defects [1]. the vehicle. The images of the moving vehicle from the Intel
Many local drivers have expressed their concerns on Realsense D435i Camera will be given as input, then the
potholes, where even the Malaysian Minister Khairy system will be able to detect the potholes and its distance as
Jamaluddin, who had met an accident in December 2020, the vehicle is moving in real time. The remainder of our paper
where he tripped over a pothole while cycling and had ended is structured as follows; Section II provides a description of
Authorized licensed use limited to: Vasavi College of Engineering. Downloaded on September 22,2023 at 02:07:12 UTC from IEEE Xplore. Restrictions apply.
the YOLOv5 segmentation framework. Our methodology is In addition, YOLOv5 has 5 different models to choose from
presented in Section III, with results and discussion in Section with YOLOv5x6 (extra-large) being the most accurate model
IV. Finally, a summary of the primary findings is presented in but requires more graphical computational cost and being the
Section V. biggest in size while YOLOv5n6 (nano) is the least accurate
but requires less graphical computational cost and smallest in
II. FRAMEWORK size. The statistics and performances of each model can be
A. Model development found in Figures 3 & 4. These model weights were pretrained
using COCO dataset that contains 5000 images of 80 classes
In this paper the CNN (Convolutional Neural Network) of identification with 300 epochs of training cycles.
model that will be used is You Only Look Once version 5
(YOLOv5) by Ultralytics where their model can be found in
GitHub. YOLOv5 is based off of Pytorch machine learning
library which can be executed in Linux-based OS using
Python 3.6 and above. The schematic architecture of
YOLOv5 can be seen in Figure 1.
Authorized licensed use limited to: Vasavi College of Engineering. Downloaded on September 22,2023 at 02:07:12 UTC from IEEE Xplore. Restrictions apply.
B. Hardware integration with YOLOv5 Table 1 - Tabulation of Trained Models Pothole Detection
The hardware architecture is shown in Figure 5. An Intel Performances
Realsense D435i camera will be used for its stereoscopic Pothole Detection Performances
functions, which allows for 3D calculations and perspective
on distance estimation. The camera is connected to an Nvidia Models Precision (%) Recall (%) mAP@0.5 (%) mAP@0.5:0.95 (%)
Jetson Xavier NX supercomputer which holds and operates
the AI algorithm in detecting potholes and the distance of the YOLOv5m6 84.2 71.3 80.8 53.9
pothole from the vehicle. The Nvidia Jetson Xavier NX
YOLOv5s6 80.7 75.4 82.2 54.7
consumes 30 W of power with specs of 8 GB RAM and 384
CUDA cores (GPU) which are crucial for the use of AI YOLOv5n6 81.9 74.6 82.5 53.8
computations.
Authorized licensed use limited to: Vasavi College of Engineering. Downloaded on September 22,2023 at 02:07:12 UTC from IEEE Xplore. Restrictions apply.
The Nvidia Jetson Xavier Nx and the Intel Realsense D435i
camera were installed on-board a vehicle; a Myvi car and
were powered through a car inverter with an output power of
220W as can be seen in Figure 8. The vehicle was deployed
and tested in the neighbourhood of Casawood Cybersouth,
Malaysia.
(c)
(d)
Figure 9(a)-(d): Results of the Pothole Detection System
on Vehicle in real-time
(b)
Authorized licensed use limited to: Vasavi College of Engineering. Downloaded on September 22,2023 at 02:07:12 UTC from IEEE Xplore. Restrictions apply.
The trained YOLOv5n6 model was chosen to be the main
image processing model for its robust performance to size
ratio which perfectly suited the mobility on deploying it and
lack of GPU cores in the Nvidia Jetson Xavier NX. Following
that, the detection model was then integrated with the Intel
Realsense D435i Stereo Camera to provide images as the
input. The devices on-board the vehicle were installed and
tested in a household area called Casawood Cybersouth
located in Dengkil, Malaysia.
The results were able to demonstrate the actual deployment
of the trained YOLOv5n6 model on the road in real time with
distance estimation. However, the results also showed some
minor errors as it detects manholes, trees and shadows as
potholes which is due to the resemblances with vast images
of potholes in the image dataset. It is recommended that
further research can be done to enhance the image processing
particularly during night-time and increase detection
performance on long distance object detection.
Figure 10: Manhole detected of the
Pothole Detection System ACKNOWLEDGMENT
In addition, the model also detected certain parts of trees as This work is part of the ‘Road Obstacles and Pedestrian
potholes and even detected shadows as potholes. The Awareness System’ (ROPAS) project and funded by the
detected trees and shadows are based on the angle at which Empower Grant Scheme by Heriot-Watt University
the camera was mounted and the intensity of light which may Malaysia.
off-balance the white balance of the camera. This off-balance
could make the trees and shadows look like potholes to the REFERENCES
model as shown in Figure 11. Overall, the model had done a [1] Y. Darma, M. R. Karim, and S. Abdullah, “An analysis of
good job at detecting potholes with its mAP@0.5 of 82.5%. Malaysia road traffic death distribution by road environment,”
Sadhana - Academy Proceedings in Engineering Sciences, vol. 42,
no. 9, pp. 1605–1615, Sep. 2017, doi: 10.1007/s12046-017-0694-
9.
[2] “Malaysia’s Minister Khairy Jamaluddin injures from fall after
bicycle hits pothole,” The Straits Times, Dec. 28, 2020. Accessed:
May 31, 2022. [Online]. Available:
https://www.straitstimes.com/asia/se-asia/malaysias-minister-
khairy-jamaluddin-injured-from-fall-after-bicycle-hits-pothole
[3] K. Perimbanayagam, “75-year-old man killed after crashing into
pothole,” New Straits Times, Jan. 03, 2021. Accessed: May 31,
2022. [Online]. Available:
https://www.nst.com.my/news/nation/2021/01/654169/75-year-
old-man-killed-after-crashing-pothole
[4] S. Thiruppathiraj, U. Kumar, and S. Buchke, “Automatic pothole
classification and segmentation using android smartphone sensors
and camera images with machine learning techniques,” in 2020
IEEE REGION 10 CONFERENCE (TENCON), Nov. 2020, pp.
1386–1391. doi: 10.1109/TENCON50793.2020.9293883.
[5] R. Fan, U. Ozgunalp, B. Hosking, M. Liu, and I. Pitas, “Pothole
Detection Based on Disparity Transformation and Road Surface
Modeling,” IEEE Transactions on Image Processing, vol. 29, pp.
897–908, 2020, doi: 10.1109/TIP.2019.2933750.
[6] J. W. Baek and K. Chung, “Pothole classification model using
edge detection in road image,” Applied Sciences (Switzerland),
Figure 11: Part of trees detected of the Pothole Detection vol. 10, no. 19. MDPI AG, Oct. 01, 2020. doi:
System 10.3390/APP10196662.
[7] Y. Jo and S. Ryu, “Pothole detection system using a black-box
V. CONCLUSION camera,” Sensors (Switzerland), vol. 15, no. 11, pp. 29316–29331,
Nov. 2015, doi: 10.3390/s151129316.
This paper primarily shows the work process done on the [8] S. Liu, L. Qi, H. Qin, J. Shi, and J. Jia, “Path Aggregation Network
development of pothole detection system. The pothole for Instance Segmentation,” Mar. 2018.
detection methodology was based off of YOLOv5 CNN [9] GitHub, “GitHub - ultralytics/yolov5.”
model that has included its pre-processing backbone DarkNet https://github.com/ultralytics/yolov5 (accessed May 31, 2022).
and its neck PANet which essentially entraps the object [10] NVIDIA Developer, “Getting Started With Jetson Xavier NX
detection and use convolutional methods to achieve the image Developer Kit.”
https://developer.nvidia.com/embedded/learn/get-started-jetson-
processing model. The findings on the trained YOLOv5 xavier-nx-devkit (accessed May 31, 2022).
model have a mAP@0.5 of 80.8 %, 82.2 % and 82.5 % on the
YOLOv5m6, YOLOv5s6 and YOLOv5n6 respectively.
Authorized licensed use limited to: Vasavi College of Engineering. Downloaded on September 22,2023 at 02:07:12 UTC from IEEE Xplore. Restrictions apply.