You are on page 1of 5

Self Driving AI Cars

Akshay Karthik Anant Vikram Singh


Dept. of Computer Science and Engineering Dept. of Computer Science and Engineering
RNS Institute of Technology RNS Institute of Technology
Bangalore, India Bangalore, India
1rn19cs012.akshay@gmail.com 1rn19cs015.anantvikramsingh@gmail.com

Anubhab Saha Astitva Singh


Dept. of Computer Science and Engineering Dept. of Computer Science and Engineering
RNS Institute of Technology RNS Institute of Technology
Bangalore, India Bangalore, India
1rn19cs016.anubhabsaha@gmail.com 1rn19cs022.astitva@gmail.com

Dr. Kiran P
Head of Department
Dept. of Computer Science and Engineering
RNS Institute of Technology
Bangalore, India

Abstract—Self-driving cars are an emerging technology that revolutionize the way we travel. These vehicles use a combina-
utilizes advanced machine learning algorithms, including deep tion of sensors, cameras, and advanced algorithms to interpret
learning, to enable vehicles to navigate roads and traffic without the surrounding environment and make decisions about how to
human intervention. This technology has the potential to revo-
lutionize the transportation industry by improving road safety, safely navigate the road. The development of self-driving AI
reducing congestion, and increasing mobility for people who cars is being driven by several factors, including the desire to
cannot drive or do not have access to transportation. Deep reduce traffic congestion, increase safety, and improve access
learning is a subset of artificial intelligence which contains to transportation for people who cannot drive. In this article,
algorithms to train computers using copious amounts of data, this we will explore the key features and benefits of self-driving
is done so that it doesn’t have to be programmed individually.
These algorithms are critical to self-driving cars as they allow AI cars and the challenges that must be overcome to make
the vehicle’s sensors, such as cameras and lidar, to process and them a reality.
analyze visual data from the surrounding environment to make
informed decisions. One of the primary challenges of developing A. Key Features of Self-Driving AI Cars
self-driving cars is ensuring their safety in unpredictable and
complex driving scenarios. Deep learning algorithms enable Self-driving AI cars are equipped with a variety of sensors
vehicles to learn from a vast amount of data, including diverse and cameras that allow them to perceive the world around
and rare driving scenarios, to improve their decision-making them. These sensors include :
ability. Deep learning architectures, such as Convolutional neural
networks(CNNs) and Recurrent neural networks(RNNs), have 1) Lidar: A laser-based sensor that creates a 3D map of the
been used to train the AI of self-driving cars. Convolutional car’s surroundings.
neural networks have been used to process visual data from cam- 2) Radar: A radio-based sensor that detects the presence
eras, Lidars and Radars to identify objects, such as jay-walking and distance of objects.
pedestrians, traffic lights, road blockages and other vehicles. 3) Cameras: High-resolution cameras that capture visual
Recurrent neural networks have been used to understand and
predict the movement of other vehicles, pedestrians and animals information about the environment.
to anticipate potential collisions. The development of self-driving 4) GPS: A global positioning system that provides location
cars using deep learning algorithms is a rapidly evolving field. information.
As the technology continues to advance, it has the potential to 5) Inertial measurement unit (IMU): A sensor that mea-
transform the transportation industry, improve road safety, and sures the car’s acceleration and rotation.
increase access to transportation for people around the world.
Index Terms—Convolutional Neural Networks, Recurrent neu- These sensors work together to create a detailed map of
ral networks, lidar, radar, deep learning, self driving the car’s surroundings, which is then processed by advanced
algorithms to make decisions about how to navigate the road.
I. I NTRODUCTION
Self-driving AI cars, also known as autonomous vehicles B. Benefits of Self-Driving AI Cars
(AVs), are a rapidly developing technology that promises to Self-driving AI cars offer a number of benefits, including:
1) Improved safety: Self-driving AI cars are designed to until the 2000s that significant progress was made in
reduce the number of accidents caused by human er- this area, with the development of advanced sensors,
ror. According to the National Highway Traffic Safety machine learning algorithms, and other technologies.
Administration, 94% of accidents are caused by human In 2004, the DARPA Grand Challenge was launched,
error, and self-driving AI cars have the potential to which encouraged researchers and engineers to
significantly reduce this number. develop self-driving cars that could navigate through
2) Increased mobility: Self-driving AI cars can provide difficult terrain and complete complex tasks without
transportation for people who are unable to drive, such human intervention. This challenge sparked a wave
as the elderly or people with disabilities. This can of innovation and investment in the field, with major
help to improve their quality of life and increase their companies like Google, Uber, and Tesla investing
independence. heavily in self-driving AI technology.
3) Reduced traffic congestion: Self-driving AI cars are ex-
pected to reduce traffic congestion by optimizing traffic 2) History of Self-Driving AI Cars: The idea of self-
flow and reducing the number of cars on the road. driving cars has been around for decades, with the first
4) Improved efficiency: Self-driving AI cars can be pro- prototypes developed in the 1980s. However, it was not
grammed to drive in a more fuel-efficient manner, reduc- until the 2000s that significant progress was made in
ing carbon emissions and improving the environment. this area, with the development of advanced sensors,
machine learning algorithms, and other technologies.
C. Challenges of Self-Driving AI Cars
In 2004, the DARPA Grand Challenge was launched,
While self-driving AI cars offer many benefits, there are which encouraged researchers and engineers to
also a number of challenges that must be overcome before develop self-driving cars that could navigate through
they can become a reality. These challenges include: difficult terrain and complete complex tasks without
1) Safety: Self-driving AI cars must be able to operate human intervention. This challenge sparked a wave
safely in all conditions, including bad weather, heavy of innovation and investment in the field, with major
traffic, and unforeseen events such as accidents or road companies like Google, Uber, and Tesla investing
closures. Ensuring the safety of self-driving AI cars is a heavily in self-driving AI technology.
major challenge that must be addressed before they can
become widely adopted. 3) Challenges and Opportunities: Despite the significant
2) Regulation: The development and deployment of self- progress made in self-driving AI cars, there are
driving AI cars are subject to a complex web of regula- still many challenges and opportunities facing this
tions and laws that must be navigated. These regulations technology. One of the biggest challenges is ensuring
vary by country and state and must be carefully followed the safety and reliability of self-driving cars, especially
to ensure that self-driving AI cars can be legally oper- in complex and unpredictable environments. Other
ated. challenges include the legal and regulatory frameworks
3) Infrastructure: Self-driving AI cars require a complex needed to govern self-driving cars, as well as the ethical
infrastructure to operate, including advanced sensors and and social implications of this technology. However,
communication networks. Building this infrastructure is there are also many opportunities associated with
a major challenge that must be addressed before self- self-driving AI cars.
driving AI cars can become a reality.
4) Public acceptance: Self-driving AI cars are a new tech- 4) Recent Research and Publications: There has been a
nology, and many people may be hesitant to trust them. significant amount of research and publications in the
Ensuring that the public is comfortable with self-driving field of self-driving AI cars in recent years. One re-
AI cars is a major challenge that must be addressed to cent study by researchers at the Massachusetts Institute
ensure their widespread adoption. of Technology (MIT) examined the ethical and moral
dilemmas associated with self-driving cars and proposed
II. L ITERATURE S URVEY
a framework for making ethical decisions in autonomous
Self-driving AI cars are one of the most fascinating driving situations. Another study by researchers at Stan-
advancements in the field of artificial intelligence and ford University explored the use of machine learning
transportation. Self-driving cars have been an active area of algorithms to improve the safety and reliability of self-
research and development for several years. In this literature driving cars, by predicting potential accidents and taking
survey, we will explore the different aspects of self-driving preemptive measures to avoid them.
AI cars, including their history, development, and prospects.
III. A RCHITECTURE
1) History of Self-Driving AI Cars: The idea of self- Some of the algorithms used in the aforementioned project
driving cars has been around for decades, with the first are, YOLO and Faster R-CNN for object detection tasks.
prototypes developed in the 1980s. However, it was not YOLO short for “You Only Look Once” more precisely
YOLO version-8, is a real-time object detection algorithm that
provides high-accuracy outputs by dividing the image into a
grid and predicting bounding boxes, and classifying the object
bounded by the box into one of the names described by the
programmer during training along with the probability of the
classification. On the other hand, Faster R-CNN is a region-
based Convolutional neural network that proposes candidate
regions in an image and applies a classifier to each proposed
region to identify objects. Even though Faster R-CNN had
higher accuracy and robustness when compared to YOLO v8
in the detection of smaller objects it was spotted to be dilatory
and unpractical for this particular application, which needed
instantaneous, real-time image processing. Hence, YOLO v8
was found to be more suitable.
A. Yolo
YOLO v8 is a state-of-the-art object detection algorithm that
achieves high accuracy and real-time performance by utilizing
Fig. 1. Detailed Block Diagram of YOLO Architecture (Source: Research-
a unique architecture that combines a backbone network, Gate)
neck network, and head network. The backbone network,
which is based on the DarkNet architecture, is responsible for
extracting features from the input image. This network consists B. Faster R CNN:
of 13 convolutional layers followed by 3 max-pooling layers.
The neck network, also known as the SPP (Spatial Pyramid Faster R-CNN is a popular object detection algorithm that is
Pooling) module, is responsible for aggregating features from widely used in computer vision applications. It was proposed
multiple scales. This network consists of four pooling layers, by Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun in
each with a different output size. The outputs of the pooling 2015. The algorithm is built on the basis of R-CNN and Fast
layers are concatenated and fed into several fully connected R-CNN, which are the earlier versions of the algorithm. Faster
layers to produce a fixed-length feature vector. Finally, the R-CNN is designed to overcome the limitations of the previous
head network is responsible for predicting object classes and versions, such as the slow processing speed and the need for
bounding boxes. This network consists of several convolu- multiple stages of computation. The Faster R-CNN algorithm
tional layers followed by two fully connected layers. The consists of two main parts: a region proposal network (RPN)
output of the head network is a tensor with a fixed shape, and a region-based convolutional neural network (R-CNN).
which is then processed to produce bounding boxes and class The RPN is responsible for generating region proposals that
probabilities. To better understand the YOLO v8 architecture, are likely to contain objects. The R-CNN uses these proposals
we can look at its block diagram. The input image is first fed to classify the objects and predict their bounding boxes. The
into the backbone network, which consists of convolutional RPN is a fully convolutional network that takes an image
layers and max- pooling layers. The output of the backbone as input and outputs a set of object proposals, which are
network is then passed through the neck network, which rectangular regions of interest (RoIs) that are likely to contain
consists of four pooling layers and fully connected layers. The objects. The RPN is trained to maximize the overlap between
output of the neck network is a fixed-length feature vector the predicted proposals and the ground- truth bounding boxes
that is then fed into the head network. The head network of objects. The R-CNN then takes these RoIs as input and
consists of several convolutional layers and fully connected extracts a fixed-length feature vector for each of them using
layers and produces a tensor with a fixed shape that represents a convolutional neural network. The feature vectors are then
the predicted object classes and bounding boxes. The output fed into a set of fully connected layers that classify the
tensor is then processed to produce final object detection objects and predict their bounding boxes. Faster R-CNN uses
results, which include the object classes, bounding boxes, and a shared convolutional neural network to extract features from
confidence scores. Overall, the YOLO v8 algorithm is a highly the input image. This network is trained on a large dataset,
effective and efficient object detection algorithm that utilizes a such as ImageNet, to learn features that are useful for object
unique architecture to achieve real-time performance with high detection. The RPN and the R-CNN share the same network
accuracy. The below figure, Fig 1, shows the block diagram up to the feature extraction layer, which allows them to
of YOLO v8 architecture used to build the Self Drive AI Car. share the computation and reduce the processing time. Faster
R-CNN has several advantages over its predecessors. It is
faster than R-CNN and more accurate than Fast R-CNN. It
also has a simpler architecture than its predecessors, which
makes it easier to implement and train. Additionally, Faster
R-CNN can detect objects of different scales and aspect ratios, V. R ESULT A NALYSIS
which makes it more robust to variations in object size and
shape. Overall, Faster R-CNN is a powerful algorithm that has
become a standard in the field of object detection.
The below figure, Fig 2, shows the block diagram of Faster
R-CNN architecture used to build the Self Drive AI Car.

Fig. 3. A view from the object detections in a traffic video. Objects are
labeled correspondingly.

Fig. 2. Block Diagram of Faster R-CNN (Source: ResearchGate)

IV. S TAGES OF THE P ROJECT


The process of training an object detection model using
a time-lapse video of the college started with collecting the
video data and labeling the objects of interest. The video
frames were analyzed to identify the objects such as vehicles,
pedestrians, and bicycles. The annotated data was used to train
the Faster R-CNN algorithm. The training process involved
optimizing the neural network’s weights to minimize detection
errors. After the training process, the model was tested on live
cameras on the college campus. However, there was a delay
Fig. 4. Table of all the labels used from the COCO dataset
in the detection due to the use of Faster R-CNN. Therefore,
it was decided to switch to a faster object detection algorithm
called YOLO v8 (You Only Look Once version 8). YOLO
v8 is a real-time object detection algorithm that performs
object detection in a single pass, making it much faster than
Faster R-CNN. To further improve the model’s performance,
an ultrasonic-based object detection and avoidance algorithm
was incorporated into the system. This algorithm helps the
model avoid obstacles in the environment and navigate around
them to avoid collisions. The testing of the model was done
in a simulator made in the Unreal Engine. The simulator
allowed for the model to be tested in a virtual environment
that closely mimics the real world. This testing allowed for the
fine-tuning of the model’s performance and the identification
of any remaining issues that needed to be addressed. Once the
model was fully tested and optimized, it was integrated into Fig. 5. Continuous live output of the objects identified in the image.
an RC car using an Arduino Uno and a stepper motor. The RC
car was equipped with 5 ultrasonic sensors and the data from
that was used to detect objects in the environment in real time.
The ultrasonic-based object avoidance algorithm was used to
guide the car around obstacles in its path.
VI. C ONCLUSION
In conclusion, the development of self-driving cars has
the potential to revolutionize the way we travel and signif-
icantly improve road safety. While there are still technical
and regulatory challenges to be addressed, the progress made
in recent years is promising. Self-driving cars can provide
benefits such as reduced traffic congestion, increased mobility
for the elderly and disabled, and improved energy efficiency.
However, there are also potential drawbacks, such as job losses
in the transportation sector and ethical dilemmas related to
decision-making algorithms. As we continue to advance in
this field, it is important to carefully consider the implications
and ensure that the technology is developed in a responsible
and ethical manner. Overall, self-driving cars offer exciting
possibilities for the future of transportation, and it will be
fascinating to see how this technology develops and impacts
our society.
ACKNOWLEDGMENT
People are continually appreciative of others for their aid
and direction in achieving their goals. This formal letter of
thanks is an attempt to express our gratitude to everyone who
assisted us in realizing this project and kept us motivated
despite the challenges it presented.
R EFERENCES
[1] ”A Survey on Autonomous Vehicles” by Ali Al-Maashri and Salah Al-
Majeeni, in IEEE Access, 2019.
[2] ”Autonomous Cars: Past, Present and Future - A Review of the Develop-
ments in the Last Century, the Present Scenario and the Expected Future
of Autonomous Vehicle Technology” by Shukla et al., in International
Journal of Engineering Research and Applications, 2019.
[3] ”Artificial Intelligence for Self-Driving Cars: A Review” by Muhammad
Arif and Sajjad Ahmed, in International Journal of Advanced Computer
Science and Applications, 2021.
[4] ”Deep Reinforcement Learning for Autonomous Driving: A Survey”
by Li and Liang, in IEEE Transactions on Intelligent Transportation
Systems, 2020.
[5] ”A Survey on Sensor Fusion in Autonomous Vehicles” by Mohan and
Verma, in Sensors, 2020.
[6] ”Machine Learning Techniques for Autonomous Driving: A Survey”
by Sermanet and LeCun, in IEEE Intelligent Transportation Systems
Magazine, 2018.
[7] ”A Survey on Motion Planning and Control Techniques for Autonomous
Driving” by Bödefeld et al., in IEEE Transactions on Intelligent Trans-
portation Systems, 2020.
[8] ”A Comprehensive Review on Deep Learning Techniques for Au-
tonomous Vehicles” by Vignesh and Karthik, in Journal of Ambient
Intelligence and Humanized Computing, 2021.
[9] ”A Survey of Autonomous Vehicle Simulators” by Katzourakis and
Khan, in Robotics and Autonomous Systems, 2020.
[10] ”A Review of Deep Learning for Autonomous Driving” by Tamaazousti
et al., in Neural Computing and Applications, 2021.

You might also like