J Component report
Slot: G1
Methodology:
• The methodology for 2D object detection using YOLO, Spark, and RDDs
consists of several crucial steps:
• Data preparation: the first step usually entails gathering and labelling
a sizable dataset of images or videos. The data must then be arranged in
the particular format that YOLO expects as input.
• Model training: the YOLO model is then trained on the prepared dataset.
This involves configuring the model architecture, setting the training
parameters, and running the training procedure. During training, the
YOLO model learns to recognise objects in images by adjusting its
weights in response to a set of labelled examples.
• Integration with Spark: after training, the YOLO model can be integrated
with Spark by loading it into memory and distributing the processing of
input data across a cluster of machines. The RDD abstraction is used to
represent the data and to carry out computations in parallel.
• Object detection: with the YOLO model and the Spark-RDD integration in
place, object detection is run on the input data. The YOLO model
processes each input and produces a set of bounding boxes with an
associated object-class prediction for every detected object.
• Overall, 2D object detection using YOLO, Spark, and RDD is a challenging
task that requires knowledge of distributed computing, computer vision,
and data analysis.
Model:
• The model used for 2D object detection with YOLO, Spark, and RDD is
built on the You Only Look Once (YOLO) method, a deep learning model
for object detection. YOLO splits the input image into a grid of cells
and predicts, for each cell, a set of bounding boxes with related class
probabilities.
• The YOLO model consists of a deep neural network with numerous
convolutional layers that extract features from the input image. Because
the network architecture is designed to be fast and efficient, real-time
object detection is possible even on low-end devices.
• During training, the YOLO model is tuned to reduce the sum of squared
errors between the predicted bounding boxes and the ground-truth
bounding boxes. The YOLO loss function combines a localisation loss, a
confidence loss, and a classification loss in order to maximise the
model's accuracy while keeping it fast.
• Once trained, the YOLO model can be used in conjunction with Apache
Spark and RDDs to distribute and parallelise the processing of large
datasets. Spark's RDD abstraction enables effective partitioning and
processing of data across a cluster of machines, giving faster
processing times and greater scalability.
• During inference, the YOLO model runs an input image or video frame
through the neural network to produce a collection of bounding boxes
and associated class probabilities for each detected object. The output
can then be post-processed to remove false positives and to display the
detected objects.
• All things considered, the YOLO model used in 2D object detection with
Spark and RDD is a highly effective and scalable deep learning model
that can recognise objects in real time with high accuracy, making it
appropriate for a variety of computer vision applications.
• The YOLO model learns to optimise the accuracy and speed of object
detection while being trained on a dataset of labelled images or videos.
Once trained, the model can be combined with Apache Spark and RDDs to
analyse big datasets in a distributed, parallel manner.
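The post-processing step mentioned above is typically non-maximum suppression (NMS), which removes low-confidence and duplicate overlapping boxes. A minimal sketch, with illustrative thresholds and boxes not taken from the report:

```python
# Minimal non-maximum suppression (NMS), the usual post-processing step
# for filtering overlapping YOLO detections. Thresholds are illustrative.

def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def nms(detections, iou_threshold=0.5, conf_threshold=0.3):
    """detections: list of (box, confidence); returns the kept detections."""
    # Drop low-confidence boxes, then greedily keep the highest-confidence
    # box of each overlapping cluster.
    dets = sorted((d for d in detections if d[1] >= conf_threshold),
                  key=lambda d: d[1], reverse=True)
    kept = []
    for box, conf in dets:
        if all(iou(box, k[0]) < iou_threshold for k in kept):
            kept.append((box, conf))
    return kept

boxes = [((10, 10, 50, 50), 0.9),      # strong detection -> kept
         ((12, 12, 52, 52), 0.8),      # overlaps the first -> suppressed
         ((100, 100, 140, 140), 0.7),  # separate object -> kept
         ((0, 0, 5, 5), 0.1)]          # below confidence threshold -> dropped
print(nms(boxes))  # → [((10, 10, 50, 50), 0.9), ((100, 100, 140, 140), 0.7)]
```

Production pipelines usually call a library routine for this (e.g. OpenCV's `cv2.dnn.NMSBoxes`), but the greedy logic is the same.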
YOLOv3 Architecture:
• YOLO (You Only Look Once) is a well-known object detection system that
detects objects in real time. YOLOv3, the third iteration of the
algorithm, offers many advantages over its predecessors. The major
parts of the YOLOv3 architecture are:
o Input: YOLOv3 takes a 416x416-pixel image as its input.
o Backbone network: the Darknet-53 design, which borrows residual
connections from the ResNet architecture, serves as the YOLOv3
backbone. Darknet-53 is more effective for object detection than
plain ResNet because it is built with fewer layers and fewer
parameters, making it faster while remaining accurate. Numerous
convolutional layers are then applied to the backbone network's
output to extract features from the image.
o Detection layers: the object detection layers are responsible for
spotting objects in the image. The YOLOv3 architecture has three
detection layers, each of which identifies objects at a different
scale. The detection layers use anchor boxes to help predict object
position and size.
o Output: the output of YOLOv3 is a list of bounding boxes together
with their associated class probabilities. Each bounding box
consists of four coordinates (x, y, width, and height), and each
class probability denotes the likelihood that the object belongs to
a specific class.
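The anchor-box prediction described above follows YOLOv3's decoding equations: the network emits raw offsets (t_x, t_y, t_w, t_h) per cell and anchor, which are turned into a box via b_x = sigmoid(t_x) + c_x, b_y = sigmoid(t_y) + c_y, b_w = p_w * exp(t_w), b_h = p_h * exp(t_h). A sketch for one cell and one anchor, with illustrative values:

```python
import math

# Decode one raw YOLOv3 prediction (t_x, t_y, t_w, t_h) into a box.
# The sigmoid keeps the centre inside its grid cell; the exponential
# scales the anchor (prior) box. Values below are illustrative only.

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def decode(t, cell, anchor, grid_size, img_size=416):
    tx, ty, tw, th = t
    cx, cy = cell      # column/row index of the responsible grid cell
    pw, ph = anchor    # anchor (prior) width/height in pixels
    stride = img_size / grid_size
    bx = (sigmoid(tx) + cx) * stride   # box centre x, in pixels
    by = (sigmoid(ty) + cy) * stride   # box centre y, in pixels
    bw = pw * math.exp(tw)             # box width, in pixels
    bh = ph * math.exp(th)             # box height, in pixels
    return bx, by, bw, bh

# A raw prediction of all zeros places the box centre in the middle of
# its cell and sets the size exactly to the anchor's size.
print(decode((0.0, 0.0, 0.0, 0.0), cell=(6, 6), anchor=(116, 90),
             grid_size=13))  # → (208.0, 208.0, 116.0, 90.0)
```

Each of the three detection layers repeats this decoding at its own grid size (13x13, 26x26, 52x52 for a 416x416 input), which is how YOLOv3 detects objects at different scales.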