Professional Documents
Culture Documents
Submitted by
S.KEERTHIVASAN
Register No: 1919101018
Submitted by
S.KEERTHIVASAN
Register No: 1919101018
DECEMBER 2022
S.KEERTHIVASAN
Register No: 1919101018
SUBMITTED
BY:
Name :Keerthivasan.S
Register No :19CIVBE064
E-mail id :keerthivasan9162001@gmail.com
Ph.No :8072930969
TO:
Dr. Gitakrishnan Ramadurai
Associate Professor in the Transportation Engineering Division,Department of civil
engineering,Indian institute of technology,Madras.
Page|2
TABLE OF CONTENTS
Acknowledgement
Keerthivasan.s
Page|5
• INTRODUCTION
Vehicle detection and statistics in highway monitoring video scenes are of
considerable significance to intelligent traffic management and control of
the highway. With the popular installation of traffic surveillance cameras,
a vast database of traffic video footage has been obtained for analysis.
Generally, at a high viewing angle, a more-distant road surface can be
considered. The object size of the vehicle changes greatly at this viewing
angle, and the detection accuracy of a small object far away from the road
is low. In the face of complex camera scenes, it is essential to effectively
solve the above problems and further apply them we focus on the above
issues to propose a viable solution, and we apply the vehicle detection
results to multi-object tracking and vehicle counting.
The area of Computer Vision basically deals with anything that humans see and
perceive.
Object Detection is the ability to detect or identify objects in any given image
correctly along with their spatial position in the given image, in the form of
rectangular boxes (known as Bounding Boxes) which bound the object within it.
P a g e | 10
• TYPES OF DETECTORS
P a g e | 11
• labelImg
LabelImg is a graphical image annotation tool. It is written in Python and
uses Qt for its graphical interface. Annotations are saved as XML files in
PASCAL VOC format, the format used by ImageNet. Besides, it also supports
YOLO and CreateML formats.
P a g e | 12
Goal:
What we use
The various deep learning methods use data to train neural network
algorithms to do a variety of machine learning tasks, such as the
classification of different classes of objects. Convolutional neural
networks are deep learning algorithms that are very powerful for
the analysis of images.
1) Convolutional Layer
2) Pooling Layer
3) Fully-Connected layer
P a g e | 14
OUTPUT INPUT
YOLO Algorithm
YOLO is an abbreviation for the term ‘You Only Look Once’. This is
an algorithm that detects and recognizes various objects in a picture (in
real-time). YOLO algorithm employs convolutional neural networks
(CNN) to detect objects in real-time.
Working of yolov3
The YOLOv3 algorithm first separates an image into a grid. Each grid
cell predicts some number of boundary boxes (sometimes referred to as
anchor boxes) around objects that score highly with the aforementioned
predefined classes.
Other comparable algorithms that can carry out the same objective are
R-CNN (Region-based Convolutional Neural Networks made in 2015)
and Fast R-CNN (R-CNN improvement developed in 2017), and Mask
R-CNN.
• MY RESPONSIBILITIES/POSITION IN
INTERNSHIP
Workings
1. Images were collected from the traffic camers from various
locations.
2. The images were classified into two folders,namely “test” and
“train”.
3. The images are annotated/labelled using a tool/software.the
annotations are classified into 5 categories,namely
“car”,”motorbike”,”truck”,”auto” and “bus”.Labels are used to help
identify components in your data which you want to train your model
to identify in datasets that are not labeled. High quality datasets are
essential for computer vision and building a highly performant model.
Creating computer vision models follows the garbage in, garbage out
philosophy which means labeling images carefully and accurately is
important.
4. For this annotation an popular open source software/tool is used
known as“labelImg”,we feed the collected images into this tool,and we
annotate/label the images,after annotation of each image its
saved in the preferred location in ”.xml” format. It’s written in
Python and uses QT for its graphical interface.
5. Finally all the frames/images are annotated.
• CONCLUSION:
From my summer fellowship programme 2022-internship,I hope that
this project is going to be very helpul for CHENNAI CITY POLICE in
data analysis and also for road safety.I was able to get a better
understanding of how the deep learning works,its application and how
effective it is. I enjoyed working with the team. Overall, I found the
internship experience to be positive, and I'm sure I will be able to use the
skills I learned in my career later.
P a g e | 23
+
REFERENCE
1. Training a deep learning architecture for vehicle detection using limited
heterogeneous traffic data
Deepak Mittal; Avinash Reddy; GitakrishnanRamadurai; KaushikMitra; BalaramanRavindran
Link:https://ieeexplore.ieee.org/abstract/document/8328279
Link:
https://openaccess.thecvf.com/content/CVPR2022W/AICity/html/Bharadhwaj_Detecting_V
ehicles_on_the_Edge_Knowledge_Distillation_To_Improve_Performance_CVPRW_2022_paper.
html