
Unlike classification networks such as ResNet or VGG, an object detection algorithm has to identify multiple objects and specify the exact location of each, as shown in the image.
 This property of predicting the bounding boxes around the objects is known
as object localization.
Object localization requires predicting the height, width, and location of the bounding box around each object.
Before specifying the bounding box attributes of each object, the image is divided into S × S grid cells as shown in the picture.
If the center of an object falls in a grid cell, then that grid cell is responsible for predicting the object.
The target label y defines each of the grid cells.
y is a vector given by

y = \begin{bmatrix} p \\ b_x \\ b_y \\ b_h \\ b_w \\ c_1 \\ c_2 \\ \vdots \\ c_n \end{bmatrix}

b_x and b_y give the coordinates of the center of the bounding box, and b_h and b_w give its height and width.
p is known as the object confidence; it gives the probability that an object is present in the bounding box.
c_1, c_2, ..., c_n are the class confidences. For example, if you have two classes to identify, a car and a pedestrian, then c_1 gives the probability that the grid cell contains a car and c_2 gives the probability that it contains a pedestrian.

Note that the object confidence p is different from the class confidence c.
 p is the probability of the presence of an object
within the bounding box irrespective of the class
of object.
c is the probability of the object belonging to a particular class, given that an object is present in the box.
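To make the layout of y concrete, here is a minimal Python sketch of the label vector for a single grid cell, with the two classes ordered as (car, pedestrian); the b-values are illustrative placeholders, not taken from a real image.

```python
# Label vector for one grid cell, following the layout of y above
# (two classes: car, pedestrian). The b-values are illustrative only.
y_with_car = [
    1.0,   # p: an object is present in this cell
    0.45,  # b_x: x-coordinate of the box center
    0.60,  # b_y: y-coordinate of the box center
    0.30,  # b_h: height of the box
    0.55,  # b_w: width of the box
    1.0,   # c_1: the object is a car
    0.0,   # c_2: the object is not a pedestrian
]

# A cell containing no object center gets p = 0;
# the remaining entries are then ignored.
y_empty = [0.0] * 7
```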

IOU
Intersection over Union (IOU) is a measure of the accuracy of the predicted bounding box against the ground truth box (the actual bounding box).
It is the ratio of the area covered by the intersection of the ground truth box and the predicted box to the area covered by the union of these two boxes.
The maximum possible value of IOU is 1. If the measured IOU is greater than a set threshold, we can conclude that the predicted bounding box is close to the ground truth box.
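To make the definition concrete, here is a minimal Python sketch of the IOU computation, assuming each box is given by its corner coordinates (x1, y1, x2, y2); this coordinate convention is an assumption for the example, not something fixed by the text above.

```python
def iou(box_a, box_b):
    """Intersection over Union of two boxes given as (x1, y1, x2, y2)."""
    # Corners of the intersection rectangle.
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Width and height clamp to 0 when the boxes do not overlap.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

# Example: intersection 4, union 28, so IOU ≈ 0.143,
# well below a typical threshold such as 0.5.
print(iou((0, 0, 4, 4), (2, 2, 6, 6)))
```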

Consider a case where two objects share the same center, as shown in the image.

If the two objects fall in the same grid cell, then our model ends up predicting only one of them, which should not be the case.

The concept of anchor boxes, described next, eliminates this problem.

Anchor Boxes
Assuming the object in each grid cell fits in one of two fixed anchor boxes, and that there are two classes of objects to be identified, the target label y is defined as

\small y = \begin{bmatrix} p_1 & b_{x1} & b_{y1} & b_{h1} & b_{w1} & c_{11} & c_{12} & p_2 & b_{x2} & b_{y2} & b_{h2} & b_{w2} & c_{21} & c_{22} \end{bmatrix}^T

In general, if the image is divided into an S × S grid, each grid cell predicts B bounding boxes, and each box predicts C class confidences, then the dimension of the target y is S × S × B(5 + C), where the 5 accounts for p, b_x, b_y, b_h, and b_w.
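As a quick sanity check of this formula, here is a small sketch with illustrative values S = 7, B = 2, and C = 2; these numbers are assumptions for the example, not fixed by the text.

```python
import numpy as np

S, B, C = 7, 2, 2                    # grid size, boxes per cell, classes (illustrative)
y = np.zeros((S, S, B * (5 + C)))    # 5 accounts for p, b_x, b_y, b_h, b_w
print(y.shape)                       # (7, 7, 14)
```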

Hand Engineering
The dataset used to train an object detection model is slightly different from that of an object classification model.
Each image in the training data for object detection is divided manually into S × S grid cells.
If the center of the object of interest falls in a grid cell and its box fits one of the anchor boxes, then the p-value of that anchor box and the class value of that object are set to 1, along with the bounding box attributes.
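A minimal sketch of this labeling step, reusing the target tensor from the previous example; the object's attributes and the choice of matching anchor are illustrative assumptions, not values from a real dataset.

```python
import numpy as np

S, B, C = 7, 2, 2
y = np.zeros((S, S, B * (5 + C)))

# Hypothetical ground-truth object: box center (cx, cy), height, width,
# and class index, all normalized relative to the image. Class 0 = car here.
cx, cy, bh, bw, cls = 0.52, 0.31, 0.20, 0.35, 0

# Grid cell in which the object's center falls.
col, row = int(cx * S), int(cy * S)

# Assume anchor 0 best matches the object's shape; in practice you would
# pick the anchor box with the highest IOU against the ground-truth box.
a = 0
base = a * (5 + C)
y[row, col, base] = 1.0                             # p-value of the matched anchor
y[row, col, base + 1:base + 5] = [cx, cy, bh, bw]   # bounding box attributes
y[row, col, base + 5 + cls] = 1.0                   # class value set to 1
```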

Overview
In the You Only Look Once (YOLO) algorithm, you run the image through a CNN model and detect the objects in a single pass.
This algorithm identifies multiple bounding boxes for the same object. Hence, we use a method called non-max suppression to retain a single prediction box for each object in the image. The rest of the cards show you the step-by-step procedure of how the YOLO algorithm works.
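As a rough sketch of the idea behind non-max suppression (reusing the iou() helper from the IOU section; the exact procedure used by YOLO is covered in the later cards): keep the highest-confidence box, discard boxes that overlap it beyond a threshold, and repeat.

```python
def non_max_suppression(boxes, scores, iou_threshold=0.5):
    """Return indices of the boxes kept after non-max suppression."""
    # Consider boxes in order of decreasing confidence.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    while order:
        best = order.pop(0)
        keep.append(best)
        # Drop remaining boxes that overlap the kept box too much.
        order = [i for i in order if iou(boxes[best], boxes[i]) < iou_threshold]
    return keep
```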
