0% found this document useful (0 votes)

119 views18 pages

Understanding mAP in Object Detection

mAP stands for mean Average Precision, which is a popular metric for evaluating object detection models. AP is calculated as the area under the precision-recall curve. It measures the accuracy of a model by averaging the precision over different recall levels. The COCO dataset uses a 101-point interpolated AP over 10 IoU thresholds from 0.5 to 0.95 with a step size of 0.05 to calculate mAP, which is the primary evaluation metric for object detection models on COCO.

Uploaded by

Mai Minh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

119 views18 pages

Understanding mAP in Object Detection

Uploaded by

Mai Minh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

mAP for Object Detection

Karel Horak
Brno University of Technology / Czech Technical University in Prague
horak@[Link]

Slides modified, courtesy of Jonathan Hui

mAP for Object Detection
● mAP stands for mean Average Precision.

● AP is a popular metric in measuring the accuracy of object detectors like Faster R-CNN, SSD, etc.

● Average precision computes the average precision value for recall value over 0 to 1. It sounds
complicated but actually pretty simple as we illustrate it with an example.

● But before that, we will do a quick recap on precision, recall, and IoU first.
Precision & recall
● Precision measures how accurate is your predictions. i.e. the percentage of your predictions are
correct.

● Recall measures how good you find all the positives. For example, we can find 80% of the possible
positive cases in our top K predictions.

● F1 score is a measure combining Precision and Recall.

IoU (Intersection over union)
● IoU measures the overlap between two boundaries.

● We use that to measure how much our predicted boundary overlaps with the ground truth (the real
object boundary).

● In some datasets, we predefine an IoU

threshold (say 0.5) in classifying whether
the prediction is a true positive or a false
positive.
Average Precision
● Let us create an over-simplified example in demonstrating the calculation of the average precision.

● In this example, the whole dataset contains five apples only. We collect all the predictions made for
apples in all the images and rank it in descending order according to the predicted confidence level.
The second column indicates whether the prediction is correct or not. In this example, the prediction
is correct if IoU ≥ 0.5.
Average Precision
● Let us take the row with rank #3 and demonstrate how precision and recall are calculated first.

● Precision is the proportion of TP = 2/3 = 0.67.

● Recall is the proportion of TP out of the possible positives = 2/5 = 0.4.

● Recall values increase as we go down the prediction ranking. However, precision has a zigzag pattern
— it goes down with false positives and goes up again with true positives.
Average Precision
● Let us plot the precision against the recall value to see this zig-zag pattern.

● The general definition for the

Average Precision (AP) is finding
the area under the precision-recall
curve.
Average Precision
● Precision and recall are always between 0 and 1. Therefore, AP falls within 0 and 1 also.

● Before calculating AP for the object detection, we often smooth out the zigzag pattern first.

● Graphically, at each recall level, we replace each precision value with the maximum precision value to
the right of that recall level.
Average Precision
● So the orange line is transformed into the green line and the curve will decrease monotonically
instead of the zigzag pattern.

● The calculated AP value will be less suspectable to small variations in the ranking.

● Mathematically, we replace the precision value for recall ȓ with the maximum precision for any recall
≥ ȓ.
Interpolated AP
● PASCAL VOC is a popular dataset for object detection.

● For the PASCAL VOC challenge, a prediction is positive if IoU ≥ 0.5. Also, if multiple detections of the
same object are detected, it counts the first one as a positive while the rest as negatives.

● In Pascal VOC2008, an average for the 11-point interpolated AP is calculated.

Interpolated AP
● First, we divide the recall value from 0 to 1.0 into 11 points — 0, 0.1, 0.2, …, 0.9 and 1.0.

● Next, we compute the average of maximum precision value for these 11 recall values.

● In our example, AP = (5 × 1.0 + 4 × 0.57 + 2 × 0.5)/11

Interpolated AP
● When APᵣ turns extremely small, we can assume the remaining terms to be zero. i.e. we do not
necessarily make predictions until the recall reaches 100%.

● If the possible maximum precision levels drop to a negligible level, we can stop.

● For 20 different classes in PASCAL VOC, we compute an AP for every class and also provide an
average for those 20 AP results.

● However, this interpolated method is an approximation which suffers two issues:

1. It is less precise.
2. Second, it lost the capability in measuring the difference for methods with low AP.

● Therefore, a different AP calculation is adopted after 2008 for PASCAL VOC.

AP (Area under curve AUC)
● For later Pascal VOC competitions, VOC2010–2012 samples the curve at all unique recall values (r₁,
r₂, …), whenever the maximum precision value drops.

● With this change, we are measuring the exact area under the precision-recall curve after the zigzags
are removed.
AP (Area under curve AUC)
● No approximation or interpolation is needed.

● Instead of sampling 11 points, we sample p(rᵢ) whenever it drops and computes AP as the sum of the
rectangular blocks.
COCO mAP
● Latest research papers tend to give results for the COCO dataset only. In COCO mAP, a 101-point
interpolated AP definition is used in the calculation.

● For COCO, AP is the average over multiple IoU (the minimum IoU to consider a positive match).

● AP@[.5:.95] corresponds to the average AP for IoU from 0.5 to 0.95 with a step size of 0.05.

● For the COCO competition, AP is the average over 10 IoU levels on 80 categories (AP@[.50:.05:.95]:
start from 0.5 to 0.95 with a step size of 0.05).
COCO mAP
● The following are some other metrics collected for the COCO dataset:
COCO mAP
● Example: the AP result for the YOLOv3 detector:

● For clarity, AP@.75 means the AP with IoU=0.75.

COCO mAP
● mAP (mean average precision) is the average of AP.

● In some context, we compute the AP for each class and average them. But in some context, they
mean the same thing. For example, under the COCO context, there is no difference between AP and
mAP. Here is the direct quote from COCO:

„AP is averaged over all categories. Traditionally, this is called “mean average precision” (mAP). We make no distinction
between AP and mAP (and likewise AR and mAR) and assume the difference is clear from context.“

● In ImageNet, the AUC method is used.

● To summarize: even all of them follow the same principle in measurement AP, the exact calculation
may vary according to the datasets. Fortunately, development kits are available in calculating this
metric.

Object Detection Metrics Explained
No ratings yet
Object Detection Metrics Explained
32 pages
Object Detection Metrics Survey
No ratings yet
Object Detection Metrics Survey
6 pages
A P I C M A P: Arallel Mplementation OF Omputing EAN Verage Recision
No ratings yet
A P I C M A P: Arallel Mplementation OF Omputing EAN Verage Recision
15 pages
Paper Survey On Performance Metrics For Object Detection Algorithms
No ratings yet
Paper Survey On Performance Metrics For Object Detection Algorithms
6 pages
Precision and Recall in Model Evaluation
No ratings yet
Precision and Recall in Model Evaluation
6 pages
Object Detection Basics and Evaluation
No ratings yet
Object Detection Basics and Evaluation
24 pages
YOLO Validation Plots and A Guide To Their Interpretation-1
No ratings yet
YOLO Validation Plots and A Guide To Their Interpretation-1
4 pages
DataLab Cup 2: Object Detection Guide
No ratings yet
DataLab Cup 2: Object Detection Guide
25 pages
Deep Learning for Object Detection
No ratings yet
Deep Learning for Object Detection
59 pages
Machine Learning Performance Metrics Guide
No ratings yet
Machine Learning Performance Metrics Guide
8 pages
Performance Indicators for Object Detection
No ratings yet
Performance Indicators for Object Detection
5 pages
Understanding Object Detection in Deep Learning
No ratings yet
Understanding Object Detection in Deep Learning
61 pages
COCO Metric Evaluation at Training Time
No ratings yet
COCO Metric Evaluation at Training Time
7 pages
Performance Metrics Deep Dive
No ratings yet
Performance Metrics Deep Dive
7 pages
Bone Facture
No ratings yet
Bone Facture
14 pages
Overview of Object Detection Evaluation Metrics - by Youssef Hosni - Towards AI
No ratings yet
Overview of Object Detection Evaluation Metrics - by Youssef Hosni - Towards AI
10 pages
Learning Best Practices For Model Evaluation and Hyper-Parameter Tuning
No ratings yet
Learning Best Practices For Model Evaluation and Hyper-Parameter Tuning
20 pages
Model Evaluation and Performance Metrics
No ratings yet
Model Evaluation and Performance Metrics
16 pages
Comprehensive YOLO Framework Review
No ratings yet
Comprehensive YOLO Framework Review
34 pages
Understanding PCA and Machine Learning Pipelines
No ratings yet
Understanding PCA and Machine Learning Pipelines
12 pages
Iai&ml Unit-5
No ratings yet
Iai&ml Unit-5
15 pages
YOLO Evolution for Tech Experts
100% (1)
YOLO Evolution for Tech Experts
31 pages
Understanding Precision and Recall Metrics
No ratings yet
Understanding Precision and Recall Metrics
9 pages
Performance Metrics
No ratings yet
Performance Metrics
8 pages
ML Model Evaluation Metrics
No ratings yet
ML Model Evaluation Metrics
11 pages
Performance Evaluation
No ratings yet
Performance Evaluation
24 pages
Evaluation of Classification Metrics
No ratings yet
Evaluation of Classification Metrics
7 pages
ML CH 5
No ratings yet
ML CH 5
45 pages
Accuracy, Precision, Recall & F1 Score Interpretation of Performance Measures
No ratings yet
Accuracy, Precision, Recall & F1 Score Interpretation of Performance Measures
5 pages
Precision and Recall
No ratings yet
Precision and Recall
20 pages
Loss Functions and Classification Metrics
No ratings yet
Loss Functions and Classification Metrics
56 pages
Evaluation Metrics: Yining Chen (Adapted From Slides by Anand Avati) May 1, 2020
No ratings yet
Evaluation Metrics: Yining Chen (Adapted From Slides by Anand Avati) May 1, 2020
31 pages
Evaluation Metrics
No ratings yet
Evaluation Metrics
25 pages
A Comprehensive Review of YOLO From YOLOv1 To YOLO
No ratings yet
A Comprehensive Review of YOLO From YOLOv1 To YOLO
27 pages
Key Evaluation Metrics Explained
No ratings yet
Key Evaluation Metrics Explained
2 pages
Understanding F1 Score, Accuracy, ROC-AUC & PR-AUC Metrics
No ratings yet
Understanding F1 Score, Accuracy, ROC-AUC & PR-AUC Metrics
10 pages
Confusion Matrix
No ratings yet
Confusion Matrix
4 pages
Classification Performance Metrics Explained
100% (1)
Classification Performance Metrics Explained
30 pages
Deep Learning for Object Detection
No ratings yet
Deep Learning for Object Detection
37 pages
Yolo
No ratings yet
Yolo
34 pages
Lec36 Obj Detn
No ratings yet
Lec36 Obj Detn
60 pages
Evaluating Models CH-3
No ratings yet
Evaluating Models CH-3
5 pages
Model Evaluation & Metrics Guide
No ratings yet
Model Evaluation & Metrics Guide
25 pages
F1 Score
No ratings yet
F1 Score
14 pages
Hamner y Frasco - 2012 - Metrics Evaluation Metrics For Machine Learning
No ratings yet
Hamner y Frasco - 2012 - Metrics Evaluation Metrics For Machine Learning
26 pages
Perf Meas
No ratings yet
Perf Meas
15 pages
Understanding Performance Metrics in AI
No ratings yet
Understanding Performance Metrics in AI
46 pages
Classification Model Evaluation Metrics
No ratings yet
Classification Model Evaluation Metrics
22 pages
Accuracy, Precision, Recall or F1 - Towards Data Science
No ratings yet
Accuracy, Precision, Recall or F1 - Towards Data Science
9 pages
Understanding Machine Learning Metrics
No ratings yet
Understanding Machine Learning Metrics
33 pages
Performance Metrics
No ratings yet
Performance Metrics
12 pages
Retrieval Evaluation Metrics Explained
No ratings yet
Retrieval Evaluation Metrics Explained
34 pages
Confusion Matrix
No ratings yet
Confusion Matrix
5 pages
Evaluation Machine Learning
No ratings yet
Evaluation Machine Learning
5 pages
6.performance Metrics
No ratings yet
6.performance Metrics
30 pages
Macro - and Micro-Averaged Evaluation
No ratings yet
Macro - and Micro-Averaged Evaluation
27 pages
Unit8 (Evaluation Method)
No ratings yet
Unit8 (Evaluation Method)
43 pages
3D Object Detection in Foggy Weather Conditions
No ratings yet
3D Object Detection in Foggy Weather Conditions
16 pages
Jafer Haider's Software Engineering Resume
No ratings yet
Jafer Haider's Software Engineering Resume
1 page
Brice Burger Mohamed Chaouch Christophe Saint-Jean Louahdi Khoudour Alain Crouzil Dr. Pierre Duthon Prof. Sergio A. Velastin
No ratings yet
Brice Burger Mohamed Chaouch Christophe Saint-Jean Louahdi Khoudour Alain Crouzil Dr. Pierre Duthon Prof. Sergio A. Velastin
1 page
Pixor: PIXOR: Real-Time 3D Object Detection From Point Clouds - CVPR 2018
No ratings yet
Pixor: PIXOR: Real-Time 3D Object Detection From Point Clouds - CVPR 2018
11 pages
Enhancing Mineral Processing With Deep Learning Automated Quartz Identification Using Thin Section Images
No ratings yet
Enhancing Mineral Processing With Deep Learning Automated Quartz Identification Using Thin Section Images
17 pages
Early Prognosis of Preeclampsia Using Machine Learning
100% (1)
Early Prognosis of Preeclampsia Using Machine Learning
349 pages
Project Report Final 5 58
No ratings yet
Project Report Final 5 58
54 pages
DATA4800 T2 2025 Assessment 03 Outline
No ratings yet
DATA4800 T2 2025 Assessment 03 Outline
17 pages
AI - Class 10 - LP - Aug - 1 To 15
No ratings yet
AI - Class 10 - LP - Aug - 1 To 15
2 pages
DiverseRAG: Enhanced QA for Arabic Dialects
No ratings yet
DiverseRAG: Enhanced QA for Arabic Dialects
14 pages
Credit Card Fraud Detection: A Comparative Study of Machine Learning and Deep Learning Methods
No ratings yet
Credit Card Fraud Detection: A Comparative Study of Machine Learning and Deep Learning Methods
12 pages
Living Off The Analyst: Harvesting Features From Yara Rules For Malware Detection
No ratings yet
Living Off The Analyst: Harvesting Features From Yara Rules For Malware Detection
11 pages
Deep Learning for Alzheimer's Stages
No ratings yet
Deep Learning for Alzheimer's Stages
16 pages
COMPX310-19A Machine Learning Chapter 3: Classification
No ratings yet
COMPX310-19A Machine Learning Chapter 3: Classification
39 pages
Authentiscan - Capstone Report Phase-II Final
No ratings yet
Authentiscan - Capstone Report Phase-II Final
32 pages
Datanot
No ratings yet
Datanot
11 pages
Enhancing Product Categorization in E-Commerce Using NLP and Machine Learning
No ratings yet
Enhancing Product Categorization in E-Commerce Using NLP and Machine Learning
6 pages
3.a.i Difference Between Regression and Classification.: Xii/Sem-Iii/Artificial Intelligence (Arti)
No ratings yet
3.a.i Difference Between Regression and Classification.: Xii/Sem-Iii/Artificial Intelligence (Arti)
40 pages
Malware Detection Using ML Techniques
No ratings yet
Malware Detection Using ML Techniques
5 pages
5 Jets56012024 22155 Other 79818 1 18 20240227
No ratings yet
5 Jets56012024 22155 Other 79818 1 18 20240227
10 pages
Image Tampering Detection Report With Alternatives
100% (1)
Image Tampering Detection Report With Alternatives
21 pages
Smart PCB Defect Detection Using CNN
No ratings yet
Smart PCB Defect Detection Using CNN
4 pages
Class 10 Artificial Intelligence Sample Paper Set 15
100% (2)
Class 10 Artificial Intelligence Sample Paper Set 15
9 pages
490 2220 2 PB
No ratings yet
490 2220 2 PB
11 pages
PDFF
No ratings yet
PDFF
15 pages
Deep Learning for Road Damage Detection
No ratings yet
Deep Learning for Road Damage Detection
11 pages
Fake News Detection
No ratings yet
Fake News Detection
5 pages
Movies Reviews Sentiment Analysis Model
No ratings yet
Movies Reviews Sentiment Analysis Model
7 pages
A Study of Network Intrusion Detection S
No ratings yet
A Study of Network Intrusion Detection S
27 pages
Lab Manual - DM
No ratings yet
Lab Manual - DM
56 pages
Data Mining: Frequent Pattern Analysis
No ratings yet
Data Mining: Frequent Pattern Analysis
34 pages
ML Yifan Lu REPORT
No ratings yet
ML Yifan Lu REPORT
22 pages
Final To
No ratings yet
Final To
20 pages
An Intelligent Hexapod Robot For Inspection of Airframe Components Oriented by Deep Learning Technique
No ratings yet
An Intelligent Hexapod Robot For Inspection of Airframe Components Oriented by Deep Learning Technique
15 pages

Understanding mAP in Object Detection

Uploaded by

Understanding mAP in Object Detection

Uploaded by

mAP for Object Detection

Slides modified, courtesy of Jonathan Hui

● F1 score is a measure combining Precision and Recall.

● In some datasets, we predefine an IoU

● Precision is the proportion of TP = 2/3 = 0.67.

● Recall is the proportion of TP out of the possible positives = 2/5 = 0.4.

● The general definition for the

● In Pascal VOC2008, an average for the 11-point interpolated AP is calculated.

● In our example, AP = (5 × 1.0 + 4 × 0.57 + 2 × 0.5)/11

● However, this interpolated method is an approximation which suffers two issues:

● Therefore, a different AP calculation is adopted after 2008 for PASCAL VOC.

● For clarity, AP@.75 means the AP with IoU=0.75.

● In ImageNet, the AUC method is used.

You might also like