Professional Documents
Culture Documents
Mahadi Final Presentation
Mahadi Final Presentation
Supervised By
Presented By
Dr. Kaushik Deb
Md. Mahadi hasan
Professor
1604020
Dept. of CSE, CUET
(a) (b)
Figure1: Detection and classification of road damage. (a) Test image from RDD-2020 dataset,
(b) Test image from CUET Road.
[3] D. Arya et al., "Transfer learning-based road damage detection for multiple
countries, "arXiv preprint arXiv:2008.13101, 2020
Smart Application
https://www.tno.nl/media/2360/onderhoud_weg_240.jpg?rnd=1
https://gcn.com/-/media/GIG/GCN/Redesign/Articles/2018/May/automatedvehicle.png
https://onlinelibrary.wiley.com/doi/abs/10.1111/mice.12387 5 Department of CSE, CUET
Literature Survey
No Methodology Outcome Limitations
[1] They used MobileNet-v2 Bounding boxes on damage Failed to detect
as a backbone transfer reign multiple overlapping
learning model damage region in
They used SSD as Classification of these same image
detection head damage region
[1] D. Arya et al., ‘Transfer learning-based road damage detection for multiple countries,’ CoRR, vol. abs/2008.13101, 2020. arXiv:
2008.13101. [Online].Available: https://arxiv.org/abs/2008.13101
[2] J. Singh and S. Shekhar, ‘Road damage detection and classification in smartphone captured images using faster R-CNN,’
CoRR, vol. abs/1811.04535,2018. arXiv: 1811.04535. [Online]. Available: http://arxiv.org/abs/1811.04535. .
[3] H. Maeda, Y. Sekimoto, T. Seto, T. Kashiyama and H. Omata, ‘Road damage detection using deep neural networks with
images captured through a smartphone,’ CoRR, vol. abs/1801.09454, 2018. arXiv: 1801.09454
[5] Q. Zhang, X. Chang and S. B. Bian, ‘Road-damage-detection segmentation algorithm based on improved mask rcnn,’
IEEE Access, vol. 8, pp. 6997–7004, 2020. doi:10.1109/ACCESS.2020.2964055.
[7] They used a modified version of Bounding boxes on They used only 573
DenseNet-121 named Proposed damage region images for training
Net as a backbone transfer Generate mask in and 120 images for
learning model damage region testing
They used Mask R-CNN
algorithm as detection head
[6] P. Kumar, A. Sharma and S. R. Kota, ‘Automatic multiclass instance segmentation of road damage using deep learning
model,’ IEEE Access,vol. 9, pp. 90 330–90 345, 2021. doi: 10.1109/ACCESS.2021.3090961.
[7] S. Shim and G.-C. Cho, ‘Lightweight semantic segmentation for road-surface damage detection based on multiscale
learning,’ IEEE Access, vol. 8, pp. 102 680–102 690, 2020. doi: 10.1109/ACCESS.2020.2998427.
Image Source:
http://www.roadsafetyuae.com/assets/uploads/2014/12/Topic-Pic-Bad-Weather.jpg
https://paperswithcode.com/media/thumbnails/task/task-0000000480-281654b9.jpg
22500
17500
12500
7500
2500
D00 D10 D20 D40 Total
India 1555 68 2021 3187 6831
Czech 988 399 161 197 1745
Japan 4049 3979 6199 2243 16470
Total 6592 4446 8381 5627 25046
850
750
650
550
450
350
250
150
50
D00 D10 D20 D40 Total
No of labels 123 92 278 336 829
Figure 7: CUET road sample images. Figure 8: Dataset description of CUET road dataset.
Hyperparameters:
Batch Size: 4
Optimizer: SGD with Momentum
Loss Function: Sparse Categorical Cross Entropy
Learning Rate: 0.01
IoU threshold: 0.6
Steps: 25000
(a) ( b)
Figure 9: Evaluation of Faster RCNN with ResNet-50 using RDD-2020 dataset.(a) Confusion Matrix,
(b) performance score.
(a) (b)
Figure 10: Evaluation of Faster RCNN with ResNet-50 using CUET road images.(a) Confusion Matrix, (b)
performance score.
Figure 12: Test images result using Faster RCNN with ResNet-50. (a) Image from Japan,
(b) Image from India, (c) Image from Czech.
Figure 11: CUET road images test result using Faster RCNN with ResNet-50.
(a) & (b) Classify damage region correctly, (c) Incorrectly classified damage
region.
0.85
0.65
0.45
0.25
0.05
Precission for Recall for D00 F1 score for Precission for Recall for D10 F1 score for Precission for Recall for D20 F1 score for Precission for Recall for D40 F1 score for
D00 D00 D10 D10 D20 D20 D40 D40
Faster 0.47 0.48 0.48 0.52 0.66 0.58 0.43 0.71 0.54 0.41 0.18 0.24
RCNN
with
Resnet-50
Faster 0.58 0.39 0.47 0.42 0.39 0.41 0.29 0.72 0.41 0.57 0.25 0.35
RCNN
with
Resnet-
101
SSD with 0.25 0.19 0.22 0.59 0.17 0.26 0.25 0.64 0.36 0.1 0.06 0.07
Mobilenet
V1
SSD with 0.44 0.44 0.44 0.04 0.01 0.01 0.32 0.74 0.45 0.15 0.07 0.1
Mobilenet Faster RCNN with Resnet-50 Faster RCNN with Resnet-101 SSD with Mobilenet V1 SSD with Mobilenet V2 SSD with Resnet-50 EfficientDet
V2
SSD with 0.23 0.12 0.15 0 0 0 0.22 0.79 0.35 0.12 0.04 0.06
Resnet-50
Efficient- 0.63 0.09 0.15 0.26 0.18 0.21 0.2 0.86 0.33 0.5 0.03 0.06
Det
Figure 14: Bar chart representation of performance score on RDD-2020 Dataset’s images of six
models.
22 Department of CSE, CUET
Results & Discussion
Optimizers:
Table-2: F1 score of Faster RCNN with ResNet-50 for 4 classes on three different Optimizers
Model F1 Score mAP Numeber of GFLops
params (M)
Figure 16: Bar chart representation of performance score on RDD-2020 Dataset of six models in
different batch size.
[1] D. Arya et al., ‘Transfer learning-based road damage detection for multiple
countries,’ CoRR, vol. abs/2008.13101, 2020. arXiv: 2008.13101. [Online].
Available: https://arxiv.org/abs/2008.13101
1.1
0.9
0.7
0.5
0.3
0.1
Precission for Recall for D00 F1 score for Precission for Recall for D10 F1 score for Precission for Recall for D20 F1 score for Precission for Recall for D40 F1 score for
D00 D00 D10 D10 D20 D20 D40 D40
Faster 0.38 0.26 0.31 0.25 0.05 0.09 0.35 0.44 0.39 0.54 0.61 0.57
RCNN with
Resnet-50
Faster 0.39 0.1 0.16 0.48 0.11 0.18 0.38 0.85 0.52 0.54 0.24 0.33
RCNN with
Resnet-101
SSD with 0.16 0.14 0.15 0.27 0.04 0.07 0.37 0.85 0.52 0.45 0.09 0.14
Mobilenet
V1
SSD with 0.36 0.11 0.16 0.64 0.2 0.3 0.37 0.87 0.52 0.54 0.19 0.28
Mobilenet
V2
SSD with 0.42 0.04 0.07 0 0 0 0.34 0.99 0.51 0.06 0 0.01
Resnet-50
Faster RCNN with Resnet-50 Faster RCNN with Resnet-101 SSD with Mobilenet V1 SSD with Mobilenet V2 SSD with Resnet-50 EfficientDet
EfficientDet 0.29 0.06 0.1 0.14 0.01 0.02 0.33 0.79 0.47 0.5 0.2 0.29
Build a Dataset
• Make a small dataset of CUET Road
• Evaluate six different models with CUET road
images
Figure 18: Detection and classification result (a) Result from paper [1], (b) Result using Faster
RCNN with ResNet-50, (c) Result using Faster RCNN with ResNet-101
Future Work:
Train our models with large
Bangladeshi dataset
Figure 20: Conversion from xml to csv file Figure 21: Conversion from csv to TfRecord file
Table 5: Performance score of six models on different batch size using RDD-2020 dataset
Figure 25: Evaluation of faster RCNN with ResNet-101 in RDD-2020 test images. (a) Confusion
Matrix (b)ROC curve (c) performance score
Figure 26: Evaluation of SSD with MobileNet-v1 in RDD-2020 test images. (a) Confusion
Matrix (b)ROC curve (c) performance score
Figure 27: Evaluation of SSD with MobileNet-v2 in RDD-2020 test images. (a) Confusion
Matrix (b)ROC curve (c) performance score
Figure 28: Evaluation of SSD with ResNet-50 in RDD-2020 test images. (a) Confusion Matrix
(b)ROC curve (c) performance score
Figure 29: Evaluation of EfficientDet in RDD-2020 test images. (a) Confusion Matrix (b)
ROC curve (c) performance score
Figure 30: Evaluation of Faster RCNN with ResNet-101 in CUET test images. (a) Confusion
Matrix (b) performance score
48 Department of CSE, CUET
Appendix –Performance on SSD with MobileNet-v1
Evaluate with CUET Road images-
Batch Size: 4
Optimizer: SGD with Momentum
Loss Function: Sparse Categorical Cross Entropy
Learning Rate: 0.01
IoU_threshold: 0.6
Epochs: 25000
Momentum_optimizer_value: 0.9
Figure 31: Evaluation of SSD with MobileNet v1 in CUET test images. (a) Confusion Matrix (b)
performance score
49 Department of CSE, CUET
Appendix –Performance on SSD with MobileNet v2
Evaluate with CUET Road images-
Batch Size: 4
Optimizer: SGD with Momentum
Loss Function: Sparse Categorical Cross Entropy
Learning Rate: 0.01
IoU_threshold: 0.6
Epochs: 25000
Momentum_optimizer_value: 0.9
Figure 32: Evaluation of SSD with MobileNet v2 in CUET test images. (a) Confusion Matrix (b)
performance score
50 Department of CSE, CUET
Appendix –Performance on SSD with ResNet-50
Evaluate with CUET Road images-
Batch Size: 4
Optimizer: SGD with Momentum
Loss Function: Sparse Categorical Cross Entropy
Learning Rate: 0.01
IoU_threshold: 0.6
Epochs: 25000
Momentum_optimizer_value: 0.9
Figure 33: Evaluation of SSD with ResNet-50 in CUET test images. (a) Confusion Matrix (b)
performance score
51 Department of CSE, CUET
Appendix –Performance on EfficientDet
Evaluate with CUET Road images-
Batch Size: 4
Optimizer: SGD with Momentum
Loss Function: Sparse Categorical Cross Entropy
Learning Rate: 0.01
IoU_threshold: 0.6
Epochs: 25000
Momentum_optimizer_value: 0.9
Figure 34: Evaluation of EfficientDet in CUET test images. (a) Confusion Matrix (b)
performance score
52 Department of CSE, CUET
Appendix
Why F1 score is used ?
Precision:
Precision tells us how many of the correctly predicted cases actually turned out to
be positive.
This would determine whether our model is reliable or not
Recall:
Recall tells us how many of the actual
positive cases we were able to predict
correctly with our model.
also known as Sensitivity or true positive rate
n=10K records