Professional Documents
Culture Documents
Sciencedirect: © 2018, Ifac (International Federation of Automatic Control) Hosting by Elsevier Ltd. All Rights Reserved
Sciencedirect: © 2018, Ifac (International Federation of Automatic Control) Hosting by Elsevier Ltd. All Rights Reserved
77
IFAC MMM 2018
78
Shanghai, China, August 23-25, 2018 Jiangyun Li et al. / IFAC PapersOnLine 51-21 (2018) 76–81
We constructed an all convolutional YOLO network with Loss function Aiming at the ease of optimization, the
27 convolutional layers. The first 25 convolutional layers YOLO detection network uses the sum-squared error in
are used to extract steel strip surface defect features, the loss function. However, the sum-squared error weights
while the last two convolutional layers predict the defect localization error equally with classification error which
categories and bounding boxes. The network structure is does not perfectly align with the goal of maximizing
shown in Fig. 4. average precision. In every image many grid cells dont
contain any defects. This pushes the confidence scores
Similar to (Lin et al., 2013), our network simply uses of no-defect cells towards zero, often overpowering the
continual 3 × 3 convolutional layer with 1 × 1 reduction gradient from those defect grid cells, which can lead the
layer followed. The continual 3×3 filters extract the defect model instability.
features from input images, and the 1 × 1 kernels are
used to reduce the feature space of the previous feature To remedy this, the YOLO network increases the loss from
map. Learning from (Springenberg et al., 2014)(Radford bounding box coordinate predictions and decrease the loss
et al., 2015), max pooling layers can be replaced by from confidence predictions for no-defect boxes. YOLO
convolution layers with stride of 2 without loss in accuracy network uses two parameters (λcoord = 5, λno def ect = 0.5)
on several image recognition benchmarks. Besides, the to accomplish this.
convolution layers allow the network to learn its own In order to improve the detection effect on small-scale
spatial downsampling rather than deterministic spatial defects, YOLO network increases the proportion of errors
downsampling. In this paper, we replaced the max-pooling in the bounding box of the small-scale defects by increasing
functions of original network with 3 × 3 (stride = 2) the square difference information of the width and height
convolutional functions and achieved a slightly increase of the bounding box in the loss function. The optimized
in accuracy of 0.6%. loss function is as follows (2):
The network predicts defect categories information and
bounding box information on the 13 × 13 feature maps 2
(S = 13). This is sufficient for large-scale defects. However,
s
B
78
IFAC MMM 2018
Shanghai, China, August 23-25, 2018 Jiangyun Li et al. / IFAC PapersOnLine 51-21 (2018) 76–81 79
79
IFAC MMM 2018
80
Shanghai, China, August 23-25, 2018 Jiangyun Li et al. / IFAC PapersOnLine 51-21 (2018) 76–81
detection results with traditional methods, the results are 4.3 The detection rate
shown in Table 3.
When only needs to detect the defects without classifying
As can be seen in Table 3, the YOLO detection network
the categories, our network achieves 99% detection rate,
is significantly higher in classification accuracy than the
with only 1% defects missed. With the growth and ac-
shallow neural network in (Masci et al., 2012) and the
cumulation of online defect data, the performance of our
SVM classifier in (Chu et al., 2017), slightly higher than
network can be further improved.
the Tetrolet transform in (Ke et al., 2016) and the hybrid
chromosomal genetic algorithm in (Hu et al., 2016). Since
4.4 Defect scale accuracy
the number of defect images in this experiment far exceeds
the number in these referrences, the defect features that
our network extracted have better generalization. According to the experiments, our network can detect
defects with a minimum area of 10 square millimeters.
80
IFAC MMM 2018
Shanghai, China, August 23-25, 2018 Jiangyun Li et al. / IFAC PapersOnLine 51-21 (2018) 76–81 81
all convolutional YOLO detection network. The results generative adversarial networks. arXiv preprint arX-
show that our network achieves a 97.55% mAP, 95.86% iv:1511.06434.
recall rate and 99% detection rate. The network provides Redmon, J., Divvala, S., Girshick, R., and Farhadi, A.
an end-to-end detection solution for strip surface defect, (2016). You only look once: Unified, real-time object
and achieves a detection speed of 83 FPS, making the real- detection. In Proceedings of the IEEE conference on
time detection of strip surface defects more effective. computer vision and pattern recognition, 779–788.
Redmon, J. and Farhadi, A. (2016). Yolo9000: better,
The improved YOLO detection network can predict loca-
faster, stronger. arXiv preprint, 1612.
tion and the scale information of surface defects on the
Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster
entire strip production line, which is of great significance
r-cnn: Towards real-time object detection with region
for improving the product quality of the strip steel produc-
proposal networks. In Advances in neural information
tion. In the case of obtaining more types and quantities
processing systems, 91–99.
of strip surface defect data, this method can be further
Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus,
improved in detection accuracy.
R., and LeCun, Y. (2013). Overfeat: Integrated recog-
nition, localization and detection using convolutional
REFERENCES networks. arXiv preprint arXiv:1312.6229.
Springenberg, J.T., Dosovitskiy, A., Brox, T., and Ried-
Bengio, Y., Courville, A., and Vincent, P. (2013). Rep- miller, M. (2014). Striving for simplicity: The all con-
resentation learning: A review and new perspectives. volutional net. arXiv preprint arXiv:1412.6806.
IEEE transactions on pattern analysis and machine in- Xu, K., Ai, Y.H., Zhou, P., and Yang, C.L. (2013). Recog-
telligence, 35(8), 1798–1828. nition of surface defects in continuous casting slabs
Chu, M., Zhao, J., Gong, R., and Liu, L. (2017). Steel based on contourlet transform. Journal of University
surface defects recognition based on multi-label classifier of Science Technology Beijing, 35(9), 1195–1200.
with hyper-sphere support vector machine. In Control Zhang, X. (2011). Vision inspection of metal surface
And Decision Conference (CCDC), 2017 29th Chinese, defects based on infrared imaging. Acta Optica Sinica,
3276–3281. IEEE. 31(3), 0312004.
Girshick, R. (2015). Fast r-cnn. arXiv preprint arX-
iv:1504.08083.
Guo, H., Shao, W., and Zhou, A. (2017). Novel defect
recognition method based on adaptive global threshold
for highlight mental surface. Chinese Journal of Scien-
tific Instrument, 38(11), 2797–2804.
Hu, H., Liu, Y., Liu, M., and Nie, L. (2016). Surface defect
classification in large-scale strip steel image collection
via hybrid chromosome genetic algorithm. Neurocom-
puting, 181, 86–95.
Ioffe, S. and Szegedy, C. (2015). Batch normalization:
Accelerating deep network training by reducing internal
covariate shift. In International conference on machine
learning, 448–456.
Ke, X.U., Lei, W., and Wang, J. (2016). Surface defect
recognition of hot-rolled steel plates based on tetrolet
transform. Journal of Mechanical Engineering.
Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012).
Imagenet classification with deep convolutional neural
networks. In Advances in neural information processing
systems, 1097–1105.
LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep
learning. nature, 521(7553), 436.
Lin, M., Chen, Q., and Yan, S. (2013). Network in network.
arXiv preprint arXiv:1312.4400.
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B.,
and Belongie, S. (2017). Feature pyramid networks for
object detection. In CVPR, volume 1, 4.
Ma, Y., Li, Q., He, F., Yan, L., and Xi, S. (2017). Adap-
tive segmentation algorithm for metal surface defects.
Chinese Journal of Scientific Instrument.
Masci, J., Meier, U., Ciresan, D., Schmidhuber, J., and
Fricout, G. (2012). Steel defect classification with max-
pooling convolutional neural networks. In Neural Net-
works (IJCNN), The 2012 International Joint Confer-
ence on, 1–6. IEEE.
Radford, A., Metz, L., and Chintala, S. (2015). Unsuper-
vised representation learning with deep convolutional
81