Professional Documents
Culture Documents
Abstract—As the failure of power line insulators leads to the OAD-BSPK Orientation angle detection and binary shape
failure of power transmission systems, an insulator inspection prior knowledge.
system based on an aerial platform is widely used. Insulator BRISK Binary robust invariant scalable keypoints.
defect detection is performed against complex backgrounds in
aerial images, presenting an interesting but challenging problem. VLAD Vector of locally aggregated descriptors.
Traditional methods, based on handcrafted features or shallow- SVM Support vector machine.
learning techniques, can only localize insulators and detect faults WMRA Wavelet multiresolution analysis.
under specific detection conditions, such as when sufficient prior CPLID Chinese power line insulator dataset.
knowledge is available, with low background interference, at cer- RPN Region proposal network.
tain object scales, or under specific illumination conditions. This
paper discusses the automatic detection of insulator defects using ReLU Rectified linear units.
aerial images, accurately localizing insulator defects appearing ILN Insulator localizer network.
in input images captured from real inspection environments. DDN Defect detector network.
We propose a novel deep convolutional neural network (CNN) ROI Region of interest.
cascading architecture for performing localization and detect- VOC2007 Visual object classes challenge 2007.
ing defects in insulators. The cascading network uses a CNN
based on a region proposal network to transform defect inspec- DOST Discrete orthogonal s-transform.
tion into a two-level object detection problem. To address the IoU Intersection over union.
scarcity of defect images in a real inspection environment, ACF Aggregated channel features.
a data augmentation method is also proposed that includes four SSD Single shot multibox detector.
operations: 1) affine transformation; 2) insulator segmentation YOLO You only look once.
and background fusion; 3) Gaussian blur; and 4) brightness
transformation. Defect detection precision and recall of the GAN Generative adversarial nets.
proposed method are 0.91 and 0.96 using a standard insula-
tor dataset, and insulator defects under various conditions can
be successfully detected. Experimental results demonstrate that
this method meets the robustness and accuracy requirements for I. I NTRODUCTION
insulator defect detection.
NSULATORS are essential equipment used to electri-
Index Terms—Aerial image, convolutional neural network
(CNN), data augmentation, defect detection, insulators.
I cally isolate and mechanically secure wires in high-
voltage power transmission systems [1], [2]. Insulator failure
is a direct threat to the stability and safety of power transmis-
sion lines. Statistically, accidents caused by insulator defects
account for the highest proportion of power system failures.
N OMENCLATURE Therefore, intelligent and timely detection of insulator defects
CNN Convolutional neural networks. is particularly important.
UAV Unmanned aerial vehicles. In the last decade, many studies have investigated automat-
ing the task of visually inspecting power transmission
Manuscript received July 7, 2018; accepted September 11, 2018. This systems [3], [4]. As aerial platforms such as helicopters
work was supported in part by Science Challenge Project under Grant
TZ2018006-0204-02, and in part by the National Natural Science Foundation and UAVs have become available, and demonstrate high
of China under Grant 61703399, Grant 61503376, Grant 61421004, and efficiency, accuracy, and safety, they have become an impor-
Grant 61673383. This paper was recommended by Associate Editor L. Wang. tant tool in the inspection of power equipment (see Fig. 1) in
(Corresponding author: Xian Tao.)
X. Tao, D. Zhang, X. Liu, and D. Xu are with the Research Center of recent years. A camera mounted on the platform is able to
Precision Sensing and Control, Institute of Automation, Chinese Academy of obtain many aerial images that include insulator information.
Sciences, Beijing 100190, China (e-mail: taoxian2013@ia.ac.cn). At present, insulator defect inspection is performed manu-
Z. Wang is with the Sino-European Institute of Aviation Engineering, Civil
Aviation University of China, Tianjin 300300, China. ally based on such aerial images, and as a result, highly
H. Zhang is with the School of Information Science and Technology, Hainan efficient and accurate detection is difficult. To overcome the
Normal University, Haikou 571158, China. limitations of human inspection, automated defect detection
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org. techniques [5]–[7] need to be developed to assist or replace
Digital Object Identifier 10.1109/TSMC.2018.2871750 human decisions.
2168-2216 c 2018 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TAO et al.: DETECTION OF POWER LINE INSULATOR DEFECTS USING AERIAL IMAGES ANALYZED WITH CNNs 3
(a) (b)
network DDN detects all the missing caps from the cropped of image patches are cropped from the original images. The
images. We introduced RPN [17] to our model to compute sizes of these cropped images are expected to be the small-
proposals effectively. RPN is designed to quickly produce est rectangular area, which contains one or more complete
a collection of detection boxes that may contain objects in insulators.
an image. In the first network, this detection box may con- Crop Module: The output of the ILN is a confidence score,
tain the region of the insulator, and in the second network along with the left-top and bottom-right corner vertex coor-
such a box may contain the area of the missing cap. The RPN dinates of the boxes containing insulators. The insulators are
is inserted after the last convolutional layer. A small window cropped out of the input images at these coordinates. Then,
slides across the convolutional feature map in order to classify these cropped images are sent to the DDN to find the missing
objects and move the regressing bounding box. In the RPN, cap defects. The ILN is powerful in that a tiny insulator may
three scales and three aspect ratios are used, resulting in nine be found in the image. However, these tiny insulators may be
different object proposals. It is more efficient to obtain the so small that determination of any abnormality may be diffi-
detection box directly on the feature map than to determine it cult. Hence, before sending to the DDN, the size of cropped
from an input image. To train the RPN, the loss function for images must be determined to be sufficiently large. To this
an aerial image can be defined as end, the equivalent edge length ratio R is defined as
1 1 ∗
L({pi }, {ti }) = Lcls pi , p∗i + λ pi Lreg ti , ti∗
Ncls Nreg H×W
i i
R= (3)
(2) (x2 − x1 )(y2 − y1 )
where i is the object proposal index, and pi represents the
ground truth label. The label p∗ i is the ith object’s pro- where H and W are the height and width of the input image,
posal, and if the proposal is for an object, p∗ i = 1, but [x1 , y1 ] and [x2 , y2 ] are the coordinates of the bottom-right and
otherwise is p∗ i = 0. ti is a four-dimensional vector that rep- top-left vertices of the box that is output from the insulator
resents the coordinates of the detection box, and t∗ i is the localizer. Then, we define a threshold Rt , that is used to decide
labeled detection box. Lcls and Lreg are the logarithmic loss whether to crop the insulator. This procedure accelerates the
and regression loss functions, respectively, and Ncls and Nreg following defect detection process.
are the number of classes and detected boxes, and with λ, DDN: The original images are cropped according to the
these three parameters are used to normalize the two loss detected box in the ILN as the inputs of the DDN. The network
functions. at this stage is aware of its previous iteration. In this stage, the
ILN: In the front-end of the model, we use the convolu- defects are smaller relative to the insulator patches, allowing
tion layers of VGG-16 [18], which consists of a stack of a more accurate network to be designed in the DDN. ResNet-
3 × 3 convolution and 2 × 2 max-pooling layers, to gen- 101 [19] convolution layers are used instead of those from
erate the feature maps. These feature maps are fed into the VGG-16 because of the former’s more numerous available
RPN. In the RPN, each 3 × 3 area of the feature maps network layers. In contrast to the ILN, which is based on
becomes a feature with 512 dimensions by applying a con- VGG-16 and only has fully connected layers followed by RPN
volution layer and ReLU. The 512-dimensional features are and ROI Pooling layers, the DDN inserts its RPN and ROI
fed into two fully connected layers to perform box classifi- Pooling layers between two sets of convolution layers. This
cation and regression. Based on the box information, another enables improved detection accuracy at a cost of additional
pooling is applied to the feature map produced from the VGG- computing time. For this reason, ResNet was chosen as the
16 convolution layers, called ROI pooling. From this step, the front end for the DDN. The output of the DDN must not only
features extracted in the VGG-16 convolution layers can be to provide the defect location, but also mark the corresponding
shared. In the end, the pooling result is equipped with two fully defective insulator. Thus, two outputs are generated: a defect
connected layers for box classification and more precise box box (with confidence) to locate the defect, and an insulator
border regression. In the inference stage of the ILN, a number annotated with its status.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TAO et al.: DETECTION OF POWER LINE INSULATOR DEFECTS USING AERIAL IMAGES ANALYZED WITH CNNs 5
Fig. 5. Data augmentation process. An affine transformation is applied to the image initially, and then the insulator is separated from the original background,
and merged with the new background. Then, the region containing the insulator is cropped from the artificial image. Blur and brightness changes may be
added to the image before labeling the defects and adding to the dataset.
IV. DATA AUGMENTATION shear, rotation, and reflection transforms of the image, and t
The State Grid Corporation of China provided aerial images controls the translation transform.
of insulators. A common problem in defect detection is that Transformations applied to the images include scaling,
defect datasets are normally too small to train CNN. Owing shearing, rotating, flipping, and translating, replicating how
to the limited size of the training samples, augmenting the a UAV might capture photographs of the same insulator from
data was necessary in order to avoid overfitting. Data augmen- different angles. With the affine transformation, we can change
tation generates new datasets using existing data. A pipeline the size, direction, and position of insulators in the original
demonstrating how augmented samples are generated is shown image. These newly generated images can be used as train-
in Fig. 5, and includes four operations: 1) affine transfor- ing samples. In this paper, the affine transformation is applied
mation; 2) insulator segmentation and background fusion; randomly within a certain range. After the transform, points
3) Gaussian blur; and 4) brightness transformation. Each step lying outside the boundaries of the input are filled with the
simulates different imaging environments, including changes values of their nearest points.
in perspective, background, blur, and lighting.
B. Insulators Segmentation and Background Fusion
A. Affine Transformation An aerial image containing insulators can be divided into
A color aerial image can be described by a function defined two segments by human vision: 1) the foreground (insulators)
on a two-dimensional finite region. Mathematically, we have and 2) the background (other structure). However, the limita-
tion of an affine transformation is that it cannot change the
I : → r, g, b ∈ {0, . . . , 255}3
T relative position of insulators and the background. To solve
x, y → I(x, y) (4) this problem, we generate samples by adding insulators to
background images.
where
Background images are collected from real aerial scenes
T
= x, y : 0 ≤ x < W, 0 ≤ y < H (5) without any insulators. This portion of the algorithm seg-
ments the insulator contours from the complex aerial images,
is the image region, W is the width, and H is the height of and then fuses the insulator back into the background images
image I. We use p = [x, y]T to denote the image point or pixel without insulators.
at location (x, y) and I(x, y) = I(p) = [r, g, b]p to denote 1) Insulator Segmentation: Since the background of an
the intensity of the color image at point p. For convenience, aerial image is complex, an insulator cannot be directly seg-
homogeneous coordinates T p = [x, y, 1]T describe point p mented using prior knowledge of colors or gradients. The
in the image. An affine transformation for an image can be TV-based algorithm [20] is an image segmentation method
defined as that is based on an energy functional and a total variation
model, and is an interactive method, as the foreground and
p̄ = Ap̄ (6)
background must be marked manually, as shown in Fig. 6.
where When the TV-based method is used to segment the insula-
⎡ ⎤ tor image, the final segmentation result is closely related to
m11 m12 t1
M t the detail level of the mark. As more marks are applied, bet-
A= = ⎣ m21 m22 t2 ⎦. (7)
0 1 ter results can be obtained. Thus, significant effort is spent in
0 0 1
marking each image.
Here, p̄ are the homogeneous coordinates of the point after The fully convolution network U-Net [21] is applied
transformation and p̄ = [x , y , 1]T . M controls the scaling, for automatically segmenting the insulators. The U-Net is
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TAO et al.: DETECTION OF POWER LINE INSULATOR DEFECTS USING AERIAL IMAGES ANALYZED WITH CNNs 7
Fig. 7. Image segmentation comparison. The top row used the TV-based method, and the bottom row used the U-Net. (a) Original image to be segmented.
(b) Insulator segmentation results. (c) Detail of the segmentation result. (c) Edge of the insulator is not complete when using the TV-based algorithm.
Fig. 9, the defect boxes start with the adjacent cap on the left
(a) (b) of the missing cap and end with the neighboring cap on the
right of the missing cap. The defective image of this labeled
Fig. 8. Examples of training samples. (a) Positive sample. (b) Negative mode is very similar to the dumbbell structure. In a complex
sample.
aerial image, it is more representative than the purely labeled
TABLE I missing area.
PARAMETER S ETTING OF DATA AUGMENTATION 4) Defect Detection Criteria (Evaluation Criteria): We
adopted three widely used measures to quantitatively eval-
uate the performance of our defect detection method:
precision (Pr ), recall (Re ), and F1 . A higher evaluation value
indicates better performance and vice versa
TP
Pr = (9)
G TP + FP
TP
Re = (10)
TP + FN
2 ∗ Pr ∗ Re
cropped images of all negative samples were used for training F1 = (11)
Pr + Re
and testing the DDN. Seven hundred and ninety-two images
where TP (True Positive) and FP (False Positive) denote the
for training and 264 images for testing were carried out in
number of correctly and incorrectly located defects, respec-
the DDN for defect detection (location). Most of the neg-
tively. TP + FP is the total number of located defects, and
ative sample images are obtained through the method of
TP + FN is the total number of actual defects. F 1 is the
data augmentation in Section IV. The parameter settings of
harmonic mean of precision and recall.
data augmentation are shown in Table I.
The region of the missing cap is small compared with the
For the ground truth in training sets and test sets, detec-
entire aerial image, so we calculate the above criteria under
tion regions (insulator strings and defects) were labeled with
the IoU threshold. The IoU is defined as
“insulator” and “defect” by the boxes, respectively, which
Area(GT ∩ PM )
are determined by the coordinates of the left-top corner IoU(GT , PM ) = (12)
and right-bottom corner. These labeled results are in the Area(GT ∪ PM )
VOC2007 format [23]. The insulator boxes are the rectangular where GT is the ground truth and PM is the located box,
areas, which contain the complete insulator areas. As shown in respectively. For each located box, its IoU scores to all ground
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TABLE II
T RAINING PARAMETER S ETTING OF ILN
(a) (b)
TABLE III Fig. 10. Location results of aerial images. (a) Defect-free insulator strings.
T RAINING PARAMETER S ETTING OF DDN (b) Defective insulator strings.
TABLE IV
P ERFORMANCE OF ILN AND DDN
TAO et al.: DETECTION OF POWER LINE INSULATOR DEFECTS USING AERIAL IMAGES ANALYZED WITH CNNs 9
TABLE V
D ETECTION P ERFORMANCE FOR D IFFERENT M ETHODS colors, etc.) and Adaboosting classifiers to implement the
sliding-window detector. By using the feature pyramid to
accelerate the detection process, it can reach 30 ft/s on a sin-
gle core processor. However, its detection accuracy does not
meet our requirements. Since the aggregated channel feature
is a hand-crafted feature, it is not as robust as the learning
feature. Adaboosting models are essentially shallow networks
compared to deep learning networks, so the ACF detector per-
forms poorly in the expressing the features of insulator defects.
Meanwhile, distinguishing between normal or defective insula-
tors was also beyond the capacity of the classification method.
The feature expression of missing caps in the cropped images
was weak, since the missing caps were too small. The shapes
of the insulators in different cropped images were also dif-
ferent. Owing to the limitation of the number of insulator
samples, it is also not possible to train classification networks
from scratch. As a result, many defective insulators are diffi-
cult to distinguish from normal insulators, which results in the
Fig. 11. Effect analysis of the cascade architecture. majority of defective insulators being classified as defect-free.
Unfortunately, Faster R-CNN shows poor performance in
terms of precision and recall. This is not surprising since it
3) ILN+ACF Detector [25]: To compare with a tradi- is difficult to distinguish the 40 × 40-pixel objects in the
tional machine learning method, we also trained the ACF 4608 × 3456-pixel raw input image. Owing to the pooling
and Adaboosting models to detect and locate the missing layer in the CNN shrunking the image, the low-level layers
cap. In the comparative experiment, the detection module contain less semantic information about the objects. This will
was also built based on our ILN to locate insulator strings cause the missing cap area to be easily misdetected or missed
and an ACF-based detector was used to locate the missing using the single network. The precision and recall obtained by
cap. An ACF-based detector is a traditional state-of-the-art the method in [27] are close to ours, and a cascaded object
method in the field of pedestrian detection. Piotr’s Computer detection framework was also used. However, both of its detec-
Vision MATLAB Toolbox [26] was used to implement the tion frameworks are existing frameworks that will miss some
ACF detector as a baseline. The ACF detector was trained small objects.
with a sliding window size of 48 × 48 pixels and 4096 weak As shown in Fig. 12, an intuitive comparison result is
classifiers. presented among the five methods. The green box denotes
4) Cascaded DNN Detector: In [27], the cascaded DCNN- the normal insulator, while the defective insulators and the
based detection method was proposed in a coarse-to-fine man- defects are bounded by the red box. Each box is associated
ner, including two detectors (SSD and YOLO) to sequentially with its category and a confidence score. Fig. 12(a) shows that
localize the cantilever joints and their fasteners. This cascaded Faster R-CNN misjudges the end insulator region as a missing
DNN detector was used to detect the missing cap with the cap. Fig. 12(b) shows that ILN+CNN successfully locates all
Python implementation in the comparative experiment. insulators, but misclassifies all of the insulators as defect-free
Table V and Fig. 11 show the quantitative comparison of insulators. Fig. 12(c) shows that the ACF detector locates the
five methods applied to the defective insulator images. The missing cap successfully, but it seems like that more extra-
best performance is highlighted in bold. It can be seen that neous false results are obtained. Fig. 12(d) shows that the
our proposed method achieved the best defect detection accu- Cascaded DNN detector [27] successfully detected the missing
racy. The values of precision, recall and F 1 of the DDN cap, but it missed the insulator in the lower left-hand corner
are 0.91, 0.958, and 0.933, respectively. The cascaded DNN of the image and inaccurately positioned the insulator in the
detector also exhibits good performance in both precision lower right-hand corner of the image. Fig. 12(e) shows that
and recall. Compared with the two cascaded methods, the the proposed method, which combines ILN and DDN, has
single object detection framework (Faster R-CNN) and clas- a good performance. As can be seen from Figs. 11 and 12, it
sification method have achieved poor detection results. The is necessary to locate the insulator defects in stages, and the
ILN+ACF architecture is superior in detection-time consump- cascading structure can accurately detect the missing cap of
tion, but it shows very poor performance in terms of precision. the insulator from the aerial image under a complex scene. The
Meanwhile, the recall of the ILN+CNN method is the lowest defect detection of the insulator seems to be a classification
among the five methods. task, but it can be better performed by detection in our appli-
Owing to the complicated backgrounds and different atti- cation. This is because if the classification method is used, the
tudes of the insulators, locating the insulator defects is entire insulator region must be used as a complete pattern for
beyond the capacity of traditional machine learning methods. classification. In this case, a large number of real defective
Generally, the ACF detector has a large speed advantage. It samples are needed, the defects of which locate at different
uses aggregated multichannel features (including gradients, positions with different sizes. In the absence of a sufficient
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
(d) (e)
Fig. 12. Five insulator defect detection results. (a) Faster R-CNN. (b) ILN+CNN. (c) ILN+ACF. (d) Cascaded DNN. (e) Proposed method.
TABLE VI
dataset, DDN is designed for defect detection. The pattern of P ERFORMANCE OF T RAINING DATASET U SING
a defective area with the dumbbell structure is small, so that D IFFERENT AUGMENTATION
fewer samples are required. Moreover, the features are auto-
matically learned by the network, rather than the features being
hand-crafted as is in the ACF detector, so its effect is better
than that of both CNN and ACF detector.
TAO et al.: DETECTION OF POWER LINE INSULATOR DEFECTS USING AERIAL IMAGES ANALYZED WITH CNNs 11
Fig. 13. Insulator defects detection results. The green box denotes the normal insulator, the defective insulators and the defects are bounded by the red box.
Each box is associated with its category and a confidence score.
insulators in different cropped images do not have unified scale the normal insulator and the red box represents defective insu-
and shape. The missing cap defects are too small compared lators or defects. The confidence is shown on the top left
to the original image in the Faster-RCNN. Therefore, simply of each box. As can be seen from Fig. 13, our method can
increasing the amount of data is not enough to improve the successfully detect most of the insulator defects in cluttered
detection accuracy of the above two models. backgrounds. Moreover, there are some defect-free insulators
and high-voltage power tower in Fig. 13(a), (j), and (t), which
have a similar feature with the insulator defects. No insulator
D. Results on the CPLID defect was found in these images, proving that the proposed
To demonstrate the effectiveness of our method, we evalu- method has a strong ability to exclude similar interference.
ated it in the CPLID as well. This portion of insulator defects Overall, the proposed approach has significant potential for
detection results is shown in Fig. 13. The green box denotes a wide range of applications. However, detection of defective
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
insulators through aerial images is still not sufficiently based [15] M. J. B. Reddy, C. B. Karthik, and D. K. Mohanta, “Condition mon-
on data-driven methods. In a future study, we will utilize the itoring of 11 kV distribution system insulators incorporating complex
imagery using combined DOST-SVM approach,” IEEE Trans. Dielectr.
GAN to generate defective insulator images. Electr. Insul., vol. 20, no. 2, pp. 664–674, Apr. 2013.
[16] Z. Zhao, G. Xu, Y. Qi, N. Liu, and T. Zhang, “Multi-patch deep features
for power line insulator status classification from aerial images,” in Proc.
Int. Joint Conf. Neural Netw. (IJCNN), Jul. 2016, pp. 3187–3194.
VI. C ONCLUSION [17] S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: Towards
In this paper, a novel deep CNN cascading architecture is real-time object detection with region proposal networks,” IEEE Trans.
Pattern Anal. Mach. Intell., vol. 39, no. 6, pp. 1137–1149, Jun. 2017.
presented for both localization and defect detection of insu- [18] N. Silberman, D. Hoiem, P. Kohli, and R. Fergus, “Indoor segmentation
lators. Defect inspection is converted to a two-level object and support inference from RGBD images,” in Proc. Eur. Conf. Comput.
detection problem based on the cascading CNN. To remedy Vis. (ECCV), Sep. 2012, pp. 746–760.
[19] J. Dai, Y. Li, K. He, and J. Sun, “R-FCN: Object detection via region-
the small number of defect images in real inspection environ- based fully convolutional networks,” in Proc. Conf. Neural Inf. Process.
ments, a data augmentation methodology is proposed, and we Syst. (NIPS), Dec. 2016, pp. 379–387.
describe how to use it to improve the insulator defect detec- [20] M. Unger, T. Pock, W. Trobin, D. Cremers, and H. Bischof, “TVSeg-
interactive total variation based image segmentation,” in Proc. Brit.
tion performance of CNNs. The proposed methodology can Mach. Vis. Conf. (BMVC), Sep. 2008, pp. 44–46.
significantly contribute to a new horizon of deep learning for [21] O. Ronneberger, P. Fischer, and T. Brox, “U-Net: Convolutional
the detection of defects using aerial images. The defect detec- networks for biomedical image segmentation,” in Proc. Int. Conf. Med.
Image Comput. Comput. Assist. Intervent., Oct. 2015, pp. 234–241.
tion precision and recall of the proposed method are 0.91 [22] T. Chen et al., “MXNet: A flexible and efficient machine learning
and 0.96 using our standard insulator dataset. Experimental library for heterogeneous distributed systems,” arXiv preprint arXiv:
results have shown that our method can meet the robustness 1512.01274, 2015.
[23] M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and
and accuracy requirements of insulator defect detection. A. Isserman. The PASCAL Visual Object Classes Challenge 2007
(VOC2007) Results. Accessed: May 20, 2017. [Online]. Available: http://
www.pascal-network.org/challenges/VOC/voc2007/workshop/index.html
R EFERENCES [24] I. E. Nordeng, A. Hasan, D. Olsen, and J. Neubert, “DEBC detection
with deep learning,” in Proc. Scandinavian Conf. Image Anal. (SCIA),
[1] K.-C. Park, Y. Motai, and J. R. Yoon, “Acoustic fault detection technique May 2017, pp. 248–259.
for high-power insulators,” IEEE Trans. Ind. Electron., vol. 64, no. 12, [25] P. Dollar, R. Appel, S. Belongie, and P. Perona, “Fast feature pyramids
pp. 9699–9708, Dec. 2017. for object detection,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 36,
[2] Y. Zhai, D. Wang, M. Zhang, J. Wang, and F. Guo, “Fault detection of no. 8, pp. 1532–1545, Aug. 2014.
insulator based on saliency and adaptive morphology,” Multimedia Tools [26] P. Dollar. (2006). Piotr’s Computer Vision MATLAB Toolbox (PMT).
Appl., vol. 76, no. 9, pp. 12051–12064, May 2017. [Online]. Available: http://pdollar.github.io/toolbox/
[3] C. Martinez, C. Sampedro, A. Chauhan, and P. Campoy, “Towards [27] J. Chen, Z. Liu, H. Wang, A. Núñez, and Z. Han, “Automatic defect
autonomous detection and tracking of electric towers for aerial power detection of fasteners on the catenary support device using deep con-
line inspection,” in Proc. Int. Conf. Unmanned Aircraft Syst. (ICUAS), volutional neural network,” IEEE Trans. Instrum. Meas., vol. 67, no. 2,
May 2014, pp. 284–295. pp. 257–269, Feb. 2018.
[4] C. Sampedro, C. Martinez, A. Chauhan, and P. Campoy, “A supervised [28] X. Yu, X. Wu, C. Luo, and P. Ren, “Deep learning in remote sensing
approach to electric tower detection and classification for power line scene classification: A data augmentation enhanced convolutional neural
inspection,” in Proc. Int. Joint Conf. Neural Netw. (IJCNN), Jul. 2014, network framework,” GISci. Remote Sens., vol. 54, no. 5, pp. 741–758,
pp. 1970–1977. May 2017.
[5] H. Zhao, R. Dai, and C. Xiao, “A machine vision system for stacked sub-
strates counting with a robust stripe detection algorithm,” IEEE Trans.
Syst., Man, Cybern., Syst., to be published.
[6] Y.-P. Huang, L. Sithole, and T.-T. Lee, “Structure from motion technique Xian Tao received the M.Sc. degree in mechan-
for scene detection using autonomous drone navigation,” IEEE Trans. ical and automotive engineering from the
Syst., Man, Cybern., Syst., to be published. China University of Mining and Technology,
[7] A. Çağlar, Y. Rezaeitabar, S. Dogru, and L. Ulusoy, “Railway fas- Beijing, China, in 2013 and the Ph.D. degree in
tener inspection by real-time machine vision,” IEEE Trans. Syst., Man, control theory and control engineering from the
Cybern., Syst., vol. 45, no. 7, pp. 1101–1107, Jul. 2015. Institute of Automation, Chinese Academy of
Sciences (IACAS), Beijing, in 2016.
[8] Z. Zhao, N. Liu, and L. Wang, “Localization of multiple insulators by
He is currently an Assistant Professor with the
orientation angle detection and binary shape prior knowledge,” IEEE
Research Center of Precision Sensing and Control,
Trans. Dielectr. Electr. Insul., vol. 22, no. 6, pp. 3421–3428, Dec. 2015.
IACAS. His current research interests include
[9] B. Li, D. Wu, Y. Cong, Y. Xia, and Y. Tang, “A method of insulator
deep learning and automated surface inspection for
detection from video sequence,” in Proc. 4th Int. Symp. Inf. Sci. Eng.
industry.
(ISISE), Dec. 2012, pp. 386–389.
[10] S. Liao and J. An, “A robust insulator detection algorithm based on local
features and spatial orders for aerial images,” IEEE Geosci. Remote Sens.
Lett., vol. 12, no. 5, pp. 963–967, May 2015. Dapeng Zhang received the B.Sc. and M.Sc.
[11] Z. Zhao, G. Xu, and Y. Qi, “Representation of binary feature pooling for degrees in mechatronic engineering from the Hebei
detection of insulator strings in infrared images,” IEEE Trans. Dielectr. University of Technology, Tianjin, China, in 2003
Electr. Insul., vol. 23, no. 5, pp. 2858–2866, Oct. 2016. and 2006, respectively, and the Ph.D. degree in
[12] M. Oberweger, A. Wendel, and H. Bischof, “Visual recognition and mechatronic engineering from the Beijing University
fault detection for power line insulators,” in Proc. 19th Comput. Vis. of Aeronautics and Astronautics, Beijing, China, in
Win. Workshop, Feb. 2014, pp. 1–8. 2011.
[13] Q. Wu and J. An, “An active contour model based on texture distribu- He is currently an Associate Professor with
tion for extracting inhomogeneous insulators from aerial images,” IEEE the Research Center of Precision Sensing and
Trans. Geosci. Remote Sens., vol. 52, no. 6, pp. 3613–3626, Jun. 2014. Control, Institute of Automation, Chinese Academy
[14] V. S. Murthy, K. Tarakanath, D. K. Mohanta, and S. Gupta, “Insulator of Sciences, Beijing, and the University of the
condition analysis for overhead distribution lines using combined Chinese Academy of Sciences, Beijing. His current research interests include
wavelet support vector machine (SVM),” IEEE Trans. Dielectr. Electr. robotics and automation, in particular, medical robot and virtual robotic
Insul., vol. 17, no. 1, pp. 89–99, Feb. 2010. surgery.
This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination.
TAO et al.: DETECTION OF POWER LINE INSULATOR DEFECTS USING AERIAL IMAGES ANALYZED WITH CNNs 13
Zihao Wang received the B.Sc. degree in control Hongyan Zhang received the B.S. and M.S. degrees
science and engineering from the Civil Aviation in applied physics and telecommunication engi-
University of China, Tianjin, China, in 2015, where neering from Xidian University, Xi’an, China, in
he is currently pursuing the M.Sc. degree in con- 2000 and 2003, respectively, and the Ph.D. degree
trol science and engineering with the Sino-European in control theory and control engineering from
Institute of Aviation Engineering. the Institute of Automation, Chinese Academy of
His current research interests include computer Science, Beijing, China, in 2011.
vision and deep learning. He is currently with the School of Information
Science and Technology, Hainan Normal University,
Haikou, China. His current research interests
include applied mathematics, computer vision, and
nondestructive testing.