Professional Documents
Culture Documents
Abstract— Plant diseases are a principal threat to the safety of II. RELATED WORK
food. In agriculture sectors, it is the greatest challenge to
identify plant diseases. The state-of-the-art Convolutional Plant diseases recognition has been discussed over the
Neural Network (CNN) gives excellent results to solve image years. Many researchers have developed many suitable
classification tasks in computer vision. Transfer Learning architectures with the help of machine learning techniques
enables us to develop a deep CNN network in a most cost and gave their thoughts to detect plant diseases.
effective way. In this work, a Transfer Learning based CNN Sindhuja et al. [1] described a fast, cost effective and
model was developed for the identification of plant diseases reliable health monitoring sensor. To monitor plant health
precisely. The dataset, we have used is consists of 70295 training and diseases, they represented various technologies that have
images and 17572 validation images holding 38 different classes
of plant leaves images. We have focused mainly on ResNet50
been used to detect the plant diseases. Waldchen et al [2]
network, a popular CNN architecture as our pre-trained model published a review of plant disease detection with the help of
in Transfer Learning. Additionally, several Transfer Learning computer vision techniques. Their review comprised of
architectures were experimented with few other popular pre- almost 120 literature researches and also a broad scale
trained models (VGG16, VGG19, AlexNet) and compared with description of datasets was given.
the proposed model. The proposed model has given the best Erika et al. [3] proposed a four layers CNN model which
performance of 99.80 % training accuracy. contains 7 types of diseases along with healthy cucumber
leaves. They remarked good and bad condition of images and
Keywords— CNN, Artificial intelligence, Transfer Learning,
found an average accuracy of 82.3%. Powara et al in [4]
Plant disease, Pattern recognition, ResNet50.
compared some handcrafted feature descriptors techniques
I. INTRODUCTION with CNN models. Their comparison includes Histogram of
Oriented Gradients (HOG) based features combined with k-
In the world, there are 570 million farms. More than 90%
people are related to the farming. Farmers occupy a large share Nearest Neighbors (KNN), and HOG-BOW integrated with
of the 80% of the food. SVM and MLP classifiers. They compared these models with
AlexNet and GoogleNet, both trained from scratch. Piyush et
Food security is threatened by a number of factors such as al. [5] used color based techniques to segment particular
climate change, degeneration in pollinators, plant diseases etc. region of interest in the images of plants. In this study, YcbCr,
Plant diseases are an impendence to safety of food as well as HIS and CIELB color models were used for detecting disease
disastrous consequences for farmers. To prevent plant spots. In the Reference [6], few texture features such as
diseases, farmers waste their financial and other resources. In exhaustion, homogeneity and congruence were obtained.
this changing environment, we need to identify these diseases. They calculated the grizzle level form on images and linked
Some diseases are difficult to recognize with their visible
it with color selection and discovered maize leaf diseases.
symptoms through human eye. In this situation, we need to
use technology to detect these diseases. Otherwise people
have to depend on imported food which will increase the Sachin D. Khirade, A.B. Patil [7] applied some
expenditure as well as can lead to health risk for people. There segmentation techniques and extracted features from leaf
are some applications such as fungicide, disease-specific images and then implemented neural network as a classifier
chemical, and pesticide application. These applications could with back propagation. Authors in [8] developed K-Means
be carried if we have the early information on plant diseases. technique and a pre-trained neural network based model for
By using these applications, we can control diseases and leaf and stem disease detection. Melike et al. [9] presented a
flourish our productivity. CNN based model and Learning Vector Quantization (LVQ)
algorithm based method for the detection of tomato leaf
In this case, CNNs can be used in detecting plant diseases. diseases and classification. Authors in [10] applied transfer
CNN is one of the most powerful technique in pattern learning approach with inception_v3 and mobilenet
recognition with large amount of data. CNN benefits with very architectures and showed that inception_v3 finished with
promising result to detect these diseases. In previous works, better accuracy.
various classification architectures of CNNs were used to
detect diseases. In this paper, we have developed a deep learning model by
the process of Transfer Learning, where the pre-trained
Remaining paper is arranged as follows: Section II weights have been collected from the popular ResNet50
describe previous related works. Methodology has been model. These added layers in our proposed model mainly
explained in Section III. Section IV holds experimental result benefits to the feature extraction process with minimum
and analysis. Concluding remarks is in Section V. computational cost. Additionally, Fine-Tuning has been
performed to improve the detection accuracy. The dataset in
2500
NUMBER OF IMAGE
2000
1500
1000
500
0
600
Number of images
500
400
300
(a) (b) (c) 200
100
0
Corn(maize)he…
Grape…
strawberry…
Tomato…
Tomato…
Orange…
Pepper…
Soyabean…
Apple scab
Cherry powder
potato healthy
Tomato heathy
Apple healthy
Fig. 5. Some augmentation of pepper bel bacterial spot leaf image: (a)
original, (b) rotation, (c) right shift, (d) vertical shift, (e) rotation and right
shift, (f) left shift.
D. ResNet50
AlexNet, AlexNetOWTBn, GooLeNet, Overfeat, VGG
models are more common in transfer learning. They stacked
many convolutional layers. There are some difficulties with
deep CNN networks: optimization of the networks, vanishing
gradient problem, and the degradation problems.
The Residual network (ResNet) brings a new idea. It
benefits to solve complicated tasks and also increases the
detection accuracy. ResNet tries to solve the difficulties in
training process of deep CNN, the saturation and degradation
of accuracy. In this paper, we have used ResNet50
architecture. ResNet50 had 50 layers of residual networks.
Fig. 6 shows ResNet50 architecture.
The ResNet50 structure has different groups of identical
layers as indicated by different colors in the Fig. 6. The curve
lines represent the identify blocks that is used to indicate the
use of previous layers in the following layers. It is the key
difference in ResNet50 that counterfeit problem of vanishing Fig. 6. ResNet50 architecture
or exploding gradients, degradation problem (accuracy first
saturates and then degrades) in training very deep networks. F. Fine-Tuning
In the Fig. 6, the first layer has 64 filters with the kernal Fine-tuning is used for increasing the efficiency of a
size of 7×7, which is followed by a maxpooling layer of size function. It makes small modification to improve the
3×3. The first group of layers (as indicated by gray color) outcome. The adjustment process is so crucial that some
consists of three identical blocks. In the same way group two, change affects the training process a lot with respect to the
group three, and group four have 4 indentical blocks, 4 computation time needed, the speed of convergence and the
identical blocks and 3 indentical blocks respectively. In use of processing units. This process of fine tuning was
between some groups, the curves marked with blue color, repeated over again and again to improve the accuracy of our
represent the identity block that connects two layers of model. These are listed in the Table I.
different sizes.
TABLE I. PARAMETERS SETUP FOR THE PROPOSED MODEL
After all these blocks, there is a total 38 fully connected
layers responsible for the clasification task. In our proposed Parameter Value
model, we did not use these fully connected layers. Batch size 32
E. Performed Tests Steps per epoch 550
We have done many testings in different experimental Epoch 25
setups to analysis the performances of the proposed model. Validation steps 1
Several network parameters have been changed through the Optimizer SGD (stochastic gradient descent)
instruction made to the model. We have split the total dataset Learning rate default
into 80% for training purpose and 20% for validation purpose. Decay default
Then, we have experimented the dataset with our proposed Momentum default
IV. RESULTS AND DISCUSSION In our experiment with proposed model, images were
Our model was instructed to start training with the resized into 224×224. Then the augmentation was performed.
We used the weights of pre-trained ResNet50 model. We used
training dataset comprised of both the actual images and the
stochastic gradient descent (SGD) optimizer, softmax
images from augmentation. Then validation has been activation and batch size equals 32. In this model, learning
performed to generalize the model. Fig. 7 shows a good rate, decay, momentum was set to default value. In this way,
convergence of proposed network in the training and we were able to increase the number of classes where most of
validation phases. Although, there is a single low peak in the the previous works include less number of classes of diseases.
validation curve, it shows a 100% validation accuracy for the
remaining major portion of the curve. Then, we applied the ResNet50 architecture to the
classifier on the new plant diseases dataset and then tested the
Furthermore, Fig. 8 shows the spreading of losses (both performance of the model using test images. We experimented
training loss and validation loss) with respect to number of by changing weights of the pre-trained ResNet50 model. This
epoch in the training and validation phases. The curve in Fig. was used to compare our model with other popular pre-trained
8 emphasizes how many images were accurately identified in networks in transfer learning. We analyzed to find out which
the validation process. one is the most appropriate pre-trained network for this
dataset. We compared with three models known as VGG16,
VGG19, AlexNet. The results of these experiments are shown
in Table II. It shows that model with ResNet50 has high
accuracy as well as low loss. The most promising result is the
minimum time required in completing every step.
Lastly, 33 images were used in the testing phase from a
total number of 38 classes. Some predicted images are shown
in Fig. 9. In that testing, our proposed models exhibit 100%
testing accuracy with 33 true classifications out of 33 images.
Fig. 10 shows the accuracy and losses of AlexNet,
VGG19, VGG16, ResNet50 models in the training and
validation process. Here we see that, ResNet50 model
achieved the highest success rate. The losses and convergence
Fig. 7. Accuracy distribution in the training process
time are also lower than the models with other pre-trained
networks.
Fig. 9. Some predicted images with the proposed model (out of 33 images).
TABLE III. CONVERGENCE COMPARISON WITH THREE MODELS USING
DIFFERENT PRE-TRAINED NETWORKS
REFERENCES
[1] S. Sankaran, A. Mishra, R. Ehsani, and C. Davis, “A review of
advanced techniques for detecting plant diseases,” Computers and
Electronics in Agriculture, vol. 72, no. 1, pp. 1–13, 2010.
[2] J. Wäldchen, P. Mäder, “Plant Species Identification Using Computer
Vision Techniques: A Systematic Literature Review,” Archives of
Computational Methods in Engineering, Vol. 25, Issue 2, pp 507–543,
April 2018.
[3] E. Fujita, Y. Kawasaki, H. Uga, S. Kagiwada, and H. Iyatomi, “Basic
investigation on a robust and practical plant diagnostic system,” 15th
IEEE International Conference on Machine Learning and Applications
(ICMLA 2016), December 2016.
[4] P. Pawara, E. Okafor, O. Surinta, L. Schomaker, and M. Wiering,
“Comparing Local Descriptors and Bags of Visual Words to Deep
Convolutional Neural Networks for Plant Recognition,” 6th
International Conference on Pattern Recognition Applications and
Methods (ICPRAM 2017), pages 479-486, 2017.
[5] P. Chaudhary, A. K. Chaudhari, A. N. Cheeran, and S. Godara, “Color
transform based approach for disease spot detection on plant leaf,”
International Journal of Computer Science and Telecommunications,
vol. 3, no. 6, pp. 65–69, 2012.
[6] J. K. Patil and R. Kumar, “Feature extraction of diseased leaf images,”
Journal of Signal & Image Processing, vol. 3, no. 1, p. 60, 2012.
[7] S. D. Khirade, A.B. Patil, “Plant Disease Detection Using Image
Processing,” 2015 International Conference on Computing
Communication Control and Automation, Feb. 2015.
[8] D. A. Bashish, M. Braik, S. B. Ahmad, “A framework for detection and
classification of plant leaf and stem diseases,” International Conference
on Signal and Image Processing, Dec. 2010.
[9] M. Sardogan, A. Tuncer, Y. Ozen, “Plant Leaf Disease Detection and
Classification Based on CNN with LVQ Algorithm,” 3rd International
Conference on Computer Science and Engineering (UBMK), Sept.
2018.
[10] G. Suresh, V. Gnanaprakash, R. Santhiya, “Performance Analysis of
Different CNN Architecture with Different Optimisers for Plant
Disease Classification”, 5th International Conference on Advanced
Computing & Communication Systems (ICACCS), March 2019.
[11] Salathegroup, "plantvillage_deeplearning_paper_dataset", (2016),
GitHub repository . [Online]. Available:
https://github.com/salathegroup/plantvillage_deeplearning_paper_dat
aset. [Accessed: 10- May- 2019].
[12] Mohanty, S.P., Hughes, D.P., Salathé, M., 2016. Using deep learning
for image-based plant disease detection. Front. Plant Sci. 7
http://dx.doi.org/10.3389/fpls.2016. 01419. Article: 1