You are on page 1of 5

2021 International Conference on Information and Communication Technology

or Sustainable Development (ICICT4SD), 27-28 February, Dhaka


2021 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD) | 978-1-6654-1460-9/21/$31.00 ©2021 IEEE | DOI: 10.1109/ICICT4SD50815.2021.9396986

Rice Disease Detection using Intensity Moments


and Random Forest
Sristy Saha, Sk. Md. Masudul Ahsan
Department of Computer Science and Engineering (CSE)
Khulna University of Engineering & Technology (KUET)
Khulna, Bangladesh
Email: saha1807565@stud.kuet.ac.bd, smahsan@cse.kuet.ac.bd

Abstract—Improvement of an automated method for recog- obligatory in the first step of recognizing the paddy leaf
nizing and categorizing various plant diseases is an evolving disease. This system is involved with the recognizable proof
research area. Usually, it is very time-consuming to recognize of the rice leaf diseases, which is consists of pre-processing of
plant diseases in remote areas, because of the communication gap
between the farmer and the specialist. A programmed layout can the image, converting the RGB images into gray images. Then
help a farmer to discern rice plant diseases. The automatic system intensity moments extraction, and lastly classification of the
that is referred to here can detect the main three types of rice leaf diseases. Different categories of image formats like JPEG,
leaf diseases (Bacterial leaf blight, Leaf blast, and Brown spot) JPG, PNG, GIF, BMP, etc. farmers can have when they capture
by the Random Forest decision tree classifier. I n tensity moments images from the paddy field. So the collection of various kinds
are needed here for extracting features properly. This proposed
system obtains 91.47% accuracy and can classify rice diseases of rice leaf disease images is the first task. Color processing is
nicely in their primary stage. By adding some more collaborative supposed the most significant work in image preparation and
features, the obtained result can assist the developer to rapidly significant perception for identifying various classes. In the
identify plant diseases. This will also help the agriculturalists in pre-processing step of an image, resizing the image, remove
active decision-taking for defending the plant professionally from noise, or segmentation those calculation is essential often.
ample harm.
Index Terms—Plant diseases, Decision tree, Image processing, Texture features, structure features, and geometric features
Intensity moments, Random Forest are usually well known to all. Rice leaf image represents
one or more features. Normally for feature extraction feature
I. I NTRODUCTION vector is created using Mean, Standard Deviation, Variance,
Midrange, IQR, and Median. The classification technique is
Rice is the most significant n o u rishment h a r vests o n the propounding for recognizing whether the images are affected
planet, especially in Bangladesh. In Bangladesh farmers usu- by diseases or not. Various rice leaf diseases can purposive
ally go through with lots of problems in their rice plant. Un- by some classifiers like SVM, Random Forest, and Euclidean
perceived infections of paddy can prompt significant h a r m to distance using KNN. Machine learning and image processing
harvest development and eventually diminished the creation of technique can smoothly identify three types of rice leaf disease
yields. Various sicknesses influence t he g rowth o f p addy, three in an early stage.
primary diseases that usually happen in the plant: Bacterial
Leaf Blight of rice, Blast, and Brown spot of rice. Blight, II. L ITERATURE R EVIEW
Blast, and Brown spot diseases consisting of some similar Joshi and Jadhav [1] had shown four types of rice disease
characteristics. Someone can easily be perplexed by viewing detection by developing a prototype. K-Nearest Neighbor
all these same features of plant diseases. Above all crops can (KNN) and Minimum Distance Classifier (MDC) are applied
be influenced b y s i c knesses a t s e v eral p h a ses. E a r ly s i g ns of
to categorize different features such as the color of an affected
some plant diseases is not an easy task to find b y t he farmers. portion of the leaf, and the shape of the leaf. Here affected
They are not attentive to suitable controlling to cure their rice region cropped part is under consideration. Four types of
disease. If farmers can identify conditions of diseases smoothly rice diseases finding system are initiated by Narmadha and
at the primary stage it will be so helpful for them. By this Arulvadivu [2]. Paddy leaf disease detection and identification
identification, f armers c an s ave s taple f ood, r ice. W ithout rice, of the affected part is calculated by K-means techniques. Pre-
many people cannot stay on a single day. Most of the countries processing is designed to remove noise, which is compulsory.
economy is mostly dependent on rice as well. So saving rice Image segmentation, feature extraction, and classification of
from various diseases is very helpful for both life and the leaves, these three phases are required in [3] proposed by
economy. author Deshmukh et al. In their experiment, the Artificial
For this reason, an appreciative leaf disease recognition pro- Neural Network (ANN) is conducted for categorizing rice leaf
cess of rice is a super necessary task for both life and economy. diseases. A software prototype was defined by Phadikar and
In the work of recognizing the leaf image, different schemes Sil [4]. This method was for rice disease finding, using zoomed
already can identify diseases nicely. Few undertakings are images. A Self-Organizing Map (SOM) neural network is

978-1-6654-1460-9/21/$31.00 ©2021 IEEE


166

Authorized licensed use limited to: East Carolina University. Downloaded on June 15,2021 at 13:40:09 UTC from IEEE Xplore. Restrictions apply.
applied for identifying the infected part of the rice leaf.
The digitized color image of paddy leaf disease was offered by
Anthonys and Wickramarachchi [5]. Initially, texture, shape,
color features of the color image are extracted from rice leaf.
Mathematical morphology help to identify rice leaf disease.
They extracted the spot of rice leaf by the process of segmenta-
tion. For better classification of plant leaf diseases membership
function is required in their work. Using a machine-learning
algorithm a robust rice disease detection process was proposed
by Ramesh and vydeki [6]. Here proposed architecture relates
the finding process of the rice leaf disease. Images of healthy
rice leaf and blast disease rice leaf both are mandatory in
their offered method. They only discuss rice blast disease
identification processes in their work. Elangovan and Nalini
[7] submitted an image processing technique for noticing
different leaf diseases. SVM classifier and K-means clustering
were connected for leaf disease classification. Mangla et al. [8]
had suggested a rice leaf detection strategy applying machine
learning and image processing. SVM help in finding the
correct classification of disease.
Bashir et al [9] had revealed a research tool that detects
rice leaf diseases and can suggest cure respective diseases.
This is based on the image-processing approach to investigate
and categorize three types of rice leaf diseases. This method
classifies the disease on the plant leaf using an accomplished
classifier. Khirade and Patil [10] discussed some piece removal Fig. 1: Proposed architecture of rice leaf disease detention.
and arrangement systems to extract the features from the
diseased leaf. SVM classifier, Self-organizing feature map,
One-third of images (i.e. 176 images) from the rice leaf
and backpropagation algorithm, etc. are linked here for clas-
dataset attained for testing the disease. Though intensity mo-
sification of the disease. Rice disease finding process using
ments are not novel, their usage is not well explored. In this
outline and color texture features had suggested by Yeo et
system, we experimented with various intensity moments as a
al [11] This disease classification process was based on the
feature vector to figure out the best combination for detecting
SVM classifier. Phadikar et al. [12] developed a computerized
rice disease for this very specific dataset.
system that has been used to classify the rice leaf brown spot
and the blast diseases. Morphological changes are noticed in
A. Training Phase
the rice leaf if itâs triggered by the diseases. In their proposed
method SVM classifier is proficient in classifying the rice leaf Image Accusation: The dataset [18] contains three types
diseases. Suman and Dhruvakumar had suggested appropriate of rice leaf diseases (Blast, Blight, and Brown spot) images
preprocessing techniques [13], using histogram plots in normal are used in the work. The provided dataset is of optimum
and diseased leaves. Features like shape and color are required age of 3 to 4 weeks. The reason behind it is a leaf which
for grouping the leaf diseases using the SVM classifier. is too old or too young isn’t suitable for detecting diseases
Many authors have already work on plant leaf disease de- perfectly. In fig 2 some sample of rice leaf disease is shown.
tection techniques. But this proposed method is done on Image Pre-processing: Displaying larger images sometimes
the leaf diseases primary stage with high accuracy. This makes the problem in storage. So resizing this image into
paper is arranged by section III proposed methodology, IV 300*300 pixels and converting the RGB image into a
experimental results, and discussion, and finally in section V grayscale image is required.
concluded the paper. Feature Extraction: When the input data containing ample
records with a large input data size then data can modify into
III. P ROPOSED M ETHOD a set of features for better calculation [14]. There are several
This system is for diagnosing rice leaf disease contains some methods to extract color features from images. The color
processes like - first acquisition of the image of rice leaf, pre- moment is an effective and easy color feature functioning
process the image, extracting features from those images, and process. Its math foundation lies that any color distribution
classification of the image according to disease name. Datasets can show with its moment [14]. Here for feature extraction
are split-up into two sets. Two-third of data from the dataset intensity moment is under consideration for getting results in
i.e. total of 352 images (276 original images and the remaining an effective and meaningful way.
are augmented from) are avail of training.

167

Authorized licensed use limited to: East Carolina University. Downloaded on June 15,2021 at 13:40:09 UTC from IEEE Xplore. Restrictions apply.
(a) Blast (b) Blight (c) Brown spot

(d) Blast (e) Blight (f) Brown spot


Fig. 4: System architecture of random forest decision tree.

Fig. 2: Some sample images from the dataset [18].


B. Testing Phase
Using the equivalent processes of the training phase the
feature of the query images are extracted. After Extracting
these features, a feature vector is erected for the query image.
This feature vector is redirecting to a classifier name random
forest decision tree classifier. Row sampling and feature
sampling of rice leaf datasets are consigning through decision
trees. That classifier can easily classify what type of disease
is containing in the query image.

Classification using Random Forest: Random forest is


a supervised ensemble learning algorithm that is applied de-
ployed for both classifications as well as regression problems.
Usually, the forest is consists of trees and more trees mean
a more robust forest. Likewise, the random forest algorithm
generates decision trees on data models and then gets the
estimate of the diseases from each of them. Many separate
learners are created by this ensemble learning bagging method.
Some trees may be incorrect, multiple trees will be correct as a
result. Thus every decision tree has high variance. Aggregation
is generated for predictions using the CART (Classification
Fig. 3: Result of search process for selecting feature. and Reintegration Tree) algorithm. [15]. So as a congregation
the trees are capable of taking in the proper direction. Fig
4, represents the system architecture of the random forest
decision tree. When the system combines all the decision
Seven intensity moments i.e. Mean, Median, Mode, trees regarding majority vote, thus high variance will get
Midrange, Range, IQR, and Standard deviation are used for converted through low variance. Because when row sampling
extracting the features from the image applying respective and feature sampling are proceeding into the decision tree,
formulas. Calculating these seven features a total of 127 com- it tends to become an expert for these specific rows of the
binations (7C1 +7C2 +7C3 +7C4 +7C5 +7C6 +7C7 ) is evaluated dataset they have. A total of 352 images of sample features
for each classifier (KNN, SVM, and random forest), which from training data went through the decision tree. Before that,
is called an exhaustive search process. An exhaustive search we calculate which features are best for this detection process
process assists to intuit which combination is best for this of rice leaf disease. Now, this process will able to give the
dataset. It also helps to find out which classifier is suitable approximately correct detection, when some query images
for obtaining the best result. Fig 3 shows that Mean, Mode, send to this system. Finally, by the calculation of majority
Range, and IQR are sufficient to get the best result using a voting, what types of diseases the leaf is containing come out.
random forest classifier.

168

Authorized licensed use limited to: East Carolina University. Downloaded on June 15,2021 at 13:40:09 UTC from IEEE Xplore. Restrictions apply.
Fig. 5: Random forest decision tree for classifying rice leaf diseases.

TABLE I: ACCURACY OF THE SYSTEM


The left branch value and another branch value is just above
or equal to that threshold value. Mean, Mode, Range, and IQR Disease Type Accuracy Overall Accuracy
these four features respectively denote as x1, x2, x3, and x4 Blast 86.66%
are utilized to represent the decision nodes of the decision tree. Blight 93.44% 91.47%
Brown spot 94.54%
And leaf nodes 1, 2, and 3 respectively present Blight, Blast,
and Brown spot disease which is in fig 5.
TABLE II: EXPERIMENTAL RESULT

IV. R ESULT AND D ISCUSSION


Disease Type Precision Recall
The training phase and testing phase are the two main parts Blast 0.867 0.962
Blight 0.934 0.919
of this work. In the training phase, did extract the features Brown spot 0.945 0.867
of the rice leaf image. Then the feature vector is created and Overall Result 0.914 0.916
these features are stored in the database as training elements.
The testing phase allows to find out how well the images are TABLE III: CONFUSION MATRIX FOR SYSTEM EVALUATION
trained. So, generating the feature vector, the query image is
sent to the random forest decision tree classifier for identifying Predicted Classes
diseases. Correctly classify and misclassification of the Blast,
Blast Blight Brown spot
Blight, and Brown spot diseases are respectively 51, 57, 52,
Actual Classes

and 8, 4, 3. So, the accuracy of each leaf disease and the Blast 0.867 0.050 0.083
overall accuracy is shown in Table I. Precision and Recall of
Blight 0.016 0.934 0.049
the outcome of individual disease and the overall accuracy of
the system are exposed in Table II. 91.47% accuracy attained Brown spot 0.018 0.036 0.945
in this proposed system which is pretty good for classifying
diseases. If the affected part of the leaf is under consideration
this accuracy will be higher. Normalized confusion matrices are shown in Table III.
Confusion matrix displays the real and predicted levels Disease recognition result: The proposed system identifies
of data of a grouping problem [16]. The total number of disease with 91.47% accuracy. In fig 6, the red label and blue
predicted class and target class of Blast, Blight, and Brown level name respectively indicates the original class and rice
spot disease are respectively: 52, 57, 52 and 60, 61, 55. leaf diseases that are determined by this system.

169

Authorized licensed use limited to: East Carolina University. Downloaded on June 15,2021 at 13:40:09 UTC from IEEE Xplore. Restrictions apply.
rice leaf have been extracted. We will include color, shape,
texture features, Hu moment, and test this by some different
types of algorithms in future work. Neural networks can be the
next step for detecting plant diseases more quickly. By doing
some minor modifications this same method can be realistic
to identify other leaf diseases.
R EFERENCES
[1] A. A. Joshi and B. Jadhav, “Monitoring and controlling rice diseases
using Image processing technique,” 2016 International Conference on
(a) Correct detection (b) Correct detection Computing, Analytics and Security Trends (CAST), pp. 471–476, 2016.
[2] R. P. Narmadha and G. Arulvadivu, “Detection And Measurement
of Paddy Leaf Disease Symptoms using Image Processing,” 2017
International Conference on Computer Communication and Informatics
(ICCCI), pp. 1–4, 2017.
[3] R. Deshmukh and M. Deshmukh, “Detection of paddy leaf diseases,
” International Journal of Computer Applications, vol. 975, pp. 8887,
2015.
[4] S. Phadikar and J. Sil, “Rice Disease Identification using Pattern Recog-
nition Techniques, ” 2008 11th International Conference on Computer
and Information Technology, pp. 420–423, 2008.
[5] G. Anthonys and N. Wickramarachch, “An Image Recognition System
for Crop Disease Identification of Paddy fields in Sri Lanka,” 2009
International Conference on Industrial and Information Systems (ICIIS),
(c) Correct detection (d) Incorrect detection pp. 403–407, 2009.
[6] S. Ramesh and D. vydeki “Rice Blast Disease Detection and Classi-
Fig. 6: Different rice disease detection example. fication Using Machine Learning Algorithm,” 2018 2nd International
Conference on Micro-Electronics and Telecommunication Engineering
TABLE IV: PERFORMANCE COMPARISON (ICMETE), pp. 255–259, 2018.
[7] K. Elangovan and S. Nalini, “Plant Disease Classification Using Image
Segmentation and SVM Techniques,” International Journal of Compu-
Reference Overall Accuracy tational Intelligence Research, vol. 13(7), pp. 1821–1828, 2017.
Prajapati et al. [17] 62% [8] N. Mangla, P. B. Raj, S. G. Hegde and R. Pooja, “Paddy leaf disease
Phadikar and Sil [4] 68.10% detection using image processing and machine learning, ” Int J Innov
Anthonys and Wickramarachchf [5] 70% Res Elec Electron Instrument Control Eng, vol. 7(2), pp. 97–99, 2019.
Joshi and Jadhav [1] 89.23% [9] K. BASHIR, M. REHMAN and M. BARI, “Detection and classification
This System 91.47% of rice diseases: An automated approach using textural features, ”
Mehran University Research Journal of Engineering and Technology,
vol. 38(1), pp. 239–250, 2019.
[10] S. D. Khirade and A. B. Patil, “Plant Disease Detection Using Image
Performance Comparison: Table IV represents the evalu- Processing, ” 2015 International conference on computing communica-
tion control and automation, pp. 768–771, 2015.
ation of this method with some existing works. Prajapati et al. [11] Q. Yao, Z. Guan, Y. Zhou, J. Tang, Y. Hu and B. Yang, “Application
[17] had proposed the dataset which was created manually of support vector machine for detecting rice diseases using shape and
by separating infected leaves into various disease classes. color texture features,” 2009 international conference on engineering
computation, pp. 79–83, 2009.
They had consulted the farmers and then the agriculturist [12] S. Phadikar, J. Sil and A. K. Das, “Classification of rice leaf diseases
for getting the exact plant disease name. It is to be noted based on morphological changes,” International Journal of Information
that, the comparative results shown in Table IV are of the and Electronics Engineering, vol. 2(3), pp. 460–463, 2012.
[13] T. Suman and T. Dhruvakumar, “Classification of paddy leaf diseases
different datasets. This is because no benchmark dataset or using shape and color features,” International Journal of Electrical and
source implementation was available to experiment on the Electronics Engineers, vol. 7(1), pp. 239–250, 2015.
same ground. Since it is burdensome to accurately implement [14] G. Athanikar and P. Badar, “Potato Leaf Diseases Detection and Classifi-
cation System, ” International Journal of Computer Science and Mobile
the state-of-the-art methods on this dataset, we presented the Computing, vol. 5(2), pp. 76–88, 2016.
results here to get an overview of how different methods [15] M. E. El-Telbany and M. Warda, “An empirical comparison of tree-based
perform on a similar type of dataset. It can be seen that learning algorithms: an egyptian rice diseases classification case study,
” International Journal of Advanced Research in Artificial Intelligence,
the proposed method shows a quite satisfactory accuracy for vol. 5(1), pp. 22–26, 2016.
practical use. [16] M. Vassallo-Barco, L. Vives-Garnique, V. Tuesta-Monteza, H. I. MejÃa-
Cabrera and R. Y. Toledo, “Automatic Detection of Nutritional Deficien-
V. C ONCLUSION cies In Coffee Tree Leaves Through Shape And Texture Descriptors, ”
Journal of Digital Information Management, Vol. 15(1), 2017.
A method for classifying the rice leaf diseases like Blast, [17] H. B. Prajapati, J. P. Shah and V. K. Dabhi, “Detection and classification
Blight, and the Brown spot is designed. This work describes of rice plant diseases, ” Intelligent Decision Technologies, vol. 11(3),
pp. 357–373, 2017.
three types of rice leaf disease using a machine learning algo- [18] A. K. G. Francisco, “GitHub,” 2020 [Online]. Available:
rithm with high correctness. Rice leaves containing diseases https://github.com/aldrin233/RiceDiseases-DataSet. [Accessed August
can smoothly identify by using the random forest decision tree. 2020].
The Matlab application is essential for doing the work. In this
implementation intensity moments of the affected part of the

170

Authorized licensed use limited to: East Carolina University. Downloaded on June 15,2021 at 13:40:09 UTC from IEEE Xplore. Restrictions apply.

You might also like