Professional Documents
Culture Documents
Project Report
On
“Plant Disease Detection System”
Submitted by
Guided by
Mrs. Z. A. Zalte
(Assistant Professor)
This is to certify that, Mr. Shaikh Saud Abdul Saboor, Quazi fahad
Arsalan, Shaikh Shoeb, Siddique Shakeeb has successfully completed
project work entitled “Plant Disease Detection System” in partial fulfillment
for award of Bachelor of Technology(Computer Engineering) Degree of Dr.
Babasaheb Ambedkar Technological University, Lonere Dist. Raigad.
List of Abbreviations i
List of Figures ii
List of Screenshots iii
1. INTRODUCTION 1
1.1 Introduction 1
1.2 Necessity 4
1.3 Objectives 5
1.4 Exisitng system 6
1.5 Proposed system 6
2. LITERATURE SURVEY 8
2.1 Introduction 8
2.2 Deep Learning 9
3. SYSTEM DEVELOPMENT 11
3.1 General Discript ion 11
3.2 Technical Discription 12
3.3 UML Diagram 14
3.4 Sample Code and Output 18
4. PERFORMANCE ANALYSIS 21
4.1 Different Modules of System 21
4.2 Functional Module of System 21
5. CONCLUSION 25
5.1 Conclusion 25
5.2 Future Scope 25
Refrences iv
Acknowledgement v
List of Abbreviations
I
List of figures
II
List of Screenshots
III
1. Introduction
1.1 Introduction
Plant disease detection is an Innovative and Enlightening System helping the
users to know the disease, trainings or any interesting things taking place
around their Area. This Organization aids the native community to keep
themselves up to date about the events around their locality or zone or in their
town. There are 2 things for this method to work; one for the image processing
and another is machine learning. The user is permitted to sight the disease only
ohis town while user can supplement disease connected to any town. Admin
will show if any misuse or inappropriate or false disease added by any users
and will take specific act. The Front end used is Android Studio and backend
as SQL Server. The user has to record into the system to using this app and
can bring up-to-date his details as well. The healthy leaf is shown first and so
on, the user can also restore the disease resultant the latest one shown first and
current disease will be shut. The user can add an image and a title connected to
the leaf. The user can use only 500 words to give a lecture on disease. The
Appearance and texture of evaluation the disease is thrilling and amazing as
the system offers swipe to move to the next or earlier disease with transition
properties. The agriculturist in provincial regions may think that it’s hard to
differentiate the malady which may be available in their harvests. It's not
moderate for them to go to agribusiness office and discover what the infection
may be. Our principal objective is to distinguish the illness introduce in a plant
by watching its morphology by picture handling and machine learning. Pests
and Diseases results in the destruction of crops or part of the plant resulting
indecreased food production leading to food insecurity. Toxic pathogens, poor
disease control, drastic climate changes are one of the key factors which arises
in dwindled food production. Various modern technologies have emerged to
minimize postharvest processing, to fortify agricultural sustainability and to
maximize the productivity. Various Laboratory based approaches such as
polymerase chain reaction, gas chromatography, mass spectrometry,
1
thermography and hyper spectral techniques have been employed for disease
identification. However, these techniques are not cost effective and are high
time consuming. In recent times, server based and mobile based approach for
disease identification has been employed for disease identification. Several
factors of these technologies being high resolution camera, high performance
processing and extensive built in accessories are the added advantages1
resulting in automatic disease recognition. Modern approaches such as
machine learning and deep learning algorithm has been employed to increase
the recognition rate and the accuracy of the results. Various researches have
taken place under the field of machine learning for plant disease detection and
diagnosis, such traditional machine learning approach being random forest,
artificial neural network, support vector machine (SVM), fuzzy logic, K-
means method, Convolution neural networks etc. Random forests are as a
whole, learning method for classification, regression and other tasks that
operate by constructing a forest of the decision trees during the training time.
Unlike decision trees, Random forests overcome the disadvantage of over
fitting of their training data set and it handles both numeric and categorical
data.
2
one of the key factors which arises in dwindled food production. Various
modern technologies have emerged to minimize postharvest processing, to
fortify agricultural sustainability and to maximize the productivity. Various
Laboratory based approaches such as polymerase chain reaction, gas
chromatography, mass spectrometry, thermography and hyper spectral
techniques have been employed for disease identification. However, these
techniques are not cost effective and are high time consuming.In recent times,
server based and mobile based approach for disease identification has been
employed for disease identification. Several factors of these technologies
being high resolution camera, high performance processing and extensive built
in accessories are the added advantages resulting in automatic disease
recognition Plant disease detection is a portable system which runs under
automaton stage which is established with native Java language. App will run
smoothly and very fast. Google Material Design is implemented because of
which app looks more attractive and beautiful with good user impression and
experience. Combined with admin and user it manages news, various category,
notification and many things whenever we want. Main preference is good code
with good design. We can save money and time by using this app and we can
create our own design and special apps or different kind of app based on our
requirements. Plant disease detection is an Innovative and Enlightening
System helping the users to know the disease, trainings or any interesting
things taking place around their Area. This Organization aids the native
community to keep themselves up to date about the events around their
locality or zone or in their town. There are 2 things for this method to work;
one for the image processing and another is machine learning. The user is
permitted to sight the disease only of his town while user can supplement
disease connected to any town. Admin will show if any misuse or
inappropriate or false disease added by any users and will take specific act.
The Front end used is Android Studio and backend as SQL Server. The user
has to record into the system to using this app and can bring up-to-date his
details as well. The healthy leaf is shown first and so on, the user can also
restore the disease resultant the latest one shown first and current disease will
be shut. The user can add an image and a title connected to the leaf.
3
1.2 Necessity
In this changing environment, appropriate and timely disease identification
including early prevention has never been more important. There are several
ways to detect plant pathologies. Some diseases do not have any visible
symptoms, or the effect becomes noticeable too late to act, and in those
situations, a sophisticated analysis is obligatory. However, most diseases
generate some kind of manifestation in the visible spectrum, so the naked eye
examination of a trained professional is the prime technique adopted in
practice for plant disease detection. In order to achieve accurate plant disease
diagnostics a plant pathologist should possess good observation skills so that
one can identify characteristic symptoms to find out whether the leaf is
diseased or healthy, certain steps must be followed. i.e., Preprocessing,
Feature extraction, Training of classifier and Classification. Preprocessing of
image, is bringing all the images size to a reduced uniform size. Then comes
extracting features of a preprocessed image which is done with the help of
HOG .Hog is a feature descriptor used for object detection. In this feature
descriptor the appearance of the object and the outline of the image is
described by its intensity gradients. One of the advantages of Hog feature
extraction is that it operates on the cells created. Any transformations don’t
affect this. Augmentation Process.: The main purpose of applying
augmentation is to increase the dataset and introduce slight distortion to the
images which helps in reducing overfitting during the training stage. In
machine learning, as well as in statistics, overfitting appears when a statistical
model describes random noise or error rather than underlying relationship. The
image augmentation contained one of several transformation techniques
including affine transformation, perspective transformation, and simple image
rotations Affine transformations were applied to express translations and
rotations (linear transformations and vector addition, resp.) where all parallel
lines in the original image are still parallel in the output image. To find a trans-
formation matrix, three points from the original image were needed as well as
their corresponding locations in the output image. For perspective
transformation, a 3×3transformation matrix was required. Straight lines would
remain straight even after the transformation. For the augmentation process,
simple image rotations were applied, as well as rotations on the different axis
4
by various degrees. Hu moments: Image moments which have the important
characteristics of the image pixels helps in describing the objects. Here Hu
moments help in describing the outline of a particular leaf. Hu moments are
calculated over single channel only. The first step involves converting RGB to
Gray scale and then the Hu moments are calculated. This step gives an array
of shape descriptors. Harlicka Texture: Usually the healthy leaves and
diseased leaves have different textures. Here we use Harlicka texture feature to
distinguish between the textures of healthy and diseased leaf. It is based on the
adjacency matrix which stores the position of (I, J). Texture is calculated
based on the frequency of the pixel I occupying the position next to pixel J. To
calculate Harlicka texture, it is required that the image be converted to gray
scale. Color Histogram: Color histogram gives the representation of the colors
in the image. RGB is first converted to HSV color space and the histogram is
calculated for the same. It is needed to convert the RGB image to HSV since
HSV model aligns closely with how human eye discerns the colors in an
image. Histogram plot provides the description about the number of pixels
available in the given color range
1.3 Objectives
Modern approaches such as machine learning and deep learning algorithm has
been employed to increase the recognition rate and the accuracy of the results.
Random forests are as a whole, learning method for classification, regression
and other tasks that operate by constructing a forest of the decision trees
during the training time. Unlike decision trees, Random forests overcome the
disadvantage of over fitting of their training data set and it handles both
numeric and categorical data. The histogram of oriented gradients (HOG) is an
element descriptor utilized as a part of PC vision and image processing for the
sake of object detection Here we are making utilization of three component
descriptors:
1. Hu moments
2. Harlicka texture
3. Color Histogram
5
1.4 Existing System (Limitations)
The problem of efficient plant disease protection is closely related to the
problems of sustainable agriculture and climate change. Research results
indicate that climate change can alter stages and rates of pathogen
development; it can also modify host resistance, which leads to physiological
changes of host-pathogen interactions. The situation is further complicated by
the fact that, today, diseases are transferred globally more easily than ever
before. New diseases can occur in places where they were previously
unidentified and, inherently, where there is no local expertise to combat them.
Timely and accurate diagnosis of plant diseases is one of the pillars of
precision agriculture.
6
in the image. RGB is first converted to HSV color space and the histogram is
calculated for the same. It is needed to convert the RGB image to HSV since
HSV model aligns closely with how human eye discerns the colors in an
image. Histogram plot provides the description about the number of pixels
available in the given color range.
7
2. Literature Survey
2.1 Introduction
In 2015, S. Khirade et Al. tackled the problem of plant disease detection using
digital image processing techniques and back propagation neural network
(BPNN) [1]. Authors have elaborated different techniques for the detection of
plant disease using the images of leaves. They have implemented Otsu’s
thresholding followed by boundary detection and spot detection algorithm to
segment the infected part in leaf. After that they have extracted the features
such as color, texture, morphology, edges etc. for cla ssification of plant
disease. BPNN is used for classification i.e. to detect the plant disease.
Shiroop Madiwalar and Medha Wyawahare analyzed different image
processing approaches for plant disease detection in their research [2]. Authors
analyzed the color and texture features for the detection of plant disease. They
have experimented their algorit hms on the dataset of 110 RGB images. The
features extracted for classification were mean and standard deviation of RGB
and YCbCr channels, grey level cooccurrence matrix (GLCM) features, the
mean and standard deviation of the image convolved with Gabor f ilter.
Support vector machine classifier was used for classification. Authors
concluded that GCLM features are effective to detect normal leaves. Whereas
color features and Gabor filter features are considered as best for detecting
anthracnose affected lea ves and leaf spot respectively. They have achieved
highest accuracy of 83.34% using all the extracted features. Peyman
Moghadam et Al. demonstrated the application of hyperspectral imaging in
plant disease detection task [3]. visible and nearinfrared (VNI R) and short
infrared (SWIR) spectrums were used in this research. Authors have used k--
wave means clustering algorithm in spectral domain for the segmentation of
leaf. They have proposed a novel grid removal algorithm to remove the grid
from hyperspectral images. Authors have achieved the accuracy of 83% with
vegetation indices in VNIR spectral range and 93% accuracy with full
spectrum. Though the proposed method achieved higher accuracy, it requires
the hyperspectral camera with 324 spectral bands so the s olution becomes too
8
costly. Sharath D. M. et Al. developed the Bacterial Blight detection system
for Pomegranate plant by using features such as color, mean, homogeneity, SD,
variance, correlation, entropy, edges etc. Authors have implemented grab cut
segm entation for segmenting the region of interest in the image [4]. Canny
edge detector was used to extract the edges from the images. Authors have
successfully developed a system which can predict the infection level in the
fruit. Garima Shrestha et Al. depl oyed the convolutional neural network to
detect the plant disease [5]. Authors have successfully classified 12 plant
diseases with 88.80% accuracy. The dataset of 3000 high resolution RGB
images were used for experimentation. The network has 3 blocks of co
nvolution and pooling layers. This makes the network computationally
expensive. Also the F1 score of the model is 0.12 which is very low because
of higher number of false negative predictions.
9
pieces. And thanks to the demand from gamers around the world, GPUs
(graphics processing units) make it possible for us to leverage deep learning
algorithms to build and train models with impressive results in a timeefficient
manner! So to all the parents who don’t like your kids playing games: Gaming
has its silver Living.
10
3. System Development
11
Input device: Standard Keyboard and Mouse
Output device: High Resolution Monitor
Memory: 1GB RAM1
Different plant leaf images (healthy and affected)
Software Requirements
Operating system: Android 9 and upper
IDE: Python IDLE
Data: Leaf Dataset
Modules: NumPy, pandas, matplotlib, sklearn
Processor: snapdragon pr any standard processor
Abnormal Normal
Defined Region
Classified
12
Flow Chart
Image Acquisition
Image Segmentation
13
3.4 UML and Data flow Diagram
14
Activity Diagram
15
Use Case Diagram
16
Data Flow Diagram
17
3.5 Sample Code and Output
Sample Code
18
Fig 3.5.2 : Tomato Module Code 2
19
Output Screenshots
20
4. Performance Analysis
21
• Analyzing and manipulating the image which includes data compression and
image enhancement and spotting patterns that are not to human eyes like
satellite photographs. Output is the last stage in which result can be altered
image or report that is based on image analysis.
4. Feature extraction
In machine learning, pattern recognition and in image processing, feature
extraction starts from an initial set of measured data and builds derived values
(features) intended to be informative and non-redundant, facilitating the
subsequent learning and generalization steps, and in some cases leading to
better human interpretations. Feature extraction is a dimensionality reduction
process, where an initial set of raw variables is reduced to more manageable
groups (features) for processing, while still accurately and completely
describing the original data set. When the input data to an algorithm is too
large to be processed and it is suspected to be redundant (e.g., the same
measurement in both feet and meters, or the repetitiveness of images presented
as pixels), then it can be transformed into a reduced set of features (also
named a feature vector). Determining a subset of the initial features is called
feature selection. The selected features are expected to contain the relevant
22
information from the input data, so that the desired task can be performed by
using this reduced representation instead of the complete initial data.
5. Feature selection
In machine learning and statistics, feature selection, also known as variable
selection, attribute selection or variable subset selection, is the process of
selecting a subset of relevant features (variables, predictors) for use in model
construction.
Feature selection techniques are used for four reasons:
• simplification of models to make them easier to interpret by
researchers/users,
• shorter training times,
• to avoid the curse of dimensionality,
• enhanced generalization by reducing overfitting (formally, reduction of
variance
23
mode of the classes (classification) or mean prediction (regression) of the
individual trees. Random decision forests correct for decision trees' habit of
over fitting to their training set. Random forest classifier creates a set of
decision trees from randomly selected subset of training set. It then aggregates
the votes from different decision trees to decide the final class of the test
object.
24
5. Conclusion
5.1 Conclusion
The main aim of our project is to detect whether it is diseased or healthy leaf
with the help of a Random Forest classifier. The objective of this algorithm is
to recognize abnormalities that occur on plants in their greenhouses or natural
environment. The image captured is usually taken with a plain background to
eliminate occlusion. The algorithm was contrasted with other machine
learning models for accuracy. Using Random Forest classifier, the model was
trained using 160 images of papaya leaves. The model could classify with
approximate 70 percent accuracy. The accuracy can be increased when trained
with vast number of images and by using other local features together with the
global features such as SIFT (Scale Invariant Feature Transform), SURF
(Speed Up Robust Features) and DENSE along with BOVW (Bag of Visual
Word). There are many methods in automated or computer vision plant
disease detection and classification process, but still, this research field is
lacking. In addition, there are still no commercial solutions on the market,
except those dealing with plant species recognition based on the leaf's images.
Cloud
Cloud storage is a service which lets you store data by transferring it over the
Internet or another network to an offsite storage system maintained by a third
party. There are hundreds of different cloud storage systems which include
personal storage which holds and/or backs up emails, pictures, videos and
other personal files of an individual, to enterprise storage which lets
businesses use cloud storage as a commercially-supported remote backup
solution where the company can securely transfer and store data files or share
them between.
25
Drones
Drone photography allows images and audio/video to be captured that might
not be otherwise possible for human photographers and
videographers.capacity can be enabled by the flight abilities of drones, their
small size or their ability to tolerate harsh environments.
26
References
[1] Ghaiwat Savita N, Arora Parul. Detection and classification of plant leaf diseases
using image processing techniques: a review. Int J Recent Adv Eng Technol
2014;2(3):2347– 812. ISSN (Online).
[2] Dhaygude Sanjay B, Kumbhar Nitin P. Agricultural plant leaf disease detection
using image processing. Int J Adv Res Electr Electron Instrum Eng 2013;2(1).
[3] Mrunalini R Badnakhe, Deshmukh Prashant R. An application of K-means
clustering and artificial intelligence in pattern recognition for crop diseases. Int Conf
Adv Inf Technol 2011;20. 2011 IPCSIT.
[4] Arivazhagan S, Newlin Shebiah R, Ananthi S, Vishnu Varthini S. Detection of
unhealthy region of plant leaves and classification of plant leaf diseases using texture
features. Agric Eng Int CIGR 2013;15(1):211–7.
[5] Kulkarni Anand H, Ashwin Patil RK. Applying image processing technique to
detect plant diseases. Int J Mod Eng Res 2012;2(5):3661–4.
[6] Bashir Sabah, Sharma Navdeep. Remote area plant disease detection using image
processing. IOSR J Electron Commun Eng 2012;2(6):31–4. ISSN: 2278-2834.
[7] Naikwadi Smita, Amoda Niket. Advances in image processing for detection of
plant diseases. Int J Appl Innov Eng Manage 2013;2(11). Patil Sanjay B et al. Leaf
disease severity measurement using image processing. Int J Eng Technol
2011;3(5):297–301.
[8] Chaudhary Piyush et al. Color transform-based approach for disease spot
detection on plant leaf. Int Comput Sci Telecommun 2012;3
IV
Acknowledgement
Ability & ambition are not enough for success. Success in any endeavor
depends a lot on the support, guidance, encouragement from our guides,
friends & our closer to perfection has helped us to motivate ourselves into
giving the best in our means. Gratitude is often the hardest emotion to express
& often one doesn’t find adequate words to convey what one feels.
V
VI