You are on page 1of 48

Comprehensive Doctoral Committee Meeting

Precise Agriculture System using Big Data analytics in


predicting yield by Irrigation Scheduling And plant disease
detection
06, April, 2022
School of Computing – Computer Science and Engineering(DSBS)
Full Time / Batch - Jan 2021
Supervisor Name of the Candidate
Dr. T.Karthick N.Vasudevan
Assistant Professor RC2113003011006
DSBS, SRMIST, Kattankulathur DSBS, SRMIST, Kattankulathur
Presentation Overview
 Course Work Details
 Literature Survey
 Research Gap
 Problem Statement
 Objectives
 First Objective Introduction
 Proposed Methodology
 Milestones
Works to be done
References
2
Course Work Completion Details

Sl.No Course Code and Title Core and Elective Status Grade

1 RME001 – RESEARCH METHODOLOGY – CORE


COURSE
Core Course Completed B+

2 RPE17001 – RESEARCH AND PUBLICATION


ETHICS – CORE COURSE Core Course Completed B+

3 20ITC504J – MACHINE LEARNING FOR DATA


ANALYTICS– CORE COURSE Core Course Completed O

4 CS3039 – DIGITAL IMAGE AND VIDEO


PROCESSING – CORE COURSE Core Course Completed A

5 CS3022 – INTERNET OF EVERYTHING – CORE


COURSE Core Course Completed A

OVERALL CGPA : 8.12


3
INTRODUCTION

• IoT is an infrastructure of interconnected objects, people or


systems for the purpose of connecting and exchanging data with
other devices and systems over the Internet.
• Big data analytics is the use of analytic techniques against
very large, diverse data sets from different sources and in different
sizes.
• Precision Agriculture System is the application of technologies like
IoT, Big Data and analytics on an agricultural field

3
LITERATURE SURVEY
AUTHOR AND TITLE YEAR AND PUBLICATION CONCEPTS DISCUSSED /
PROBLEM IDENTIFIED

Yafeng Zhao, Zhen Chen, Xuan Gao, IEEE Transactions 2021, DOI DoubleGAN is used for expand the data
Wenlong Song, Qiang Xiong, Junfeng 10.1109/TCBB.2021.3056683 set. The 256*256-pixel plant leaf
Hu and Zhichao Zhang, “ Plant Disease images were generated using small
Detection using Generated Leaves data set, a two-stage GAN named
Based on DoubleGAN” DoubleGAN is proposed.
The experimental results proved that
the classification accuracy for the
dataset generated with DoubleGAN is
higher than the original dataset.

Aliyu Muhammad Abdu, Musa Mohd Springer 2020, Here, the ROI (EROI) algorithm with
Mokji, and Usman Ullah Sheikh, “Deep https://doi.org/10.1007/978-3-030- pathological inference of the disease
Learning for Plant Disease 66645-3_6 symptom is used to generate the
Identification from Disease Region segmented image data for enhanced
Images” feature representation in DL models.
The results from different pre-trained
DL models AlexNet, ResNet, and VGG
were used to evaluate the efficiency of
the approach.

5
LITERATURE SURVEY
AUTHOR AND TITLE YEAR AND PUBLICATION CONCEPTS DISCUSSED /
PROBLEM IDENTIFIED
Xinda Liu, Weiqing Min , Shuhuan Mei, Lili IEEE Transaction 2021, DOI Here, they had constructed the first large-
Wang, and Shuqiang Jiang, “Plant Disease 10.1109/TIP.2021.3049334 scale plant disease dataset with 271 plant
Recognition: A Large-Scale Benchmark disease categories and 220,592 images.
Dataset and a Visual Region and Loss Then, they had designed an approach using
Reweighting Approach” cluster distribution of patch features and
reweighted patch features to highlight
diseased patches and the loss using learned
weights.

Tan Nhat Pham , Ly Van Tran, and Son Vu IEEE Access 2020,DOI They identified initial disease on plant leaves
Truong Dao, “Early Disease Classification of 10.1109/ACCESS.2020.3031914 with minor disease blobs by an ANN method.
Mango Leaves Using Feed-Forward Neural A list of several measurement-based features
Network and Hybrid Metaheuristic Feature that represents the blobs are chosen using
Selection” feature selection algorithm. The chosen
features are used as inputs for an ANN.
They compared the results obtained using
this methods with another popular CNN
models (AlexNet, VGG16, ResNet-50)

6
LITERATURE SURVEY
AUTHOR AND TITLE YEAR AND PUBLICATION CONCEPTS DISCUSSED /
PROBLEM IDENTIFIED
Zihao Mao, Jiaming Chen and Meng Yang, Springer 2019, https://doi.org/10.1007/978- they had proposed convolutional neural
“Multi-branch Structure for Hierarchical 3-030-31726-3_45 network based an end-to-end multi-branch
Classification in Plant Disease Recognition” hierarchical classification model.
The proposed model can select the sub-class
from the current cluster iteratively through
their designed Select Branch.
Meanwhile, a generalized model in this
hierarchical structure is used to create the
model more scalable for similar grouping task.

Siddharth Singh Chouhan , Ajay Kaul, Uday IEEE Transaction 2018, DOI The author had proposed Radial Basis
Pratap Singh and Sanjeev Jain, “Bacterial 10.1109/ACCESS.2018.2800685 Function Neural Network (BRBFNN) using
Foraging Optimization Based Radial Basis Bacterial foraging optimization for
Function Neural Network (BRBFNN) for identification and classification of plant leaf
Identification and Classification of Plant Leaf disease.
Diseases: An Automatic Approach Towards The seeding and grouping the points which
Plant Pathology” are same in some manner using region
growing approach was used for feature
extraction process.
Here the proposed BRBFNN was compared
with k-means algorithm and genetic
algorithms.

7
LITERATURE SURVEY
AUTHOR AND TITLE YEAR AND PUBLICATION CONCEPTS DISCUSSED /
PROBLEM IDENTIFIED
Xihai Zhang , Yue Qiao , Fanfeng Meng, IEEE Access 2018, DOI GoogleNet model is used with deep learning
Chengguo Fan and Mingming Zhang, 10.1109/ACCESS.2018.2844405 to improve the accuracy of corn leaf diseases
“Identification of Maize Leaf Diseases Using identification and reduces the number of
Improved Deep Convolutional Neural network parameters.
Networks” The different kind of maize images were
trained and tested with this model GoogleNet
by adjusting the parameters, pooling
combinations, classifiers and dropout
operations.
The number of parameters of the improved
models is smaller than that of other
structures like VGG and AlexNet structures.

Shanwen Zhang , Xiaowei Wu, Zhuhong You Elsevier, Computers and Electronics in This method uses combined shape and color
and Liqing Zhang, “Leaf image based Agriculture 2017, pp. 135–141 features from unhealthy leaf images in
cucumber disease recognition using sparse training to form a dictionary, and then get an
representation classification”, approximate SR of the input leaf images by a
sparse model solver.
The experimental results on the leaf image
database for cucumber diseases showed that
this method is efficient and feasible, with an
average recognition rate higher than 85.7%.

8
LITERATURE SURVEY
AUTHOR AND TITLE YEAR AND PUBLICATION CONCEPTS DISCUSSED /
PROBLEM IDENTIFIED

Johan .J ; Javier .B.; Edgar .S “Smart Soil IEEE sensor Journal 2018, DOI An autonomous WSN with Internetof-
Parameters Estimation System Using an 10.1109/JSEN.2018.2867432 Things, and cloud-service
Autonomous Wireless Sensor Network communication techniques is proposed
with Dynamic Power Management for the intelligent estimation of soil
Strategy” parameters.
It operates with an adaptive dynamic
power management strategy that
establishes an energy-accuracy
balance.

Wei Xu , Zhaa Zhang, Qiming Qin, Jian IEEE transaction on Geoscience and The alpha approximation method is
Hui, ‘Soil Moisture Estimation With Remote sensing, 2019 used to augment the measured data,
SVR and Data Augmentation Based on and can avoid solving the
Alpha Approximation Method” undetermined system of equations
with constraints directly.
Then, these data are applied in a
support vector regression machine for
soil moisture estimation.

9
LITERATURE SURVEY
AUTHOR AND TITLE YEAR AND PUBLICATION CONCEPTS DISCUSSED /
PROBLEM IDENTIFIED

Mitul sheth & Homera Durani, Proceedings of the 2nd International Here, by using the hyperspectral image
‘Estimate soil moisture using conference on Electronics, of one region of India they find the soil
Hyperspectal Remote Sensing for Crops Communication and Aerospace moisture of that area. The
“ Technology (ICECA 2018) hyperspectral imaging is to get the
spectrum for each pixel in the image of
a scene, with the purpose of finding
objects, identifying materials, or
detecting processes

Matt head, Malcolm Coull, Richard Imaging journal 2016 2,35 A neural network model using different
Gatkin and David Donnelly types of soil profile imagery is
“Automated Soil Physical Parameter developed to estimate the structure,
Assessment Using Smartphone and texture and drainage of the soil.
Digital Camera” .

10
LITERATURE SURVEY
AUTHOR AND TITLE YEAR AND PUBLICATION PROBLEM IDENTIFIED

Al-Gaadi K. A., Hassaballa A. A., Tola E Plus One Journal -2016 Prediction of Potato crops yield
”Prediction of Potato Crop Yield Using through analyzing their bioactivities
Precision Agriculture Technique” using satellite images gave
considerable information to improve
productivity.

Danilo T, , Silva R, Walter J., Cristiano Computers and Electronics in Here, They have shown that cotton
Z“Convolutional neural networks in Agriculture journal, 2020 crop yield prediction from the images
predicting cotton yield from images of using image analysis techniques and
commercial fields” convolution neural network

11
Research Gap
1. There is no generic model for agriculture providing
a more precise decision for irrigation scheduling with
low water and power consumption.
 Wide Sensor Network is used to develop agriculture system.
 But Challenges in such WSN include high bandwidth demand,
high energy consumption and covers only small areas.

4
Research Gap
2. Observing the plant and disease detection of the
plant is very risky in the agricultural field.
The techniques proposed so far are usually limited to
visible range images and not focused on the stages of
diseases.
The edge detection method, CNN and color feature
methods are used to detect plant disease
Still in complex backgrounds, It is very difficult to perform
segmentation i.e. separate the image from the leaf and
stem.

5
Research Gap
3. Yield predictions are important for managing the
quality of operation, aiming at increasing efficiency and
reducing losses.
 Time series forecasting and random forest method are
widely used with the Object detection algorithms to
predict future values based on previously observed values
with average climate conditions and plant images.
 It requires the use of several datasets since crop
yield depends on many different factors such as weather,
soil and plant disease.

6
Problem Statement
• Developing a precise agriculture system for providing
a more precise decision with an effective irrigation
scheduling system, plant disease management
system and yield identification system.

7
Overview of Research work

16
Objectives
• Design an effective plant disease detection and
management system
• Develop an efficient irrigation scheduling system for
low water and power consumption
• Predict the crop yield time accurately

8
PLANT DISEASE DETECTION

9
Fungal, viral and bacterial diseases
• The Fungus pollutes the plants by destroying and
emphasizing their cells.
• Bacterial diseased plants show symptoms like leaf
spots and wilts. In an advanced stage, these
infections occur, which are extremely difficult to
diagnose.
• Viral infections are frequently disrupted by a lack of
nutrients and pesticides.

10
11
• The Existing techniques in early 2000’s
• Support vector machine,
• Delta color Difference algorithm,
• Color Histogram and text Features,
• Sparse Representation

12
• SVM- It can solve linear and non-linear
problems and work well for many practical
problems.
• The idea of SVM is simple. The algorithm
creates a line or a hyperplane which separates
the data into classes.
• support vector machine work with different
kernel functions in classifying plant disease.

13
• Delta Color Difference algorithm - it’s a
measurement of how much a displayed color can
differ from its input color.
• A color histogram of an image represents the
distribution of the composition of colors in the
image
• Text Features may be specific structures in the image
such as points, edges or objects.

14
• Sparse representations classification (SRC) is a
powerful technique for pixelwise classification of
images and it is increasingly being used for a wide
variety of image analysis tasks.
• Sparse representations describe or highlight the
essential features.

15
• From 2010 to 2018, The neural networks had played
major role in recognizing the plant diseases.
• Neural Networks are a set of algorithms that tries to
recognize the patterns, relationships, and information
from the data through the process which is inspired
by and works like the human brain

16
• The mostly used neural network methods during
those periods were
• CNN
• GoogleNet
• VGG(Visual Geometry Group),
• MCNN (Multilayer CNN) etc.

17
• CNN- CNN as a machine learning algorithm that can
take in an input image, assign importance (learnable
weights and biases) to various aspects/objects in the
image, and be able to differentiate one from the
other.
• The main advantage of CNN compared to its
predecessors is that it automatically detects the
important features without any human supervision.

18
19
• There are three types of layers in a convolutional
neural network:
• Convolutional layer,
• Pooling layer, and
• Fully connected layer.

20
• CNN can be used with multiple layers as Multi
layers CNN
• GoogleNet - GoogLeNet is a convolutional
neural network that is 22 layers deep
• VGG-16 is a convolutional neural network that
is 16 layers deep

21
• Generative adversarial networks,
• Extended ROI (Region of Interest),
• LSTM and
• Inception V3 are used currently to notice the plant
Diseases.
• The mostly used neural network methods with GAN
are
• LeafGAN,
• DATGAN and
• DoubleGAN .

22
• Generative adversarial networks (GANs) are
algorithmic architectures that use two neural
networks, putting one against the other in
order to generate new, synthetic instances of
data that can pass for real data.
• They are used widely in image generation,
video generation and voice generation.

23
• A region of interest (ROI) is a portion of an
image that we want to filter or operate on in
some way.
• We can represent an ROI as a binary mask
image.
• In the mask image, pixels that belong to the
ROI are set to 1 and pixels outside the ROI are
set to 0 .

24
• The Long Short-Term Memory Network or LSTM for
short is an LSTM architecture
• It specifically designed for sequence prediction
problems with spatial inputs, like images or videos.
• LSTMs use a series of 'gates' which control how the
information in a sequence of data comes into, is
stored in and leaves the network. There are three
gates in a typical LSTM; forget gate, input gate and
output gate.
• Inception-v3 is a convolutional neural network that is
48 layers deep.
25
Author and Year Plant type Disease Types Techniques Used Dataset Accuracy
Usama Mokhtar et al Tomato Early blight or Powdery SVM real sample images 91.5%
[1], 2007 mildew

H. Ali et al [2],, 2008 Citrus Greenng, Downy and Delta color Difference Plant Village 90%
Anthracnose algorithm, Color
Histogram and text
Features

Zhang et al. [3], 2009 Cucumber Scab, Powdery, Downy, Sparse Representaion (SR Plant Village 85.7%
bacterial Angular, Model)
Anthracnose, Gray mold and
Corynespora cassiicola

Lsharda Mohanthy Pomegranate Leaf Blight and Scab CNN Clicked Image dataset 93.4%
et al.[4], 2016

X.Zhang et al [5], maize southern leaf blight, Gray leaf GoogleNet Plant village 92.9%
2018 spot, round spot, northern
leaf blight, Curvularia leaf
spot, dwarf mosaic brown
spot
and rust

Jie Hang et al. [6], Apple and Cherry Scab, Frogeye, Rust and VGG16 Plant Leaf Disease 91.7%
2018 Powdery Library
Author and Year Plant type Disease Types Techniques Used Dataset Accuracy
U. P. Singh et al [8] Mango Anthracnose Disease MCNN Real time mango dataset 90.13%
2019 and plant village

Dai et al [10], Tomato, Strawberry, Scroch, Fungus spot, Frogey DATFGAN DIV2K 91.1%
2020 Apple spot, Scab

CAP et al. [11], 2020 Cucumber brown spot , Melon yellow LeafGAN CUCUMBER DATA SET 97.9%
spot virus ,
powdery mildew

Hu Wan et al [12], Apple, Black Rot, Apple Scab, Rust, Inception-V3 Plant Village 88%
2020 Potato Early Blight, Late Blight, Leaf
Mold

A. M. Abdu et al. Potato Early Bligh and Late Blight Extended ROI PV dataset 95.37%
[13], 2020

Xinda liu et al [14], Cassava, Fruit, Crop pest, Soyabean leaf Blight LSTM weighted function PlantVillage datasets 91.78%
2021 Apple, Soyabean leaf
Blight

Apple, Late blight, mosaic virus, leaf DoubleGAN Plant Village 90.80%
Yafeng Zhao et al. Corn, mold, early blight, Two-
[15], 2021 Grape, spotted spider mites, Bacterial
Potato spot, yellow leaf curl virus
partial leaf spot and
target spot
Proposed Methodology
• CNN automatically extracts features from images,
and from these features it learns to detect and
recognize different objects.
• Early layers extract simple features like edges, and as
layers proceed features become more and more
complex. At the end, CNN uses all extracted features
to make a final prediction.
• The major drawback in basic architecture of CNN that
only presence of feature is captured and nowhere in
this approach spatial information is stored.

28
• Capsules are the groups of neurons that encode spatial information
as well as the probability of an object being present.
• In capsule network, corresponding to each entity in an image, there
is a capsule which gives Probability that the entity exists and
Instantiation parameters of that entity.
• The CNN layer is used to capture local entity features such as small
circle patches, Big circle patches and yellow colored patches
• The capsule-layer is used to capture the overall features such as
“disease type” and “disease stage,” using the low level features
frequency and spatial information.

29
30
DataSet Description (Grape Leaves):

Classes Number of Images


Black Rot 1632

Black Measles 1172

Leaf Blight 1587

Healthy 1370

Total 5761

• Implemented CNN algorithms using python with tensorflow


keras API and got accuracy of 91%.
• Working on Capsule layers with CNN to get improved
accuracy for plant disease detection and to identify plant
disease stages.
31
Milestones
 Completed the First DC Meeting on 27.01.2021.
 Completed the Course Work
 Presented the paper titled “Analysis of Plant Leaf Diseases Recognition
using Image processing with Machine Learning Techniques” at IEEE
International Conference on Advances in Computing, Communication
and Applied Informatics organized by St.Joseph College of Engineering,
Chennai from 28-01-2022 to 29-01-2022
 The paper was accepted for IEEE explore(Scopus Indexed) and yet to be
published
 Submitted the paper titled “An Efficient Capsule neural network for
Grape Plant Leaf Disease Detection” in a scopus indexed journal which
is under review.
 Submitted the work on plant disease detection techniques at DPRC
which is held on 24-03-2022.
32
Works to be Done (Short Term Goal)
• Working on Capsule with CNN to get improved accuracy for
plant disease detection.
• Completing the second objective of Research work
• Prepare and publish the paper in the Scopus and SCI journal
based on the above work

34
References
1. U Mokhtar, A.S Mona, HeshamHefny, “Tomato leaves diseases detection
approach based on support vector machines”, Institute of Electrical and
Electronic Engineers 2007.
2. H Ali et al., “Symptom based automated detection of citrus diseases using
color histogram and textural descriptors”, Elsevier, Comp. and Elex. in
Agriculture 2008.
3. Z Shanwen, W Xiaowei, Y Zhuhong and Z Liqing, “Leaf image based cucumber
disease recognition using sparse representation classification”, Elsevier,
Comp. and Elex. in Agriculture 2009.
4. Lorick Jain, M.A HarshaVardhan, M.L Nishanthand S.S Shylaja, “Cloud-based
system for Supervised Classification of Plant diseases using Convolutional
Neural Networks”, Inst. of Electrical and Electronic Engineers 2017.
44
5. Z Xihai, Q Yueand Z Ming, “Identification of Maize Leaf Diseases Using
Improved Deep Convolutional Neural Networks”, Institute of Electrical
and Electronic Engineers Access 2018.
6. H Jie, Z Dexiang, C Peng, Z Jun andW Bing, “Classification of Plant Leaf
Diseases Based on Improved Convolutional Neural Network”,
Multidisciplinary Digital Publishing Inst.Sensors 2019.
7. M Zihao, CJiaming and Y Meng, “Multi-branch Structure for Hierarchical
Classification in Plant Disease Recognition”, Springer 2019.
8. S UdayPratap et al., “Multilayer Convolution Neural Network for the
Classification of Mango Leaves Infected by Anthracnose Disease”,
Institute of Electrical and Electronic EngineersAccess 2019.
45
9. KMuhammad Attique, M IkramUllahLali ,S Muhammad, Abdulaziz Saud and A
Talha, “An Optimized Method for Segmentation and Classification of Apple
Diseases Based on Strong Correlation and Genetic Algorithm Based Feature
Selection”, Inst. of Electrical and Electronic Engineers Access 2019.
10. D Qiang, C Xi ,Q Yan , and Z Youhua, “Crop Leaf Disease Image Super-Resolution
and Identification with Dual Attention and Topology Fusion Generative
Adversarial Network”, Inst. of Electrical and Electronic Engg.Access 2020.
11. C QuanHuu, U Hiroyuki, K Satoshi and I Hitoshi, “LeafGAN: An Effective Data
Augmentation Method for Practical Plant Disease Diagnosis”, Inst. of Electrical
and Electronic Engg. Transaction 2020.
12. Wan Hu, L Zheng Lu, Q Wang and C Yuanyuan, “Plant Disease Classification
Using Deep Learning Methods”,Association for Computing Machinery 2020.
46
13. A Aliyu Muhammad, M Musa Mohd and S Usman Ullah, “Deep Learning for
Plant Disease Identification from Disease Region Images”, Springer 2020.
14. L Xinda,M Weiqing ,M Shuhuan, W Lili and J Shuqiang, “Plant Disease
Recognition: A Large-Scale Benchmark Dataset and a Visual Region and Loss
Reweighting Approach”, Inst. of Electrical and Electronic Engg. Transaction
2021.
15. Z Yafeng, C Zhen,H Junfengand Z Zhichao, “Plant Disease Detection using
Generated Leaves Based on DoubleGAN”, Inst. of Electrical and Electronic
Engg. Transactions 2021.

47
Thank you

You might also like