Image Classification
leaves
Dataset:
The PlantVillage dataset consists of images of leaves from various
plants. Only the images of corn leaves are used for this experiment.
The dataset contains 3 kinds of diseased corn leaves, and each is
treated as its own class alongside the healthy leaf images. So, the
model classifies a given image into one of 4 categories:
1. Healthy (1162 images)
2. Gray spot (513 images)
3. Common rust (1192 images)
4. Blight (985 images)
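The class sizes above are uneven, which matters later when judging the results. As a quick check, the short sketch below computes each class's share of the corn subset and the ratio between the largest and smallest class. It uses only the counts listed above; the variable names are illustrative.

```python
# Image counts per class, copied from the list above.
counts = {
    "Healthy": 1162,
    "Gray spot": 513,
    "Common rust": 1192,
    "Blight": 985,
}

total = sum(counts.values())

# Fraction of the corn subset that each class represents.
shares = {name: n / total for name, n in counts.items()}

# Ratio between the largest and the smallest class: a rough imbalance measure.
imbalance_ratio = max(counts.values()) / min(counts.values())

for name, share in shares.items():
    print(f"{name}: {counts[name]} images ({share:.1%})")
print(f"total: {total} images, largest/smallest class: {imbalance_ratio:.2f}")
```

Gray spot is the smallest class by a clear margin, so per-class metrics are more informative here than overall accuracy alone.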
Data Preparation:
The dataset is split into training, validation and test sets in the
ratio 0.6:0.2:0.2.
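A minimal sketch of such a split is shown below. The report does not say how the split was performed, so this example assumes a per-class (stratified) split, which keeps the class proportions the same in all three sets. The function name `stratified_split`, the label strings and the seed are hypothetical.

```python
import random

def stratified_split(labels, ratios=(0.6, 0.2, 0.2), seed=0):
    """Return (train, val, test) index lists, splitting each class separately."""
    rng = random.Random(seed)

    # Group sample indices by class label.
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)

    train, val, test = [], [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_train = int(len(indices) * ratios[0])
        n_val = int(len(indices) * ratios[1])
        train += indices[:n_train]
        val += indices[n_train:n_train + n_val]
        test += indices[n_train + n_val:]  # the remainder goes to the test set
    return train, val, test

# Hypothetical label list built from the class counts given for the dataset.
labels = (["healthy"] * 1162 + ["gray_spot"] * 513
          + ["common_rust"] * 1192 + ["blight"] * 985)

train_idx, val_idx, test_idx = stratified_split(labels)
print(len(train_idx), len(val_idx), len(test_idx))
```

Because the counts are rounded down per class, the test set absorbs the leftover images and ends up slightly larger than the validation set.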
Models:
5. ROC curves:
a. Healthy vs not healthy:
5. ROC curves:
a. Healthy vs not:
d. Blight vs not:
6. ROC curves:
a. Healthy vs not:
d. Blight vs not:
The accuracy did not improve over the baseline model. During
training, however, the training loss fell much further than the
validation loss and in fact reached 0.01. This is an indication
of over-fitting: the model memorizes the training set rather
than generalizing from its features. This happens because the
dataset is imbalanced and the training set is relatively
small with respect to the model's complexity. This is a common
problem with complex models: they end up more complex than
required. Since we cannot test every single hyperparameter
to build exactly the model we want, we will instead introduce
dropout layers in the model.
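To illustrate what a dropout layer does, the sketch below implements the common "inverted dropout" formulation in plain Python. It is not the report's model code: the report does not name the framework, the dropout rate, or where the layers were placed, so the function `dropout`, the rate of 0.5 and the toy activations are all assumptions.

```python
import random

def dropout(activations, rate, training, rng):
    """Inverted dropout.

    During training each unit is zeroed with probability `rate` and the
    surviving units are scaled by 1 / (1 - rate), so the expected activation
    is unchanged. At inference time the input is returned as-is.
    `rate` must be in [0, 1).
    """
    if not training or rate == 0.0:
        return list(activations)
    keep = 1.0 - rate
    return [a / keep if rng.random() < keep else 0.0 for a in activations]

rng = random.Random(0)
hidden = [1.0] * 10_000  # hypothetical layer output

dropped = dropout(hidden, rate=0.5, training=True, rng=rng)
zero_fraction = sum(1 for a in dropped if a == 0.0) / len(dropped)
mean_activation = sum(dropped) / len(dropped)

print(f"fraction zeroed: {zero_fraction:.3f}")      # close to the rate
print(f"mean activation: {mean_activation:.3f}")    # close to the original mean
```

Because a different random subset of units is silenced on every training step, the network cannot rely on any single unit, which limits how closely it can memorize the training set.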
3. ROC curves:
a. Healthy vs not:
b. Gray Spot vs not:
d. Blight vs not:
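Each "class vs not" ROC curve above treats one class as positive and the other three as negative. The sketch below shows one way to compute the points of such a curve and its area from per-image scores. The scores and labels are made-up toy values, not results from the report, and the helper names `roc_points` and `auc` are hypothetical.

```python
def roc_points(scores, is_positive):
    """Return (false positive rate, true positive rate) points obtained by
    lowering the decision threshold through every distinct score."""
    pairs = sorted(zip(scores, is_positive), key=lambda p: -p[0])
    n_pos = sum(1 for _, y in pairs if y)
    n_neg = len(pairs) - n_pos

    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(pairs):
        threshold = pairs[i][0]
        # Consume every sample tied at this score before emitting a point.
        while i < len(pairs) and pairs[i][0] == threshold:
            if pairs[i][1]:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / n_neg, tp / n_pos))
    return points

def auc(points):
    """Area under the curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))

# Toy example: predicted probability of "healthy" for six images.
labels = ["healthy", "blight", "healthy", "common_rust", "gray_spot", "healthy"]
healthy_scores = [0.9, 0.2, 0.7, 0.4, 0.1, 0.35]

# One-vs-rest: "healthy" is positive, every other class is negative.
is_healthy = [label == "healthy" for label in labels]
points = roc_points(healthy_scores, is_healthy)
print(points)
print(f"AUC (healthy vs not): {auc(points):.3f}")
```

Repeating this with the scores for gray spot, common rust and blight gives the remaining three curves.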