V. Data visualization
[Figure panels: Open Eyes, No-Yawn, Yawn]
VII. Training and testing dataset
Distribution of dataset
PROS:
➢ Delivers the high level of precision that image
recognition tasks require.
➢ Automatically learns the relevant features
without any human intervention.
➢ Weight sharing.
CONS:
➢ CNN does not encode the position and
orientation of objects.
➢ Lacks the ability to be spatially invariant to the
input data.
➢ A large amount of training data is necessary.
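The weight-related point in the PROS list above can be made concrete with a small parameter count. The sketch below is illustrative only: the layer sizes and the helper names `conv_params` and `dense_params` are assumptions, not values taken from the model in this paper. It compares a convolution layer, whose kernel weights are reused at every image position, with a fully connected layer that produces a feature map of the same size.

```python
def conv_params(in_channels, out_channels, kernel_size):
    """Weights plus biases of a 2D convolution layer.

    The same kernel is reused at every spatial position, so the count
    does not depend on the image height or width.
    """
    weights = out_channels * in_channels * kernel_size * kernel_size
    biases = out_channels
    return weights + biases


def dense_params(in_features, out_features):
    """Weights plus biases of a fully connected layer (no sharing)."""
    return in_features * out_features + out_features


# Hypothetical layer sizes, chosen only for illustration.
height, width = 224, 224
in_ch, out_ch, kernel = 3, 64, 3

shared = conv_params(in_ch, out_ch, kernel)
# A dense layer mapping the whole image to a same-sized 64-channel map.
unshared = dense_params(height * width * in_ch, height * width * out_ch)

print("convolution layer parameters:", shared)  # 1792
print("dense layer parameters:", unshared)
print("ratio:", unshared // shared)
```

The convolution layer needs 1792 parameters regardless of image size, while the dense equivalent grows with the product of input and output sizes.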
ResNet-152
ResNet-152 is a convolutional neural network with 152
layers. A pretrained version of the network, trained on
more than a million images from the ImageNet database,
is available. The pretrained network can classify images
into 1000 object categories, such as keyboard, mouse,
pencil, and many animals. As a result, the network has
learned rich feature representations for a wide range of
images. The network accepts input images of 224 by 224
pixels.
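As a rough illustration of what a 1000-category output means in practice, the sketch below turns a vector of 1000 class scores into probabilities and picks the most likely categories. It assumes the usual softmax-plus-ranking step. The scores are made up rather than produced by a real network, the class indices are arbitrary, and the helper names `softmax` and `top_k` are hypothetical.

```python
import math

NUM_CLASSES = 1000  # number of object categories named in the text


def softmax(scores):
    """Convert raw class scores into probabilities that sum to 1."""
    highest = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(probs, k=5):
    """Return (class_index, probability) pairs for the k most likely classes."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return [(i, probs[i]) for i in order[:k]]


# Made-up scores standing in for the network output; indices 10 and 20
# are arbitrary and do not correspond to any particular category.
logits = [0.0] * NUM_CLASSES
logits[10] = 6.0
logits[20] = 4.0

probs = softmax(logits)
best = top_k(probs, k=2)
for class_index, probability in best:
    print(class_index, round(probability, 3))
```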
CONS:
➢ Error detection becomes challenging for deeper
networks.
➢ The learning could be quite ineffective if the
network is too shallow.
ResNet-152 Training loss vs Validation loss
A confusion matrix is an M × M matrix used to evaluate
the performance of a classification model, where M is
the number of target classes. The matrix compares the
actual target values with those predicted by the machine
learning model. This provides a comprehensive
picture of how well the classification model is performing
and what kinds of errors it is making.
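A minimal sketch of how such a matrix can be built is shown below. The class names are borrowed from the image labels in this document, but the actual and predicted label lists are made up for illustration and are not results from the paper. The convention used here, rows for actual classes and columns for predicted classes, is an assumption.

```python
def confusion_matrix(actual, predicted, labels):
    """Build an M x M matrix where entry [i][j] counts samples whose
    actual class is labels[i] and whose predicted class is labels[j]."""
    index = {label: i for i, label in enumerate(labels)}
    size = len(labels)
    matrix = [[0] * size for _ in range(size)]
    for a, p in zip(actual, predicted):
        matrix[index[a]][index[p]] += 1
    return matrix


labels = ["open_eyes", "no_yawn", "yawn"]

# Made-up labels for six images, for illustration only.
actual = ["yawn", "yawn", "no_yawn", "open_eyes", "yawn", "no_yawn"]
predicted = ["yawn", "no_yawn", "no_yawn", "open_eyes", "yawn", "yawn"]

cm = confusion_matrix(actual, predicted, labels)
for label, row in zip(labels, cm):
    print(f"{label:>10}: {row}")

# Correct predictions lie on the diagonal.
correct = sum(cm[i][i] for i in range(len(labels)))
accuracy = correct / len(actual)
print("accuracy:", round(accuracy, 3))
```

Off-diagonal entries show which classes are confused with each other. In this toy data, one "yawn" image is predicted as "no_yawn" and one "no_yawn" image is predicted as "yawn".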
For the CNN model, the accuracy dropped when the
input image was a “yawn” image, but the model
CNN – Precision, Recall, F1-Score
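For reference, the three metrics named in the caption above have standard per-class definitions in terms of confusion-matrix counts. The notation is an assumption introduced here: $C_{ij}$ is the number of samples whose actual class is $i$ and whose predicted class is $j$, and $M$ is the number of classes.

```latex
\[
\mathrm{Precision}_k = \frac{C_{kk}}{\sum_{i=1}^{M} C_{ik}}, \qquad
\mathrm{Recall}_k = \frac{C_{kk}}{\sum_{j=1}^{M} C_{kj}}, \qquad
\mathrm{F1}_k = \frac{2 \,\mathrm{Precision}_k \,\mathrm{Recall}_k}
                     {\mathrm{Precision}_k + \mathrm{Recall}_k}
\]
```

Precision for class $k$ divides the correct predictions by everything predicted as $k$ (a column sum), while recall divides by everything that actually belongs to $k$ (a row sum).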
XI. Future Scope
Moving ahead, we can do a few things to improve the
outcomes and fine-tune the models.