You are on page 1of 11

Deep Learning Model with

High F1-score for Imbalanced Medical


Data set

PRESENTED BY

G . R. S. N A G A K U M A R
(JUNIOR RESEARCH FELLOW)

D E PA R T M E N T O F E L E C T R I C A L E N G I N E E R I N G

N AT I O N A L I N S T I T U T E O F T E C H N O L O G Y - A N D H R A P R A D E S H
Motivation
Deep learning (DL) is one of the branches of AI that has seen exponential growth in recent years.
The scientific community has focused its attention on DL due to its
 versatility
 high performance
 high generalization capacity
 multidisciplinary uses

A large amount of Medical Data and the development of more powerful computers has also fostered an interest
in this area.
The Features of Medical Data like Eye Image will provide the information about different stages of Diabetic
Retinopathy(DR) and Glaucoma which leads to Loss of Vision.
As per Literature, an Edge Device is not available to detect the status of both DR and Glaucoma based on
Fundus image of Eye
This Device will assist the Ophthalmologist to analyze the severity of the diseases
Overview

Disorder Symptoms Treatment


Diabetic Retinopathy •Double vision Management of Diabetes
•Blurred vision laser treatment
•Loss of vision Surgery

Glaucoma •Eye pain Medical and Surgery


•Red-eye
•Blurred vision
•Vision Loss
Deep learning Based Screening Model

Convolutional
Variational Auto Encoder
(CVAE) Output Data

Input Data Healthy


Spatial
Features Pyramid Classifier
Pooling DR

GL
Methodology
Stages to develop an Edge Device for Screening various Imbalanced Medical Image Data
1. Learning of domain specific features for comprising fundus image patches through Convolutional
Variational Autoencoder (CVAE)
2. Domain Specific feature based image classifier with multiscale saliency and soft F loss
3. Model compression through quantization and pruning
4. Evaluation of compressed model on benchmark datasets
Stage - 1

 CVAE is constructed and trained on Fundus Images


 The Encoder can represent the (nxnx3) patches in to a d-dimensional vector(z-vector), it learns the domain
specific features
 Structural Similarity Index measure (SSIM) is considered as regularizer for reconstructive loss and aimed to
match the Structure
 Mean Square Error (MSE) is to match the mean brightness

Deep Learning Screening Model


Stage - 2
Diabetic
retinopathy

CVAE CLASSIFIER Glaucoma


ENCODER
Normal

Loss = BCE + F1

 Dilated Convolution can effectively reduce the computational burden


 Depth wise separable convolution are employed reduce compute complexity in basic CNN classifiers
 Binary Cross Entropy Loss is chosen to reduce the complexity of boundary to be constructed by the
classifier
 F1- Score is used as regularizer in loss function
 F1-Score is employed to avoid discontinuity in loss function.
Stage - 3

 The weights of stage 2 are of 64-bit precision after training the entire model
 The weights are quantized to 32-bit precision for development model
 The deployment model is fine-tuned such that the output of model in stage 2 matches with output of compressed model

Stage - 4

 Libraries like ONXX, TensorFlow servings, TensorFlow lite etc are employed to deploy the compressed model on
embedded board.
Selection of Embedded Boards
Edge Device – JETSON Nano
Edge Devices from Texas instruments

TDA4VM processor starter kit for edge AI vision systems - SK-TDA4VM


DR

Resized to (244,244,3) RESNET50


GL
CLASSIFIER

Normal
Loss = F1-Loss + Categorical Cross Entropy

DR
Resized to
(244,244,3) RESNET50
GL
CLASSIFIER

Normal

ASPP Block
Loss = F1-Loss + Categorical Cross Entropy
DR

Resized to (244,244,3) RESNET50


GL
CLASSIFIER

Normal
Loss = F1-Loss + Categorical Cross Entropy

DR
Resized to
(244,244,3) RESNET50
GL
CLASSIFIER

Normal

ASPP Block
Loss = F1-Loss + Categorical Cross Entropy

You might also like