
ENGINEERING SCIENCE AND TECHNOLOGY INTERNATIONAL RESEARCH JOURNAL, VOL.3, NO.4, DEC, 2019

ISSN (e) 2520-7393
ISSN (p) 2521-5027
Received on 15th Dec, 2019
Revised on 28th Dec, 2019
www.estirj.com

Using Machine Learning Algorithms to enhance Dysplasia Diagnosis

Muhammad Adil1, Irshad Ahmed Sumra2, Saqlain Nawaz3

Department of Computer Science, Bahria University, Lahore, Pakistan.


Email: adil.concordia@gmail.com1, isomro28@gmail.com2, msaqlain.nawaz@gmail.com3

Abstract: Machine learning is advancing rapidly in the field of medicine. Barrett's esophagus (BE) is a precursor to esophageal adenocarcinoma (OAC). The diagnosis of esophageal cancer is relatively new. Endoscopic surveillance aims to detect dysplasia arising in Barrett's esophagus, as it is likely to be amenable to curative treatment. Normal cells can become cancer cells. Before cancer cells form in the body's tissues, cells go through abnormal changes called hyperplasia and dysplasia. In this study, predictive models for the detection of esophageal cancer are built with several techniques, such as logistic regression, random forest, and decision trees, to support effective and accurate decision making.

Keywords: Machine Learning, Barrett's esophagus, Esophageal Adenocarcinoma, Dysplasia.

1. Introduction

Esophageal cancer is among the most common cancers worldwide and a leading cause of cancer-related death. Timely diagnosis and prognosis of esophageal cancer have become key in cancer research, as they can facilitate the subsequent clinical management of patients. The importance of classifying cancer patients into high-risk or low-risk groups has led many research groups, from the biomedical and bioinformatics fields, to study the use of machine learning. Although machine learning methods can improve our understanding of the progression of esophageal cancer, an appropriate level of validation is required before these methods can be considered in everyday clinical practice. With the development of different clinical technologies, different tumor characteristics have been combined to diagnose esophageal cancer. Feature extraction and selection are essential to the standardization of the criteria established by data mining methods. However, there is little information about their application to the diagnosis of esophageal cancer. Here, we demonstrate the diagnostic ability of artificial intelligence to detect esophageal cancer.

Barrett's esophagus is the main identifiable premalignant lesion. Endoscopic surveillance of Barrett's esophagus is performed to detect adenocarcinoma at an early stage, when it is likely to be amenable to curative treatment. Current surveillance strategies typically rely on white light endoscopy to obtain random four-quadrant biopsies every 2 centimeters along the Barrett's esophagus segment [1]. This approach samples less than 5 percent of the Barrett's epithelium and is therefore prone to missing early lesions [2]. A novel endoscopic image enhancement technology, I-Scan, has been developed to improve lesion recognition in the gastrointestinal tract. I-Scan uses post-processing light filtering technology to enhance various features of the mucosa and microvasculature [3]. There is also growing evidence suggesting increased accuracy of dysplasia detection using I-Scan in the assessment of colorectal polyps [4,5,6]. Sifting through all feature data to support the diagnosis of clinical disorders is a difficult and time-consuming task. The purpose of this study is to identify esophageal adenocarcinoma based on the characteristics of the extracted tumor.

This paper is structured as follows: Section 2 discusses related work. Section 3 discusses the different algorithms and the data gathering resources. Section 4 presents the results and discussion in detail. Finally, Section 5 presents the conclusion and future research.

2. Related Work

Vinay Sehgal et al. [7] used a decision tree to train non-expert doctors. After training, medical students achieved a result of 63%, which is not sufficient, whereas registrars trained with the decision tree achieved 68%. J. Bergman et al. [8] introduce two medical problems that involve decision making for the reader. One concerns coronary care issues and the other macrolide prescriptions; for both, they introduced decision-making tools that doctors can use for prediction. They achieved accuracies of 78%, 90%, and 80% using a fast and frugal decision tree, a tool used in web-based training for education purposes, and mention a way to train students using websites. I. Kononenko et al. [9] describe a machine learning method for prediction in helicopter training and present inductive and Bayesian learning for diagnosis in the health field, as in the article in Applied Artificial

Corresponding author Email address: adil.concordia@gmail.com1


M. ADIL et.al: USING MACHINE LEARNING ALGORITHMS TO ENHANCE DYSPLASIA DIAGNOSIS 29

Intelligence. P. D. James et al. [10] assess measures of trainee performance in advanced endoscopy in a systematic review. P. Sharma et al. [11] developed and validated a classification system to identify high-grade dysplasia and esophageal adenocarcinoma in Barrett's esophagus using narrow-band imaging. Samina Abidi et al. [12] built and evaluated a computerized support platform called "Diabetes Web-Centric Information and Support Environment" (DWISE), which is useful in primary health care.

3. Methodology

The framework should be able to explain its decisions when diagnosing new patients. When faced with an unexpected solution to a new problem, the doctor will require further explanation; otherwise, he or she will not seriously consider the framework's suggestions.

Figure 3.1 shows the stages involved in our proposed work. After the expert's review, the data are in raw format. Preprocessing is then applied, followed by feature extraction.

Figure 3.1: Proposed Methodology

3.1 Requirement of machine learning algorithms

• Handle missing information.
In medical diagnosis, most patients' records lack certain information in the patient's description. The machine learning algorithm should be able to deal appropriately with such incomplete patient details.

• Handle noisy information.
Medical data are generally prone to uncertainties and errors. Consequently, machine learning algorithms suitable for medical applications must be able to handle noisy data effectively.

• Diagnostic information transparency.
The knowledge and decisions produced should be transparent to the clinician, who should be able to analyze and understand the generated knowledge. For instance, automatically generated knowledge can give the clinician a new point of view on the given problem, and may reveal new interrelations and regularities that clinicians have never seen explicitly.

• Ability to explain.
The only possibility for clinicians to accept a "black box" classifier is a situation where such a classifier outperforms, by a very large margin, every other classifier, including the doctors themselves. However, such a situation is generally improbable.

• Reduce the number of tests.
Collecting patient data in medical practice is often expensive, time consuming, and harmful to patients. It is therefore desirable to have a classifier that can reliably diagnose patients with a small amount of data. All candidate algorithms can be checked by providing a limited amount of data. However, the process of determining the right subset of data can be time consuming, as it is essentially a combinatorial problem.

3.2 Description of the algorithms

• Decision Tree
A decision tree is a decision-support tool that uses a tree-like graph or model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility. It is one way to display an algorithm that contains only conditional control statements. The entropy E over C classes, where p_i is the proportion of samples in class i, is
$E = -\sum_{i=1}^{C} p_i \log_2 p_i$

• Random forest
Random forests, or random decision forests, are an ensemble learning method for classification, regression, and other tasks that construct a large number of decision trees during training and output the class that is the mode of the classes predicted by the individual trees.
$E = -\sum_{i=1}^{C} p_i \log(p_i)$

• Logistic regression
Logistic regression is used to describe

Copyright ©2019 ESTIRJ-VOL.3, NO.4 (28-32)



data and to explain the relationship between one dependent binary variable and one or more nominal, ordinal, interval, or ratio-level independent variables. With coefficients β₀ and β₁ (written B₀ and B₁ in the original notation):
$z = \beta_0 + \beta_1 X$
$h_\Theta(x) = \mathrm{sig}(z)$
i.e. $h_\Theta(x) = \frac{1}{1 + e^{-(\beta_0 + \beta_1 X)}}$
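The logistic hypothesis above, together with the entropy measure listed for the tree-based models in Section 3.2, can be sketched in Python as follows. This is only a minimal illustration: the function names and the example values are assumptions made for this sketch and do not come from the study.

```python
import math


def entropy(class_probs):
    """Entropy E = -sum(p_i * log2(p_i)) over class proportions p_i."""
    # Classes with p_i == 0 contribute nothing and are skipped to avoid log(0).
    return sum(-p * math.log2(p) for p in class_probs if p > 0)


def sigmoid(z):
    """Logistic (sigmoid) function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def logistic_hypothesis(x, b0, b1):
    """h(x) = sig(z) with z = b0 + b1 * x (b0, b1 play the role of B0, B1)."""
    z = b0 + b1 * x
    return sigmoid(z)


# Hypothetical values, chosen only to exercise the functions.
balanced_split = entropy([0.5, 0.5])          # two equally likely classes
pure_node = entropy([1.0])                    # a node with a single class
p_positive = logistic_hypothesis(0.0, 0.0, 1.0)
```

A balanced two-class split has the highest entropy and a pure node has none, which is why tree-based learners prefer splits that reduce this quantity. The logistic output can be read as the probability of the positive class and thresholded (for example at 0.5) to obtain a binary label.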

Table 2. Different algorithms and their performance.

Algorithms            Effectiveness   Transparency   Handling Missing Data
Decision Tree         Good            Very good      Acceptable
Random forest         Good            Very good      Acceptable
Logistic regression   Good            Good           Acceptable

3.3 Data gathering

The patient data used in this study were gathered from the liver clinic in Lahore under the supervision of Dr. Irfan. 200 patients were selected for this study, of whom 130 were male and 70 were female; the mean age was 53.

Table 3. Patient's details

Cases                   200
Female                  70
Male                    130
Mean age (in years)     53.4
Bowel preparation
  Fair                  110
  Excellent             40
  Good                  50

In this study, lesions were graded based on mucosal (M) and vascular (V) patterns.

Figure 3.2. Vascular and mucosal patterns used for diagnosis (panels A, B, C, D):

M1: Regular circular or villous pits.
M2: Distorted or irregular pits, or featureless mucosa.
V1: Regular, uniform vessels.
V2: Irregular, dilated, tortuous vessels.

4. Results and Discussion

The results are presented in two tables. Table 4 highlights the accuracy, sensitivity, and specificity of the combined algorithms (Random Forest and k-NN, SVM and KNN), and Table 5 provides the results of the single algorithms.

Table 4. Performance of the combined algorithms

Algorithm                Accuracy   Specificity   Sensitivity
Random Forest and k-NN   81%        72%           86%
SVM and KNN              0.95%      95%           90%

Table 5. Performance of the algorithms

Algorithms                    Logistic regression   Decision tree   Random forest
Time to build a model         0.74                  0.45            0.60
Relative absolute error       42.66                 39.95           62.78
Root relative squared error   62.88                 58.23           67.15
Accuracy                      84.88                 98.23           89.15

Figure.3. Results of the time to build
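The measures reported in the results tables can be computed as sketched below. The paper does not state which definitions or tool were used, so this sketch assumes the commonly used definitions: accuracy, sensitivity, and specificity from confusion-matrix counts, and relative absolute error and root relative squared error measured against a baseline that always predicts the mean of the actual values. All counts and labels in the example are hypothetical and are not the study's data.

```python
import math


def classification_metrics(tp, fp, tn, fn):
    """Return (accuracy, sensitivity, specificity) from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    sensitivity = tp / (tp + fn)   # true positive rate
    specificity = tn / (tn + fp)   # true negative rate
    return accuracy, sensitivity, specificity


def relative_errors(actual, predicted):
    """Return (relative absolute error, root relative squared error) in percent.

    Both errors are relative to a baseline that always predicts mean(actual).
    """
    mean_actual = sum(actual) / len(actual)
    abs_err = sum(abs(p - a) for a, p in zip(actual, predicted))
    abs_base = sum(abs(a - mean_actual) for a in actual)
    sq_err = sum((p - a) ** 2 for a, p in zip(actual, predicted))
    sq_base = sum((a - mean_actual) ** 2 for a in actual)
    rae = 100.0 * abs_err / abs_base
    rrse = 100.0 * math.sqrt(sq_err / sq_base)
    return rae, rrse


# Hypothetical confusion-matrix counts for a dysplasia classifier.
acc, sens, spec = classification_metrics(tp=45, fp=5, tn=40, fn=10)

# Hypothetical labels (1 = dysplasia, 0 = no dysplasia) and predictions.
rae, rrse = relative_errors(actual=[1, 0, 1, 0], predicted=[1, 0, 0, 0])
```

Under these definitions, a relative error of 100 means the model does no better than always predicting the mean, so lower values indicate a better fit.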

Table 4. The best model for diagnosing esophageal cancer found in this analysis is the decision tree model with the top 5 predictors: radius means, esophageal regular circular, esophageal irregular pits, esophageal uniform vessels, and esophageal dilated tortuous vessels. It gives a prediction accuracy of 95%.

Figure.1. Results of the root relative squared error

Figure.2. Results of the root absolute error

Figure.4. Accuracy results

Over the last decade, machine learning has been increasingly used within the field of medicine, for example for breast cancers using ultrasonography. Models use patient-specific data to predict a medical outcome or to help clinicians in making a diagnosis [13]. Other examples include machine learning for prognosis in melanoma, predicting the risk of cerebrovascular disease, predicting the risk of recurrence of breast cancer, and the diagnosis of thyroid disease. Interestingly, these systems often outperform the detection abilities of specialists in each field [14]. At present, there is no standardized method for assessing the competence of endoscopists who perform endoscopy in patients with BE. There are no existing appropriate assessment techniques or guidelines to measure the accuracy, sensitivity, and specificity of the collected results.

5. Conclusion and Future work

Machine learning is a branch of computer science that uses mathematical models to produce computerized algorithms. Such a model can serve as a prediction model. Machine learning involves "learning" the key features of a data set to enable predictions about other, unseen data. One possible task is to separate objects into two types according to the features measured on them. Prediction models can be used to generate hypotheses about a disease. In this study, we apply


algorithms to the data set to accurately predict dysplasia. This opens the door to standardized training and qualitative assessment for those who perform endoscopy in Barrett's esophagus.

References

[1] K. K. Wang and R. E. Sampliner, “Updated guidelines 2008 for the diagnosis, surveillance and therapy of Barrett’s esophagus,” Am. J. Gastroenterol., vol. 103, no. 3, p. 788, 2008.
[2] P. Sharma et al., “A critical review of the diagnosis and management of Barrett’s esophagus: the AGA Chicago Workshop,” Gastroenterology, vol. 127, no. 1, pp. 310–330, 2004.
[3] G. Lipman et al., “Systematic assessment with I-SCAN magnification endoscopy and acetic acid improves dysplasia detection in patients with Barrett’s esophagus,” Endoscopy, vol. 49, no. 12, pp. 1219–1228, 2017.
[4] M. R. Banks et al., “High resolution colonoscopy in a bowel cancer screening program improves polyp detection,” World J. Gastroenterol. WJG, vol. 17, no. 38, p. 4308, 2011.
[5] A. Hoffman et al., “High definition colonoscopy combined with i-Scan is superior in the detection of colorectal neoplasias compared with standard video colonoscopy: a prospective randomized controlled trial,” Endoscopy, vol. 42, no. 10, pp. 827–833, 2010.
[6] F. Pigò et al., “i-Scan high-definition white light endoscopy and colorectal polyps: prediction of histology, interobserver and intraobserver agreement,” Int. J. Colorectal Dis., vol. 28, no. 3, pp. 399–406, 2013.
[7] V. Sehgal et al., “Machine Learning Creates a Simple Endoscopic Classification System that Improves Dysplasia Detection in Barrett’s Oesophagus amongst Non-expert Endoscopists,” Gastroenterol. Res. Pract., vol. 2018, 2018.
[8] J. Bergman et al., “59 A Novel, Interactive Web-Based Educational Tool Improves Detection and Delineation of Barrett’s Oesophagus Related Neoplasia (Born): The Born Project,” Gastrointest. Endosc., vol. 85, no. 5, p. AB48, 2017.
[9] I. Kononenko, “Inductive and Bayesian learning in medical diagnosis,” Appl. Artif. Intell. an Int. J., vol. 7, no. 4, pp. 317–337, 1993.
[10] P. D. James, L. Antonova, M. Martel, and A. Barkun, “Measures of trainee performance in advanced endoscopy: a systematic review,” Best Pract. Res. Clin. Gastroenterol., vol. 30, no. 3, pp. 421–452, 2016.
[11] P. Sharma et al., “Development and validation of a classification system to identify high-grade dysplasia and esophageal adenocarcinoma in Barrett’s esophagus using narrow-band imaging,” Gastroenterology, vol. 150, no. 3, pp. 591–598, 2016.
[12] S. Abidi et al., “Diabetes-Related Behavior Change Knowledge Transfer to Primary Care Practitioners and Patients: Implementation and Evaluation of a Digital Health Platform,” JMIR Medical Informatics, vol. 6, no. 2, p. e25, 2018.
[13] W.-J. Kuo, R.-F. Chang, D.-R. Chen, and C. C. Lee, “Data mining with decision trees for diagnosis of breast tumor in medical ultrasonic images,” Breast Cancer Res. Treat., vol. 66, no. 1, pp. 51–57, 2001.
[14] B. Sierra and P. Larranaga, “Predicting survival in malignant skin melanoma using Bayesian networks automatically induced by genetic algorithms. An empirical comparison between different approaches,” Artif. Intell. Med., vol. 14, no. 1–2, pp. 215–230, 1998.
