0% found this document useful (0 votes)
58 views25 pages

FFFFFFFFFFFFFFFFFFFFFF

The document is a technical seminar report on 'Lung Cancer Disease Detection Using Machine Learning Approach' submitted by Sowhard Patel B J for the Bachelor of Engineering degree in Information Science and Engineering at Acharya Institute of Technology. It discusses the use of machine learning, specifically Convolutional Neural Networks (CNN), for early detection of lung cancer through image analysis, highlighting the importance of early diagnosis for improving patient outcomes. The report includes acknowledgments, an abstract, methodology, and a literature review, emphasizing the project's objective to enhance diagnostic accuracy and efficiency.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
58 views25 pages

FFFFFFFFFFFFFFFFFFFFFF

The document is a technical seminar report on 'Lung Cancer Disease Detection Using Machine Learning Approach' submitted by Sowhard Patel B J for the Bachelor of Engineering degree in Information Science and Engineering at Acharya Institute of Technology. It discusses the use of machine learning, specifically Convolutional Neural Networks (CNN), for early detection of lung cancer through image analysis, highlighting the importance of early diagnosis for improving patient outcomes. The report includes acknowledgments, an abstract, methodology, and a literature review, emphasizing the project's objective to enhance diagnostic accuracy and efficiency.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd

VISVESVARAYA TECHNOLOGICAL UNIVERSITY

"Jnana Sangama", Belagavi - 590 018

A Technical Seminar Report on

“Lung Cancer Disease Detection Using Machine


Learning Approach”
submitted in partial fulfillment of the requirement for the award of the degree

Bachelor of Engineering
in
Information Science and Engineering

Submitted by

SOWHARD PATEL B J
1AY20IS079

Under the Guidance of


Prof. Sushma T M
Assistant Professor

DEPARTMENT OF INFORMATION SCIENCE AND ENGINEERING


ACHARYA INSTITUTE OF TECHNOLOGY
(AFFILIATED TO VISVESVARAYA TECHNOLOGICAL UNIVERSITY, BELAGAVI, RECOGNISED BY AICTE, NEW DELHI)
Acharya Dr. Sarvepalli Radhakrishnan Road, Soldevanahalli, Bengaluru - 560107

2023-2024
DEPARTMENT OF INFORMATION SCIENCE AND ENGINEERING
ACHARYA INSTITUTE OF TECHNOLOGY
(Affiliated to Visvesvaraya Technological University, Belagavi, Recognized by AICTE, New Delhi)
Acharya Dr. Sarvepalli Radhakrishnan Road, Soldevanahalli, Bengaluru - 560107
2023 - 2024

Certificate
This is to certify that the Technical Seminar (18ISS84) entitled “Lung Cancer
Disease Detection Using Machine Learning Approach” carried out by Sowhard
Patel B J (1AY20IS079), is bonafide student of Acharya Institute of Technology,
Bengaluru in partial fulfillment for the award of the degree of Bachelor of
Engineering in Information Science and Engineering of the Visvesvaraya
Technological University, Belagavi during the year 2023-24. It is certified that all
corrections/suggestions indicated for Internal Assessment have been incorporated
in the report deposited in the departmental library. The Technical Seminar report
has been approved as it satisfies the academic requirements in respect of Technical
Seminar prescribed for the said Degree.

Signature of the Guide Signature of the HOD


Prof. Sushma T M Dr. Kala Venugopal

Name of the Seminar Coordinators: Signature with Date


1. Prof. Dhananjaya M K
2. Prof. Yogesh N
ACKNOWLEDGEMENT

The satisfaction that accompanies the successful completion of Technical Seminar Report
would be incomplete without the mention of the people who made it possible through constant
guidance and encouragement.

I would take this opportunity to express my gratitude to Sri. B. Premnath Reddy, Founder
Chairman, Acharya Institutes, Dr. Rajath Hegde M M, Principal, and Prof. C K Marigowda,
Vice Principal, Acharya Institute of Technology for providing the necessary infrastructure to
complete this Technical Seminar Report.

I wish to express my deepest gratitude to Dr. Kala Venugopal., Associate Professor and
Head of the Department, Information Science and Engineering and also would like to thank
Technical Seminar coordinators Prof. Dhananjaya M K and Prof. Yogesh N for their constant
support.

I wish to express my sincere thanks to my guide Prof. Sushma T M, Assistant Professor,


Department of Information Science and Engineering for helping me throughout and guiding me
from time to time.

A warm thanks to the faculty of Department of Information Science and Engineering, who
have helped me with their views and encouraging ideas.

Sowhard Patel B J(1AY20IS079)

i
Abstract

The analysis and study of lung diseases has been the most intriguing investigation zone of medical
experts from early days to the present day. To address this concern, a diagnosis system like this can
only help diminish the odds of getting risk to human live by early discovery of malignant growth.
By and by a couple of structures are proposed and still an enormous number of them are still a
hypothetical plan. In the ensuing philosophy, the performance of a neural network model is
examined to address this issue of recognizing cancerous cells in image data, an average issue in
therapeutic imaging applications. In an attempt to accomplish this task, a lung cancer identification
framework is developed based on AI and deep neural system, where in the methodology depends on
supervised learning for which a better precision has been obtained, especially by using the deep
learning mechanism. CNN classification is a game plan of lung tumor classification. The framework
includes various methods, for instance, picture acquisition, pre-preparing, enhancement,
segmentation, feature extraction, and neural framework identification. To put it concisely, machine
learning approach can give an unprecedented opportunity to improve decision support in lung cancer
treatment at low cost

ii
TABLE OF CONTENTS

Page No
Acknowledgement i
Abstract ii

Chapter 1 INTRODUCTION 1
1.1 Introduction 1
1.2 Literature review 2
1.3 Motivation to do the project work 3
1.4 Objective of the work 3
Chapter 2 BACKGROUND THEORY 4
2.1 Introduction to the project title 4
2.2 Theoretical discussions and Analysis 5
Chapter 3 METHODOLOGY 7
3.1 Proposed Methodology 7
3.2 Implementation 8
3.3 Algorithm 9
3.4 CNN approach 11
Chapter 4 RESULT ANALYSIS 13
4.1 Lung cancer stage evaluation 13
4.2 Model Testing of Image data 15
4.3 Classification Result 16
Chapter 5 CONCLUSIONS 17
5.1 Conclusion 17
REFERENCES 18
LIST OF FIGURES

Table No Table Title Page No

1 Architecture Diagram 6
2 Block Diagram of CNN 9
3 Data Flow Diagram of Lung Nodule 11
4 Table of Lung cancer stage evaluation 13
5 Table of Model Testing of Image data 15
6 Bar Graph of Total Number of Cancer Type 16
Lung Cancer Disease Detection Using Machine Learning Approach

Chapter 1

INTRODUCTION
1.1 Introduction
Lung cancer is one of the most prevalent and deadly forms of cancer worldwide, with
a high mortality rate, primarily because it is often diagnosed at an advanced stage. Early
detection and accurate diagnosis are critical for improving patient outcomes and
increasing survival rates. Machine learning, a subset of artificial intelligence, has
emerged as a powerful tool for assisting in the diagnosis of lung cancer. This approach
utilizes various machine learning algorithms and techniques to analyse medical data,
such as medical images (X-rays and CT scans), clinical information, and genetic data,
to aid in the early and accurate diagnosis of lung cancer. Machine learning models can
assist medical professionals in identifying potential cases of lung cancer, classifying
tumor types, and predicting the progression of the disease, ultimately improving patient
care and treatment.Due of enormous pervasiveness of smoking and air contamination
around the globe, lung malignancy is a lethal ailment and thus global problem in
ongoing decades. Lung Cancer constitute 41% of cancer burden in India. It happens
when cells in the lung mutate. They grow disorderly and cluster together to form a
tumor. Thus it is crucial to diagnose metastatic tumor at initial stage itself to ensure
timely treatment and increase survival rate. However, the diagnosis process is time
consuming and costly as it requires human intellect to make critical decisions. After
exploring the elegance of machine learning in all other domains for image object
detection, it is strongly believed that deep learning methods can contribute largely
towards this pressing issue. In this current examination, it has been proposed to use the
profound neural network system to improve the chances of tumor identification. Here,
supervised learning approaches are utilized with NSCLC Radio genomics lung
malignancy CT(computed tomography) picture dataset. A framework is built that
comprises of numerous steps, for example, image extraction, pre-processing,
binarization, thresholding, segmentation, feature extraction.

AIT/ISE/2023-2024 1
Lung Cancer Disease Detection Using Machine Learning Approach

1.2 Literature Review

In this research study, Yutong Xie and colleagues [1] employed a multi-view
information-based collective (MV-KBC) deep model to differentiate malignant lung
tumors from normal lung nodules using chest CT data. They employed a total of 9 KBC
[1] sub-models to train this model. To evaluate its performance, the model was tested
on the LIDC-IDRI dataset and compared against five contemporary classification
methods. In a related study, Qing Wu and Wenbing Zhao [2] proposed the use of an
EDM AI algorithm that incorporates vectored histograms for the early detection of
small-cell lung carcinoma (SCLC) and the identification of malignant malignancies.

In the study conducted by Lilik Anifah and collaborators [3], the researchers introduced
a method for detecting lung cancer by leveraging an Artificial Neural Network Back-
propagation approach and Gray Level Co-occurrence Matrices (GLCM) features. To
assess this approach, they accessed lung data from the Cancer Imaging Archive
Database, which comprised 50 CT images. The process involved several key steps,
which included image pre-processing, segmentation, feature extraction, and the
application of the Neural Network Back-propagation method, which had three layers.
This method was employed to recognize and classify tumor growth patterns in the lung
images.

Professor Anuradha Deshpande and Dhanesh Lokhande's research, as described in their


work [4], centered on the prediction of lung cancer using a combination of image
processing techniques, watershed segmentation, and Support Vector Machines (SVM).
This unique approach involved consolidating critical features from different types of
medical images to create a Fused Image format. CT (computed tomography) images
are adept at visualizing denser tissues, while MRI (magnetic resonance imaging) excels
in capturing softer tissues. By merging pertinent information from both types of images,
they successfully obtained comprehensive data in the fused image format. This fusion

process not only enhanced the quality of the resulting image but also ensured a more
complete and informative representation for lung cancer prediction.

AIT / ISE / 2023-2024 2


Lung Cancer Disease Detection Using Machine Learning Approach

Abbas Khosravi and Amin Khatami's study, as detailed in their work [6], addressed the
challenge of classifying high-dimensional datasets. Traditionally, classifying such
datasets posed difficulties. However, their approach employed autoencoders and deep
learning techniques to enable successful classification.

1.3. Motivation to do the project

Lung cancer is a global health challenge, with high morbidity and mortality rates, and
it represents a significant burden on healthcare systems and patient well-being. The
motivation for undertaking a project on lung cancer disease diagnosis using a machine
learning approach stems from several compelling reasons. lung cancer disease
diagnosis using a machine learning approach is to harness the power of technology to
address a pressing healthcare challenge, improve patient outcomes, and contribute to
the ongoing fight against this devastating disease. This research has the potential to
revolutionize the way we diagnose and manage lung cancer, ultimately saving lives
and reducing the burden on patients and healthcare systems.

1.4 Objective of the work

The primary objective of this project is to establish a framework that serves as a


valuable tool for clinical specialists, enabling them to validate and cross-confirm their
diagnostic findings related to lung cancer. The conventional diagnostic process is
known to be time-consuming, resource-intensive, and costly. In contrast, this tool,
which is based on deep learning techniques, offers the capability to identify tumor
growth and predict cancer stages.By leveraging the automation of image processing
and artificial intelligence, this tool significantly reduces the human effort required for
detecting the presence of cancer cells within medical images. This not only enhances
the efficiency of the diagnostic process but also provides a more accurate and reliable
means of confirming the diagnosis,ultimately benefiting both healthcare professionals.

AIT / ISE / 2023-2024 3


Lung Cancer Disease Detection Using Machine Learning Approach

Chapter 2

BACKGROUND THEORY

2.1. Introduction to the Project title

Lung cancer is a highly prevalent and deadly disease, particularly in developing


countries, with a concerning mortality rate of 19.4% [1]. Detecting lung tumors early is
of utmost importance as it significantly impacts the chances of survival. Various
imaging techniques, including Computed Tomography (CT), Sputum Cytology, Chest
X-ray, and Magnetic Resonance Imaging (MRI), are employed for early detection,
primarily categorizing tumors into two classes: (i) non-cancerous (benign) and (ii)
cancerous (malignant) [2].Manual analysis and diagnosis systems have limitations, but
they can be greatly enhanced through the implementation of image processing
techniques. Many studies have explored these techniques for early cancer detection,
although the accuracy of early cancer detection remains a challenge. With the
emergence of advanced machine learning techniques, researchers are striving to
improve early cancer diagnosis. Neural networks, in particular, play a crucial role in
distinguishing cancer cells from normal tissues, serving as a potent tool for developing
AI-based cancer detection systems. The effectiveness of cancer treatment relies on
accurately segregating tumor cells from normal cells, with the classification of tumor
cells and neural network training forming the core of machine learning-based cancer
diagnosis [3].In this context, the paper introduces a Convolutional Neural Network
(CNN) based approach to classify lung tumors as either benign or malignant. This
method leverages the power of deep learning to enhance the accuracy of lung cancer
diagnosis, potentially leading to earlier detection and more effective treatment
strategies.

2.2. Theoretical Discussion and Analysis


Lung cancer stands as the leading cause of cancer-related fatalities, encompassing
cancers originating in the windpipe, main airways, and lungs. It emerges due to the
uncontrolled growth and dissemination of cells within the lung tissue.

AIT/ISE/2023-2024 4
Lung Cancer Disease Detection Using Machine Learning Approach

risk of lung cancer. The primary risk factor for lung cancer in Indian men is the
excessive use of tobacco, including cigarettes and beedis. Remarkably, among Indian
women, smoking is less prevalent, suggesting the presence of other contributing factors.
These factors include exposure to radon gas, air pollution, and workplace chemicals.
Lung cancer can be categorized as primary lung cancer when it initiates within the lung
itself, while secondary lung cancer occurs when cancerous cells from another part of
the body spread to the lungs. The stage of the cancer is determined by the tumor's size
and its extent of spread. Early-stage cancer is characterized by small tumors confined
to the lung, whereas advanced-stage cancer has infiltrated surrounding tissues or other
areas of the body. Improving the survival rate for lung cancer hinges on early detection
and intervention. A comprehensive understanding of risk factors can be instrumental in
preventing the onset of this devastating disease. The application of machine learning
techniques for detection can offer a pivotal advancement in the field of early cancer
diagnosis. Moreover, if we can streamline and enhance the diagnostic process for
radiologists, it represents a critical stride toward the overarching objective of achieving
improved early detection.

AIT / ISE / 2023-2024 5


Lung Cancer Disease Diagnosis Using Machine Learning Approach

Chapter 3

METHODOLOGY

3.1. Proposed Methodology

lung cancer from chest CT images using Convolutional Neural Networks (CNN). In the initial
stage, lung regions are extracted from the CT image, and within these regions, individual slices
are segmented to identify tumors. These segmented tumor regions are utilized to train the CNN
architecture. Subsequently, the trained CNN is employed to test patient images. The primary
objective of this research is to determine whether the tumor detected in a patient's lung is
malignant or benign. The proposed system operates as follows: lung regions are identified and
segmented, tumors within these regions are isolated and used for CNN training. Once trained,
the CNN is capable of discerning cancerous presence in lung CT images, aiding in the crucial
differentiation between malignant and benign tumors.

Figure 3.1 Architecture Diagram

AIT / ISE / 2023-2024 6


Lung Cancer Disease Diagnosis Using Machine Learning Approach

3.2 Implementation
This model comprises of following phases:
• Image Processing:
In the realm of image processing, the primary focus is on the manipulation and analysis
of images. Images are typically composed of a combination of RGB (Red, Green, Blue)
colors. Image processing encompasses a range of operations and techniques that are
applied to images with the objective of enhancing the image quality or extracting
valuable information from it. In this context, images serve as the input, and the output
can either be an improved image or specific properties and data derived from the image
itself.

• Image Filtering:
Image filtering is a technique used to modify or enhance an image, with the aim of
emphasizing particular features or reducing unwanted elements. This process
encompasses various operations, including smoothing, sharpening, noise reduction, and
edge enhancement. Images can be filtered in either the frequency domain or the spatial
domain. Image filtering, the value of a specific pixel in the output image is determined
by applying a particular algorithm or operation to the values associated with that pixel.
This transformation is instrumental in improving the quality and interpretability of
images in a wide range of applications, including image enhancement, noise reduction,
and feature extraction.

• Feature Extraction:
Image processing often involves the crucial step of feature extraction, which entails the
detection and representation of specific elements or characteristics of interest within an
image. This step is pivotal as it serves as a bridge, transitioning the image from a visual
representation to a numerical or alphanumerical data format. In this framework, we
utilize Convolutional Neural Networks (CNNs) as a part of this process. CNNs are
employed to automatically extract and represent meaningful features from images,
enabling subsequent data processing, analysis, or interpretation. This integration CNNs

AIT / ISE / 2023-2024 7


Lung Cancer Disease Diagnosis Using Machine Learning Approach

• Segmentation:
Segmentation is a technique employed in image processing to divide a digital image
into multiple segments or sets of pixels. This process is used to identify and delineate
objects and boundaries within an image, such as lines, curves, or other structural
elements. Segmentation is instrumental in creating regions or segments in which all the
pixels share a
common property or characteristic. One of the most straightforward methods for
segmentation is thresholding. Thresholding involves defining a specific threshold
value, and pixels in the image are then assigned to segments based on whether their
values are above or below this threshold. It's a fundamental and commonly used
technique for image segmentation and plays a vital role in various image analysis and
computer vision applications.

• Edge Detection:
Edge detection is a fundamental and essential tool in the fields of image processing and
computer vision. It is used to identify points in a digital image where there are sharp
changes in image brightness or where discontinuities occur. Edge detection is
particularly effective at pinpointing significant discontinuities in intensity values within
the image. Edges typically manifest as boundaries between two distinct regions in an
image, and they often hold vital information about the shapes and structures present.
Therefore, extracting edges is crucial for understanding the underlying features of an
image. In our research, we employ the Canny edge detection operator, a well-known
and widely used algorithm, to precisely identify and extract these edges, aiding in
various image analysis and computer vision tasks.

• Feature Recognition:
A feature, in the context of image and data analysis, refers to a region or part of an
object that exhibits specific and noteworthy geometric properties. The primary
objective of feature recognition is to mathematically extract higher-level features or
manufacturing data from lower-level entities, which could include elements like
surfaces, edges, and curves. This technology represents a pioneering approach in the
field, enabling the extraction of features and their associated parameters from solid

AIT / ISE / 2023-2024 8


Lung Cancer Disease Diagnosis Using Machine Learning Approach

models. The significance of feature recognition lies in its capacity to identify and group
topological entities, such as faces within a solid model, into functionally important
features. These features might include characteristics like ribs, holes, or slots. The use
of Convolutional Neural Networks (CNN) in this context is particularly noteworthy.
CNNs have the ability to identify relationships and connections within information and
acquire their knowledge through experience or training, as opposed to traditional
programming methods. This enables more robust and adaptive feature recognition in
complex data and image analysis tasks.

3.3. Algorithm
Convolutional Neural Networks (CNNs), a type of artificial neural network, have
gained significant prominence in the realm of computer vision, extending their appeal
to a wide array of domains, including radiology. CNNs are engineered to autonomously
and adaptively acquire spatial hierarchies of features through backpropagation,
employing various components like convolution layers, pooling layers, and fully
connected layer. Understanding the core principles and advantages of CNNs, as well as
their inherent limitations, is of utmost importance in harnessing their potential for
enhancing diagnostic radiology. The ultimate aim is to bolster the performance of
radiologists and elevate the quality of patient care. CNNs have revolutionized the field
of computer vision and have been applied to a wide range of applications, from image
recognition to medical image analysis and autonomous driving.

CNN consist of following 4 layers as shown in Fig.2

Figure 3.2 Block Diagram of CNN

AIT / ISE / 2023-2024 9


Lung Cancer Disease Diagnosis Using Machine Learning Approach

• Convolutional Layer:
Within the convolutional layer, a crucial operation involves the extraction of distinct
features at the pixel level through the use of feature detectors or kernels. Multiple
convolutions are performed on the input data, each employing a different filter. This
process yields a set of diverse feature maps. Subsequently, all these feature maps are
combined to produce the final output of the convolutional layer. This mechanism allows
the network to learn and represent various features from the input data.

• Pooling:
Max pooling is a technique employed to reduce training time and address overfitting
issues in the network. It accomplishes this by selecting and extracting the highest pixel
value from a feature that needs to be extracted. Max pooling helps in downsizing the
spatial dimensions of feature maps while retaining the most important information,
improving network efficiency and generalization.

• Flattening:
Flattening is the process of converting the pooled feature maps, which are in a 3D
format, into a single vector or column. This flattened vector serves as the input for the
next layer, effectively transforming the data from a 3D structure into a 1D format, the
flattening layer simplifies the data structure, allowing the CNN to transition from spatial
feature extraction to the final classification or regression tasks typically handled by the
fully connected layers.

• Fully Connected Layer


The fully connected layers are a type of neural network layer where each neuron in one
layer is connected to every neuron in another layer, akin to a multi-layer perceptron
(MLP). These layers receive inputs from the preceding feature analysis step and
introduce weighted connections to make predictions for the correct label. The output of
this process constitutes the final classification decision.

AIT / ISE / 2023-2024 10


Lung Cancer Disease Diagnosis Using Machine Learning Approach

3.4 Convolutional Neural Network Approach

Image handling utilizing Convolutional neural systems(CNN) has been utilized in


different fields, for example, facial recognition, analyzing documents, historic and
environmental collections, understanding climate, drug discovery, video analysis,
advertising etc.

Figure 3.3 Data Flow Diagram of Lung Nodule

CT scan images are considered for this work. Due to high complexity of lung cancer
pathology images, predicting patient outcome from lung cancer is still very challenging
and will require large amount of data for model developments. So, current focus is on
developing patient outcome prediction models based on image features extracted based

AIT / ISE / 2023-2024 11


Lung Cancer Disease Diagnosis Using Machine Learning Approach

on deep learning classification using CT scan images. Eventually, using such deep
learning techniques, it makes easy to process various formats of data, such as imaging
and economic features.

Image preprocessing is a vital stage in the image analysis pipeline and involves several
key steps, such as reading the image, resizing, noise reduction, and applying various
morphological operations. The primary objective of image preprocessing in the context
of Convolutional Neural Networks (CNN) is to enhance, restore, or correct images. This
process aims to eliminate unwanted distortions and improve image quality, making it
more suitable for subsequent analysis. Feature extraction is another essential step,
focused on isolating relevant entities from the image while discarding less pertinent
ones. This is often performed to reduce the dimensionality of the data. Feature
extraction techniques may encompass the identification of geometric traits such as
edges, corners, and junctions, or the extraction of specific attributes like facial features.
The result is a more compact and informative representation of the image.

Segmentation pertains to the division of an image into distinct regions or segments,


which is particularly valuable in tasks where isolating specific objects or areas of
interest is essential.
Recognition is the final stage and encompasses the identification and detection of
objects or features within an image. This process involves determining what the
identified objects or features are and their respective locations within the image.
Overall, these stages together form a comprehensive approach to image analysis,
enabling the extraction of meaningful information and knowledge from visual data.

AIT / ISE / 2023-2024 12


Lung Cancer Disease Diagnosis Using Machine Learning Approach

Chapter 4

RESULT ANALYSIS

4.1. Lung cancer stage evaluation

Table 1 shows the stage of evaluation. Staging is a way of describing where the cancer
is located, if or where it has spread, and whether it is affecting other parts of the body.

Table 4.1. Lung cancer stage evaluation

To establish the cancer's stage, doctors conduct various diagnostic tests, and staging
might not be finalized until all tests are completed. Understanding the stage is crucial
for guiding treatment decisions and offering insights into a patient's prognosis, which
pertains to the likelihood of recovery. It's important to note that different types of cancer
have distinct stage descriptions. In the case of non-small cell lung cancer (NSCLC), a
lower stage number is generally associated with a more favorable outcome. However,
it's important to emphasize that no doctor can accurately predict an individual's life
expectancy based solely on the cancer's stage. This is due to the fact that lung cancer
varies from person to person, and the response to treatment can differ based on the
specific characteristics of each tumor.

AIT / ISE / 2023-2024 13


Lung Cancer Disease Diagnosis Using Machine Learning Approach

• Stage I
A stage I lung cancer is a small tumor that has not spread to any lymph nodes. Stage I
is divided into 2 substages based on the size of the tumor:Stage IA tumors are 3
centimeters (cm) or less in size. Stage IA tumors may be further divided into IA1, IA2,
or IA3 based on the size of the tumor. Stage IB tumors are more than 3 cm but 4 cm or
less in size.

• Stage II
Stage II lung cancer is divided into 2 substages .A stage IIA cancer describes a tumor
larger than 4 cm but 5 cm or less in size that has not spread to the nearby lymph nodes.
Stage IIB lung cancer describes a tumor that is 5 cm or less in size that has spread to
the lymph nodes within the lung, called the N1 lymph nodes. A stage IIB cancer can
also be a tumor more than 5 cm wide that has not spread to the lymph nodes.

• Stage III
Stage III lung cancers are classified as either stage IIIA, IIIB, or IIIC. The stage is based
on the size of the tumor and which lymph nodes the cancer has spread to. Stage III
cancers have often spread extensively to the lymph nodes, but have not spread to other
distant parts of the body.
If stage III NSCLC is suspected, the doctor will want to make sure the cancer has not
spread to other parts of the body. For this evaluation, ASCO recommends a physical
examination, assessment of the patient’s medical history, a CT scan of the chest and
upper abdomen, and a PET-CT scan and MRI of the brain (see Diagnosis). For some
people, lymph nodes may also need to be tested for cancer by endoscopy or surgery. A
team of cancer care specialists generally work together to recommend the most
appropriate treatment plan based on the stage and other characteristics of the cancer as
well as other medical conditions the patient may have.
For many stage IIIA and stage IIIB cancers, it may be difficult, or sometimes
impossible, to remove the tumor completely with surgery alone. Stage IIIC cancers, in
general, cannot be removed with surgery and may need to be treated with a combination
of chemotherapy and radiation followed by immunotherapy.

AIT / ISE / 2023-2024 14


Lung Cancer Disease Diagnosis Using Machine Learning Approach

• Stage IV
Stage IV means the lung cancer has spread to more than 1 area in the other lung, the
fluid surrounding the lung or the heart, or distant parts of the body through the
bloodstream. Once cancer cells get into the blood, the cancer can spread anywhere in
the body. But, NSCLC is more likely to spread to the brain, bones, liver, and adrenal
glands. Stage IV NSCLC is divided into 2 substages. Stage IVA cancer has spread
within the chest and/or has spread to 1 area outside of the chest. Stage IVB has spread
outside of the chest to more than 1 place in 1 organ or to more than 1 organ.

4.2 Model Testing of Image data

In the experimental setup, in Table no.2, it shows total number of 1000 images of lung
cancer. The project contains mainly Lung cancer images type 1, Lung cancer images
type 2, Lung
cancer images type 3, and Lung cancer images type 4.

Table 4.2 Model Testing of Image data

AIT / ISE / 2023-2024 15


Lung Cancer Disease Diagnosis Using Machine Learning Approach

4.3. Classification Result

The graph 1 represents final classification result. The graph shows that total numbers
of images of lung cancer type 1 were 210, total numbers of images of lung cancer type
2 were 175,
total numbers of images of lung cancer type 3 were 385and total number of images of
lung cancer type 3 were 230.

Graph 4.3 Total number of cancer type

AIT / ISE / 2023-2024 16


Lung Cancer Disease Diagnosis Using Machine Learning Approach

Chapter 5

CONCLUSIONS

5.1. Conclusion

In the past, the diagnosis of lung cancer involved a series of time-consuming tests,
often subjecting patients to unnecessary check-ups and procedures. This process not
only consumed valuable time but also resulted in patients undergoing tests that may not
have been essential. To streamline the diagnostic process and reduce unnecessary
testing, it is crucial to introduce a preliminary screening test that can inform both the
patient and the doctor about the likelihood of lung cancer.

Today, machine learning algorithms have emerged as indispensable tools for predicting
and classifying medical data with a high degree of accuracy. These algorithms play a
vital role in improving diagnostic precision, minimizing errors, and enhancing the
overall efficiency of the diagnostic process. Additionally, incorporating advanced pre-
processing techniques can further boost the accuracy of these predictive models.

By combining the power of machine learning algorithms and enhanced pre-processing


methods, it becomes possible to provide patients and healthcare professionals with a
more efficient, accurate, and timely assessment of the likelihood of lung cancer. This
approach represents a significant advancement in the field of medical diagnostics, with
the potential to improve patient outcomes and reduce unnecessary testing and anxiety.

AIT / ISE / 2023-2024 17


Lung Cancer Disease Diagnosis Using Machine Learning Approach

REFERENCES

[1] Yut ongXie,, “ Knowledge-based Collaborat ive Deep Learning forBenign


Malignant Lung Nodule Classificat ion on Chest CT ” ,2018,IEEE .

[2] Vijayakumar, T. "Classification of Brain Cancer Type using MachineLearning."


Journal of Artificial Intelligence 1, no. 02 (2019): 105-113.

[3] LilikAnifah, Haryanto, RinaHarimurt i, “ Cancer lung det ect ion on CTScan image
using ANN backpropagation based gray level co occurrencemat rix feat ure.” 978-1-
5386-3172-0/17/ 2017 IEEE .

[4] P rof. Anuradha Deshpande, Dhanesh Lokhande, “ Lung cancer detectionwith


fusion of CT and MRI image using image processing.” (IJARCET)Volume 4 Issue 3,
March 2015

[5] RachidSammouda, “ Segmentation and analysis of CT chest images forearly lung


cancer detect ion.” Global Summit on Comput er &Informat ion Technology 978-1-
5090-2659-3/17 2017 IEEE.

[6] KwetisheJoroDanjuma, ” Performance Evaluation of Machine Learning Algorithms


in Post-operative Life Expectancy in the Lung Cancer Patients” Department of
Computer Science, ModibboAdama University of Technology, Yola, Adamawa State,
Nigeria

[7] Zehra Karhan1, Taner Tunç2, ”Lung Cancer Detection and Classification with
Classification Algorithms” IOSR Journal of Computer Engineering (IOSR-JCE) e-
ISSN: 2278-0661,p-ISSN: 22788727, Volume 18, Issue 6, Ver. III (Nov.-Dec. 2016),
PP 71-77

AIT / ISE / 2023-2024 18


Lung Cancer Disease Diagnosis Using Machine Learning Approach

[8] Ada, RajneetKaur, ” A Study of Detection of Lung Cancer Using Data Mining
Classification Techniques ” International Journal of Advanced Research in Computer
Science and Software Engineering, Volume 3, Issue 3, March 2021.

[9] YutongXie,, “Knowledge-based Collaborative Deep Learning for Benign Malignant


Lung Nodule Classification on Chest CT” ,2018, IEEE .

[10] Vijayakumar, T. "Classification of Brain Cancer Type using Machine Learning."


Journal of Artificial Intelligence 1, no. 02 (2019): 105-113.

[11] LilikAnifah, Haryanto, RinaHarimurti, “Cancer lung detection on CT Scan image


using ANN backpropagation based gray level co occurrence matrix feature.” 978-1-
5386-3172-0/17/ 2017 IEEE .

[12] Prof. Anuradha Deshpande, Dhanesh Lokhande, “Lung cancer detection with
fusion of CT and MRI image using image processing.” (IJARCET) Volume 4 Issue 3,
March 2015 .

[13] RachidSammouda, “Segmentation and analysis of CT chest images for early lung
cancer detection.” Global Summit on Computer & Information Technology 978-1-
5090-2659-3/17 2017 IEEE.

[14] Abbas Khosravi, Amin Khatami, “Lung cancer classification using deep learned
features on low population dataset.” Canadian Conference on Electrical and Computer
Engineering (CCECE) 978-1-5090-5538- 8/17 2017 IEEE .

AIT / ISE / 2023-2024 19

You might also like