Professional Documents
Culture Documents
ON
Submitted By:
KOLKATA-52
∙ ACKNOWLEDGEMENT
∙ ABSTRACT
∙ OBJECTIVE
∙ INTRODUCTION
∙ PROPOSED WORK
∙ FUTURE SCOPE
∙ REFERENCES
ACKNOWLEDGEMENT
We would like to express our gratitude towards Dr. Sabnam Sengupta for giving us the opportunity of
working on this project. We would also like to thank everyone who has helped us in this project.
OBJECTIVE
This project aims to build a data model which will predict the probability of lung cancer in an individual.
In addition to that we also aim to help people in diagnosing lung cancer in its early stages which might
eventually save their life.
ABSTRACT
The occurrence of lung cancer has increased rapidly and become the most common cancer in men in most
countries. Lung cancer accounts for around 1,095,000 new cancer cases and 951,000 deaths each year in
men, and 514,000 cases and 427,000 deaths in women, representing about 12.7% of all new cancer cases
each year and 18.2% of cancer deaths. These numbers are incredibly frightening which is why we’ve
chosen our topic with an aim of helping people in diagnosing lung cancer in its early stages which might
eventually save their life.
INTRODUCTION
The early diagnosis of Lung cancer is obvious but the diagnosis is costly in the developing countries.
Therefore based on different and most common risk factors of lung cancer a risk prediction system of
lung cancer is proposed in this study which will be cost effective and easy to use.
Initially cancer and non-cancer patients’ data were collected from different diagnostic centres. Data of
male and female patients whose age was between 20-70 years old are taken
PROPOSED WORK
Firstly we analyse data of already diagnosed lung cancer patients and detect
patterns with the help certain data mining algorithms.
Then we visualize the processed data and try to find insights from it.
Then we build a data model with the already processed data and map each factor
with a risk value.
Finally we ask the user to input their lifestyle details and predict their likelihood of
developing lung cancer based on the model made in the previous step.
FUTURE SCOPE
In the future, we are planning on introducing more factors which might affect the probability of a person
of developing lung cancer in order to make the prediction more accurate. Aside from that, we also plan on
using computer algorithms to create computer-aided programs that are better able to identify cancer in CT
scans than radiologists or pathologists.
SOFTWARE REQUIREMENT
python 3.8
Keras
Matplotlib
PlotLy
Numpy
Django
OpenC
HARDWARE REQUIREMENT
REFERENCES
• Amorim R, Mirkin B (2012). Minkowski metric, feature weighting and anomalous cluster
initializing in K-Means clustering. Pattern Recognition, 45,1061-75.
• https://www.cancer.gov/types/lung/research