
PROJECT SYNOPSIS

ON

Predicting the Probability of Lung Cancer in an Individual.

Submitted By:

Abhishek Kumar Pandey – 11500218062


Deepak Karamchandani – 11500218046
Divya Kumari – 11500219060
Soujit Saha – 11500218009

Under the guidance of

Dr. Sabnam Sengupta

MAULANA ABUL KALAM AZAD UNIVERSITY OF TECHNOLOGY


(Formerly known as WBUT)

B.P. PODDAR INSTITUTE OF MANAGEMENT AND TECHNOLOGY


PODDAR VIHAR, 137 VIP ROAD

KOLKATA-52

Signature of the Guide


CONTENTS

∙ ACKNOWLEDGEMENT
∙ ABSTRACT

∙ OBJECTIVE

∙ INTRODUCTION

∙ PROPOSED WORK

∙ FUTURE SCOPE

∙ SOFTWARE REQUIREMENTS

∙ CONCLUSION

∙ REFERENCES

ACKNOWLEDGEMENT

We would like to express our gratitude to Dr. Sabnam Sengupta for giving us the opportunity to
work on this project. We would also like to thank everyone who has helped us with it.

OBJECTIVE

This project aims to build a data model that predicts the probability of lung cancer in an individual.
In addition, we aim to help people detect lung cancer in its early stages, which might eventually save
their lives.

ABSTRACT

The occurrence of lung cancer has increased rapidly, and it has become the most common cancer in men
in most countries. Lung cancer accounts for around 1,095,000 new cancer cases and 951,000 deaths each
year in men, and 514,000 cases and 427,000 deaths in women. Together, that is about 1,609,000 new cases
and 1,378,000 deaths each year, representing about 12.7% of all new cancer cases and 18.2% of cancer
deaths. These numbers are alarming, which is why we have chosen this topic, with the aim of helping
people detect lung cancer in its early stages, which might eventually save their lives.

INTRODUCTION

The importance of early diagnosis of lung cancer is obvious, but diagnosis is costly in developing
countries. Therefore, this study proposes a lung cancer risk prediction system, based on the most common
risk factors, that is cost-effective and easy to use.
Initially, data on cancer and non-cancer patients were collected from different diagnostic centres. Data
were taken from male and female patients aged between 20 and 70 years.
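
The age-based selection described above could be sketched in Python as follows. The record layout (`age`, `sex`, `diagnosed`), the sample values, and the inclusive treatment of the 20 and 70 boundaries are assumptions made for illustration; the synopsis does not specify them.

```python
# Hypothetical patient records; field names and values are illustrative only.
records = [
    {"age": 18, "sex": "F", "diagnosed": False},
    {"age": 20, "sex": "M", "diagnosed": False},
    {"age": 45, "sex": "F", "diagnosed": True},
    {"age": 70, "sex": "M", "diagnosed": True},
    {"age": 82, "sex": "M", "diagnosed": True},
]

MIN_AGE, MAX_AGE = 20, 70  # assumed to be inclusive bounds


def in_study_range(record):
    """Return True when the patient's age falls inside the study range."""
    return MIN_AGE <= record["age"] <= MAX_AGE


# Keep only patients inside the age range, then split by diagnosis.
selected = [r for r in records if in_study_range(r)]
cancer_group = [r for r in selected if r["diagnosed"]]
non_cancer_group = [r for r in selected if not r["diagnosed"]]
```

Splitting the selected records into cancer and non-cancer groups at this stage keeps the later pattern analysis straightforward.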

PROPOSED WORK

The proposed work will proceed as follows:

∙ First, we analyse data from already diagnosed lung cancer patients and detect
patterns with the help of certain data mining algorithms.
∙ Then we visualize the processed data and look for insights in it.
∙ Next, we build a data model from the processed data and map each factor
to a risk value.
∙ Finally, we ask the user to enter their lifestyle details and predict their likelihood of
developing lung cancer using the model built in the previous step.
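
The last two steps could look roughly like the following minimal sketch. It assumes logistic regression as the data model, which the synopsis does not name. The factor names and the toy records are hypothetical, made up purely to make the example runnable, and are not real patient data.

```python
import math

# Hypothetical lifestyle factors (not from the synopsis); each is 0 or 1.
FACTORS = ["smoker", "passive_smoke", "air_pollution", "family_history"]

# Made-up toy records for illustration only: (factor values, diagnosed label).
TOY_RECORDS = [
    ([1, 1, 1, 1], 1),
    ([1, 0, 1, 0], 1),
    ([1, 1, 0, 0], 1),
    ([0, 0, 1, 1], 1),
    ([0, 1, 0, 0], 0),
    ([0, 0, 1, 0], 0),
    ([0, 0, 0, 1], 0),
    ([0, 0, 0, 0], 0),
]


def sigmoid(z):
    """Map a real-valued score to a probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_risk_weights(records, epochs=2000, lr=0.1):
    """Fit logistic-regression weights with batch gradient descent."""
    n = len(records[0][0])
    m = len(records)
    weights = [0.0] * n
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n
        grad_b = 0.0
        for x, y in records:
            score = bias + sum(w * xi for w, xi in zip(weights, x))
            err = sigmoid(score) - y
            for i, xi in enumerate(x):
                grad_w[i] += err * xi
            grad_b += err
        weights = [w - lr * g / m for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / m
    return weights, bias


def predict_probability(answers, weights, bias):
    """Turn a user's answers (factor name -> 0/1) into a probability."""
    x = [answers[name] for name in FACTORS]
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


weights, bias = fit_risk_weights(TOY_RECORDS)
risk_values = dict(zip(FACTORS, weights))  # one learned risk value per factor

user = {"smoker": 1, "passive_smoke": 0, "air_pollution": 1, "family_history": 0}
probability = predict_probability(user, weights, bias)
```

Here each learned weight plays the role of a factor's risk value, and the user's answers are combined with those weights to give a probability. A real system would need far more data, validation on held-out records, and possibly a different model.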

FUTURE SCOPE

In the future, we plan to introduce more factors that might affect a person's probability of developing
lung cancer, in order to make the prediction more accurate. We also plan to develop computer-aided
programs that are better able to identify cancer in CT scans than radiologists or pathologists.

SOFTWARE REQUIREMENTS

∙ Python 3.8
∙ Keras
∙ Matplotlib
∙ Plotly
∙ NumPy
∙ Django
∙ OpenC

HARDWARE REQUIREMENT

∙ Processor: minimum i3 (i5 recommended)

∙ RAM: minimum 4 GB

CONCLUSION

Large numbers of people in the world have cancer, and many of them do not even know they have it.
There is no remedy for cancer once a patient is completely affected, so the ability to predict cancer
plays an important role in the diagnosis process. In this synopsis we have proposed a cancer
prediction system based on data mining. We have described an approach for extracting
significant patterns from a data warehouse in order to predict cancer efficiently. The
proposed method is implemented using Java. The proposed method is intended to predict the risk of
cancer efficiently and successfully.

REFERENCES

• Amorim R, Mirkin B (2012). Minkowski metric, feature weighting and anomalous cluster
initializing in K-Means clustering. Pattern Recognition, 45, 1061-75.

• Ben-Haim Y, Tom-Tov E (2010). A streaming parallel decision tree algorithm. J Machine
Learning Res, 11, 849-72.

• Sapon MA, Ismail K, Zainudin S (2011). Prediction of diabetes by using artificial neural
network. 2011 International Conference on Circuits, System and Simulation, 7, 299-303.

• https://www.cancer.gov/types/lung/research
