Professional Documents
Culture Documents
• There are multiple variables in the dataset which may/may not be the
symptoms getting cancer like pollution, lung diseases, smoking, chest
pain, dry cough, swallowing difficulty, obesity, genetic risk, weight
loss and many more
• The dataset contains 21 different variables and data of over 1000+
patients
TOOLS LIBRARIES
Seaborn
Scikit Learn
• Logistic Regression
• KNN (K- Nearest Neighbor)
• Decision Tree
• SVM (Support Vector Machine)
Decision Tree model is giving the best result for each fold