Artificial Intelligence

Group Project:
Breast Cancer Classification
Group 4 – CityU7D
Tạ Thị Phương Anh
Nguyễn Đức Anh
Đoàn Lê Thiện Hảo
Hà Văn Nguyên
Nguyễn Việt Tùng
Table of contents

1. Introduction section
2. Dataset Description section
3. Methodology & Algorithm section
4. Evaluation section

1. Introduction section

We use the dataset to evaluate how well each model performs and, from that, select the best model.
This is a classification task because it classifies a diagnosis of breast cancer. The two classes are
benign and malignant.
2. Dataset Description section

The dataset has 569 rows and 33 columns. Attribute number 2 is in categorical form; the rest are in
numerical form.
1) ID number
2) Diagnosis (M = malignant, B = benign)
3 to 32) Ten real-valued features are computed for each cell nucleus:
a) radius (mean of distances from center to points on the perimeter)
b) texture (standard deviation of gray-scale values)
c) perimeter
d) area
e) smoothness (local variation in radius lengths)
f) compactness (perimeter^2 / area - 1.0)
g) concavity (severity of concave portions of the contour)
h) concave points (number of concave portions of the contour)
i) symmetry
j) fractal dimension ("coastline approximation" - 1)

The mean, standard error and "worst" or largest (mean of the three largest values) of these
features were computed for each image, resulting in 30 features. For instance, field 3 is Mean
Radius, field 13 is Radius SE, field 23 is Worst Radius.
The first attribute, ID, is in numeric form but is
only an identifier, so we remove it using the
drop command. Without the ID column, the
dataset still works normally and the models
are not disturbed by the ID number. For
features whose value ranges differ widely, we
separate out the disproportionate columns and
transform them using the standard deviation
method (standardization).
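A minimal sketch of these two preprocessing steps, using only the Python standard library. The column names and values are hypothetical, and we assume the "standard deviation method" means z-score standardization (subtract the column mean, divide by the column standard deviation); the slides do not spell this out.

```python
from statistics import mean, pstdev

# Hypothetical miniature records; names and values are made up for illustration.
rows = [
    {"id": 101, "radius_mean": 12.0, "area_mean": 450.0},
    {"id": 102, "radius_mean": 14.0, "area_mean": 600.0},
    {"id": 103, "radius_mean": 19.0, "area_mean": 1100.0},
]


def drop_column(records, name):
    """Return copies of the records without the given column."""
    return [{k: v for k, v in r.items() if k != name} for r in records]


def standardize(records, column):
    """Rescale one column to zero mean and unit (population) standard deviation.

    Assumes the column is not constant; a zero standard deviation would
    cause a division by zero.
    """
    values = [r[column] for r in records]
    mu, sigma = mean(values), pstdev(values)
    return [{**r, column: (r[column] - mu) / sigma} for r in records]


features = drop_column(rows, "id")          # the ID carries no diagnostic signal
for col in ("radius_mean", "area_mean"):    # columns with very different ranges
    features = standardize(features, col)

print(features[0])
```

After this step both columns are on the same scale, so a large-valued feature such as area no longer dominates a small-valued one such as radius.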

This dataset has a label for each data sample, so this is a supervised learning problem. There are no
missing values in this dataset, and there is no "noise" data either.
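As an illustration, the label column can be turned into numbers and the records checked for gaps as follows. The coding 1 = malignant, 0 = benign is the one used in the evaluation slides; the sample records and the helper names are hypothetical.

```python
# Coding taken from the evaluation slides: 1 = Malignant, 0 = Benign.
LABEL_CODES = {"M": 1, "B": 0}


def encode_labels(diagnoses):
    """Convert 'M'/'B' diagnosis strings into 1/0 integer labels."""
    return [LABEL_CODES[d] for d in diagnoses]


def has_missing(records):
    """Return True if any field in any record is None or an empty string."""
    return any(v is None or v == "" for r in records for v in r.values())


# Hypothetical sample records, for illustration only.
sample = [
    {"diagnosis": "M", "radius_mean": 18.0},
    {"diagnosis": "B", "radius_mean": 11.5},
    {"diagnosis": "B", "radius_mean": 12.3},
]

labels = encode_labels([r["diagnosis"] for r in sample])
print(labels)               # [1, 0, 0]
print(has_missing(sample))  # False
```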
In the United States, breast cancer is the
second leading cause of cancer death in women,
after lung cancer, but this rate is showing
signs of decreasing.
3. Methodology & Algorithm section
Logistic Regression, Decision Tree, Random Forest, and XGBoost were applied.

The main idea

Logistic Regression: we use it to get an output that can be transformed to return
a probability value.
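The transformation to a probability can be sketched with the logistic (sigmoid) function. The weights, bias and input below are hypothetical values chosen for illustration; they are not the fitted model from this project.

```python
import math


def sigmoid(z):
    """Map a real-valued score to a probability in (0, 1).

    Not numerically hardened: very large negative z would overflow math.exp.
    """
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(weights, bias, x):
    """Linear score (bias + w . x) passed through the sigmoid."""
    z = bias + sum(w * xi for w, xi in zip(weights, x))
    return sigmoid(z)


# Hypothetical parameters and one standardized sample.
weights = [0.8, -0.5]
bias = 0.1
p = predict_proba(weights, bias, [1.2, 0.4])

# Threshold at 0.5: 1 = Malignant, 0 = Benign.
label = 1 if p >= 0.5 else 0
print(round(p, 3), label)
```

The 0.5 threshold is a common default. Lowering it would trade more false alarms for fewer missed malignant cases.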
Decision Tree: we use a decision tree algorithm to classify the
output of the dataset.

Random Forest: we build multiple decision trees at
random and then sum (combine) their outputs.
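The combining step can be sketched as a majority vote over several trees. This is a toy illustration only: the three "trees" are hand-written single-split rules with hypothetical thresholds, whereas a real random forest trains each tree on a random sample of the rows and features. Majority voting is also our assumption about what "summing" means here.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the most common class label among the trees' predictions."""
    return Counter(predictions).most_common(1)[0][0]


# Hypothetical one-split "trees"; thresholds are made up for illustration.
def tree_a(x):
    return 1 if x["radius"] > 15.0 else 0


def tree_b(x):
    return 1 if x["area"] > 650.0 else 0


def tree_c(x):
    return 1 if x["concavity"] > 0.1 else 0


forest = [tree_a, tree_b, tree_c]
sample = {"radius": 16.0, "area": 700.0, "concavity": 0.05}

votes = [tree(sample) for tree in forest]
prediction = majority_vote(votes)
print(votes, "->", prediction)   # [1, 1, 0] -> 1
```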
We chose them because they are supervised learning
algorithms with high accuracy, and they share some
similarities.
4. Evaluation section

Accuracy is used when the true positives and
true negatives are more important.
Based on the selected evaluation metric (accuracy), our
best model received the highest score of 97.36%.
Logistic Regression

2 samples are wrongly predicted: actual is 1 (Malignant) ==> prediction is 0 (Benign)
1 sample is wrongly predicted: actual is 0 (Benign) ==> prediction is 1 (Malignant)
Decision Tree

4 samples are wrongly predicted: actual is 1 (Malignant) ==> prediction is 0 (Benign)
4 samples are wrongly predicted: actual is 0 (Benign) ==> prediction is 1 (Malignant)
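The error counts listed for Logistic Regression and Decision Tree can be turned into accuracy figures with a short calculation. The test-set size of 114 is an assumption (a 20% hold-out of the 569 rows); the slides do not state it.

```python
def accuracy(n_test, false_negatives, false_positives):
    """Accuracy = correctly predicted samples / all test samples."""
    correct = n_test - false_negatives - false_positives
    return correct / n_test


# ASSUMPTION: 114 test samples (20% of 569 rows); not stated in the slides.
N_TEST = 114

# Error counts as reported on the slides.
# fn: actual Malignant predicted Benign; fp: actual Benign predicted Malignant.
errors = {
    "Logistic Regression": {"fn": 2, "fp": 1},
    "Decision Tree": {"fn": 4, "fp": 4},
}

scores = {name: accuracy(N_TEST, e["fn"], e["fp"]) for name, e in errors.items()}
for name, score in scores.items():
    print(f"{name}: {score:.2%}")
# Logistic Regression: 97.37%
# Decision Tree: 92.98%
```

Under this assumption, Logistic Regression scores 111/114, about 97.37%, which matches the reported 97.36% if that figure was truncated rather than rounded.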
Random Forest

4 samples are wrongly predicted: actual is 1 (Malignant) ==> prediction is 0 (Benign)
XGBoost

3 samples are wrongly predicted: actual is 1 (Malignant) ==> prediction is 0 (Benign)
Thanks!
Any questions?
