Uploaded by

20pba216 Pavithra Meenakshi M

Cancer cell classification using Scikit-learn

This walkthrough uses the Breast Cancer Wisconsin (Diagnostic) dataset. The dataset
contains measurements of breast cancer tumors along with their classification
labels, viz., malignant or benign.

pip install scikit-learn

#Importing the necessary module and dataset.
# importing the Python module
import sklearn
# importing the dataset
from sklearn.datasets import load_breast_cancer

#Loading the dataset to a variable

data = load_breast_cancer()

#Organizing the data and looking at it.

label_names = data['target_names']
labels = data['target']
feature_names = data['feature_names']
features = data['data']

# looking at the data

print(label_names)
# each tumor record is labelled as either ‘malignant’ or ‘benign’.

print(labels)
# each label is encoded as a binary value, where 0 represents malignant tumors
# and 1 represents benign tumors.

print(feature_names)
# the 30 features (attributes) recorded for each tumor. The numerical values of
# these features are used to train the model and to predict whether a tumor is
# malignant or benign.

print(features)
# this is a large array containing the numerical values of the 30 attributes for
# all 569 tumor instances.
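
Printing the raw arrays is hard to read, so a quick summary can help confirm what was loaded. The following is a minimal sketch, not part of the original walkthrough; it assumes NumPy is available (it is installed alongside scikit-learn) and uses the hypothetical variable name class_counts.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer

data = load_breast_cancer()
features = data['data']
labels = data['target']
label_names = data['target_names']

# Shape is (number of tumor records, number of features).
print(features.shape)

# Count the records in each class (index 0 = malignant, index 1 = benign).
class_counts = np.bincount(labels)
for name, count in zip(label_names, class_counts):
    print(name, count)
```

The class counts show that the two classes are not equally represented, which is worth keeping in mind when reading an accuracy figure later.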

#Organizing the data into Sets.

# Split the data into two sets, viz., a training set and a test set. The training
# set is used to fit the model; the trained model then makes predictions on the
# unseen test set, and those predictions are used to evaluate it.

# importing the function

from sklearn.model_selection import train_test_split

# splitting the data

train, test, train_labels, test_labels = train_test_split(features, labels,
                                                          test_size = 0.33,
                                                          random_state = 42)

# The train_test_split() function randomly splits the data according to the
# test_size parameter. Here, 33% of the original data is held out as test data
# (test) and the remaining data (train) is the training data. The corresponding
# labels are stored in train_labels and test_labels. Fixing random_state makes
# the split reproducible.
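
To make the effect of test_size concrete, the sketch below repeats the split and prints the size of each set. The second call, with stratify=labels, is an optional variation and an assumption on my part, not something the original walkthrough uses; it asks scikit-learn to keep the class proportions roughly equal in both sets. The s_-prefixed names are hypothetical.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split

data = load_breast_cancer()
features, labels = data['data'], data['target']

# Same split as in the walkthrough: 33% of the rows go to the test set.
train, test, train_labels, test_labels = train_test_split(
    features, labels, test_size=0.33, random_state=42)
print(len(train), len(test))

# Optional variation (assumption): a stratified split keeps the proportion
# of benign and malignant records similar in the training and test sets.
s_train, s_test, s_train_labels, s_test_labels = train_test_split(
    features, labels, test_size=0.33, random_state=42, stratify=labels)
print(labels.mean(), s_test_labels.mean())  # fraction of benign records
```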

#Building the Model.

This model uses the Naive Bayes algorithm, which usually performs well in
binary classification tasks. First, import the GaussianNB class and create a
classifier by calling GaussianNB(). Then train the classifier by fitting it to the
training data using the fit() method.

# importing the module of the machine learning model

from sklearn.naive_bayes import GaussianNB

# initializing the classifier

gnb = GaussianNB()

# training the classifier

model = gnb.fit(train, train_labels)

# making the predictions

predictions = gnb.predict(test)

# printing the predictions

print(predictions)

# the predict() function returns an array of 0s and 1s. These values are the
# predicted tumor classes (0 = malignant, 1 = benign) for the test set.
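
Besides hard 0/1 predictions, GaussianNB can also report a probability for each class through predict_proba(). The sketch below is an illustrative addition under the same split as above; the variable name proba is hypothetical. Naive Bayes probabilities are often pushed close to 0 or 1, so they are better read as a rough ranking than as calibrated confidence.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

data = load_breast_cancer()
train, test, train_labels, test_labels = train_test_split(
    data['data'], data['target'], test_size=0.33, random_state=42)

gnb = GaussianNB()
gnb.fit(train, train_labels)

predictions = gnb.predict(test)

# One row per test record; column 0 = P(malignant), column 1 = P(benign).
proba = gnb.predict_proba(test)
print(proba[:5])

# The predicted class is the column with the higher probability.
print(proba[:5].argmax(axis=1), predictions[:5])
```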
# importing the accuracy measuring function
from sklearn.metrics import accuracy_score

# evaluating the accuracy

print(accuracy_score(test_labels, predictions))

On this test split, the classifier based on the Naive Bayes algorithm is 94.15%
accurate in predicting whether a tumor is malignant or benign.
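
Accuracy alone does not show which kind of mistake the classifier makes, and for a diagnostic task a malignant tumor predicted as benign is a different error from the reverse. As a hedged extension that the original walkthrough does not include, the sketch below uses confusion_matrix and classification_report from sklearn.metrics on the same split; the variable name cm is hypothetical.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

data = load_breast_cancer()
train, test, train_labels, test_labels = train_test_split(
    data['data'], data['target'], test_size=0.33, random_state=42)

gnb = GaussianNB()
predictions = gnb.fit(train, train_labels).predict(test)

# Rows are true classes, columns are predicted classes
# (index 0 = malignant, index 1 = benign).
cm = confusion_matrix(test_labels, predictions)
print(cm)

# Per-class precision, recall and F1 score.
print(classification_report(test_labels, predictions,
                            target_names=data['target_names']))

# The diagonal of the matrix holds the correct predictions, so
# diagonal sum / total equals the accuracy score.
print(cm.trace() / cm.sum(), accuracy_score(test_labels, predictions))
```

The entry cm[0][1] counts malignant tumors predicted as benign, which is usually the error of most concern in this setting.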

References
https://www.geeksforgeeks.org/ml-cancer-cell-classification-using-scikit-learn/
