You are on page 1of 4

MINI PROJECT

On
BREAST CANCER CLASSIFICATION

GOKHALE EDUCATION SOCIETY’S R. H. SAPAT COLLEGE


OF ENGINEERING, NASHIK

DEPARTMENT OF COMPUTER ENGINEERING

PRESENTED BY

Aishwarya Yogesh Chandwadkar (BE A 30)


Atharva Nitin Chandwadkar (BE A 31)
Kalyani Yogesh Desai (BE A 39)
Sarvesh Uday Joshi (BE A 71)
Aim :
BREAST CANCER PREDICTION USING DEEP LEARING

Problem Statement
The problem statement for breast cancer classification using deep learning is to develop a model
that can accurately classify breast lesions as malignant or benign using medical imaging data,
such as mammograms or ultrasound images. The model should be able to detect subtle patterns
and features in the images that are indicative of cancer and use them to make accurate
predictions.

Objective:
The goal is to create a highly accurate and reliable model that can aid radiologists in the
diagnosis of breast cancer, reducing the number of false positives and false negatives. This can
improve patient outcomes by allowing for earlier detection and more targeted treatment plans.

Software and Hardware Requirements:


Software Requirement:
i Operating system : windows/ Ubuntu 64 bit

ii Jupyter Notebook

Hardware Requirement :
i Computer system with Ram 4 GB or 8 GB

ii HDD : 500GB or 1TB

Theory :
1.1 Deep Learning
Deep learning is a subset of machine learning that uses artificial neural networks to learn
and make predictions from complex data. It is inspired by the structure and function of the
human brain, where neural networks are made up of interconnected layers of neurons that
work togetherto learn and solve problems.

1
Deep learning algorithms can be used for a wide range of applications, such as image and
speech recognition, natural language processing, and predictive modeling. They have shown
impressiveresults in many fields, including healthcare, finance, and autonomous vehicles.

1.2 Deep Neural Network

Deep neural networks (DNNs) are a type of artificial neural network that have multiple hidden
layers between the input and output layers. These layers allow for the network to learn more
complex representations of the data, enabling it to solve more complex problems.
DNNs can be used for a wide range of applications, including image and speech recognition,
natural language processing, and predictive modeling. The architecture of DNNs allows for
the automatic extraction of hierarchical features from the data, which can lead to better
performance compared to traditional machine learning models.
Training a DNN requires a large amount of labeled data and significant computational
resources. This is because the network needs to adjust the weights of all the neurons in all the
layers to minimize the error between its predicted output and the actual output. However, once
trained, DNNs can make highly accurate predictions on new, unseen data.

Methodology:
The Breast Cancer Wisconsin (Diagnostic) dataset is used in this project, which consists of
569 instances of tumors with 30 features for each instance. The data is preprocessed by
removing duplicates, handling missing values, and normalizing the data. Feature extraction is
performed to reduce the dimensionality of the dataset. A neural network with one hidden layer
is trained on the preprocessed data using the Keras deep learning library. The performance of
the neural network is evaluated using accuracy.

2
Results:

The results show that the neural network achieves an accuracy of 96% on the test set. The accuracy
of the neural network on the training set is 97%. These results indicate that the neural network can
effectively classify breast cancer tumors as either malignant or benign.

Conclusion:
In this project, we have demonstrated the use of neural networks in deep learning for breast cancer
classification. The results show that the neural network achieves high accuracy and can effectively
classify breast cancer tumors as either malignant or benign. The use of deep learning techniques
can potentially improve the accuracy and reliability of breast cancer diagnosis, and further research

is needed to validate the results on larger datasets.

You might also like