You are on page 1of 23

Institute Code: 0141

Title of Micro project: “Application of SVM Algorithm for Cancer Detection ”

Academic Year: 2023-2024 Program Code:


AN Course: Advanced Algorithms in AI & ML Course Code: 22683

Submitted by:

Roll No Student Name Batch Sign of Student

2609 Sarang Jagdale AN6I

Under the Guidance of:

Ms.K.G.Jagtap

1
Institute Code: 0141

CERTIFICATE

Certified that this micro project report titled “Application of SVM Algorithm for cancer
detection ” is the bonafide work of Sarang Jagdale, Roll no 2609 of year Diploma in Artificial
intelligence and machine learning for the Course: Advanced Algorithms in AI & ML Course
code: 22683 during the Academic Year 2023-2024 , who carried out the micro project work under
my supervision.

Under the Guidance of:

Ms.K.G.Jagtap

2
ACKNOWLEDGEMENT

We would like to express our special thanks of gratitude to our teachers, who gave us opportunity
to do this wonderful micro project on the topic “ Application of SVM Algorithm for cancer
detection ” which also helped us in doing a lot of Research and we came to know about so many
new things we all really thankful to all who help us doing this micro project.
Secondly we would also like to thank our parents and friends who helped us a lot in
finalizing this project within the limited time frame.

Name Signature
Sarang Jagdale

3
ALL INDIA SHRI SHIVAJI MEMORIAL SOCIETY’S POLYTECHNIC, PUNE -1

VISION AND MISSION OF THE INSTITUTE

VISION AND MISSION OF THE AI & ML DEPARTMENT

4
ALL INDIA SHRI SHIVAJI MEMORIAL SOCIETY’S POLYTECHNIC, PUNE -1

Artificial intelligence and machine learning Department

PROGRAM OUTCOMES (POs)

PO1 Basic and Discipline specific knowledge: Apply knowledge of basic


mathematics, science and engineering fundamentals and engineering specialization
to solve the engineering problems.

PO2 Problem analysis: Identify and analyze well-defined engineering problems using
codified standard methods.

PO3 Design/ development of solutions: Design solutions for well-defined technical


problems and assist with the design of systems components or processes to meet
specified needs.

PO4 Engineering Tools, Experimentation and Testing: Apply modern engineering


tools and appropriate technique to conduct standard tests and measurements.

PO5 Engineering practices for society, sustainability and environment: Apply


appropriate technology in context of society, sustainability, environment and
ethical practices.

PO6 Project Management: Use engineering management principles individually, as a


team member or a leader to manage projects and effectively communicate about
well-defined engineering activities.

PO7 Life-long learning: Ability to Analyze individual needs and engage in updating in
the context of technological changes.

PROGRAM SPECIFIC OUTCOMES (PSO)

Students will be able to:

PSO1: Apply computing knowledge with standard practices to develop software.


PSO2: Maintain Computer Hardware and Software System.

5
INDEX

Sr. No. Content Page No.

1. Title 1

2. Certificate 2

3. Acknowledgement 3

4. Annexure I 7

5. Annexure II 10

6. Annexure III 17

7. Annexure IV 19

8. Log Book 20

9. Rubrics Used for Evaluation 21

10. Evaluation Sheet 22

6
Annexure-I

Micro-Project Proposal
Title of the Project: “ Application of SVM Algorithm for cancer detection ”

Aims/Benefits of the Micro-Project: To make Application of SVM Algorithm for cancer


detection .

Benefits :

Overall, SVM is a powerful tool for cancer detection due to its ability to handle high-dimensional
data, robustness to overfitting, effectiveness in handling non-linear relationships, optimal margin
properties, implicit feature selection, and well-established theoretical foundations. However, like any
machine learning algorithm, its performance depends on the quality of the data, feature selection, and
appropriate parameter tuning.

1.0 Course Outcomes Addressed: C22683

1. Analyze the data using different learning methods.


2. Use different classification techniques.

2.0 Proposed Methodology:

1. Arrangement of groups and representatives for groups that are not usually represented
aspartners in main projects.
2. Capacity building and networking in relation to the role as partners in micro projects.
3. Collected materials related to project.
4. Support development of more need and user driven projects.
5. Contribute to the maximum requirements of project.
6. An eligible project idea addressing one of the four Priority Axes and a work plan for a
micro project including a description of how the capacity building and networking should
take place.
7. The project involves maximum three partners. From three partners, the contributions of
micro project are distributed.
8. An eligible Lead member who will guide the group members and analyzed the data.
9. Eligible match finding the proper information.
10. Softcopy corrections by respective teachers.
11. Completion of the micro project properly.
12. Final copy and submission.

7
4.0 Action Plan

Sr. Details of Activity Planned Planned Name of Responsible


No Start date Finish date Team Members
.
1. Introduction to Micro-project: Study for 07/08/23 11/08/23 Sarang Jagdale
selecting Micro project topic

2. Introduction to Micro-project: 11/08/23 14/08/23 Sarang Jagdale


Discussion about selected Micro project
topic with concerned Course Teacher

3. Introduction to Micro-project: Finalize and 14/08/23 21/08/23 Sarang Jagdale


Study for selected topic

4. Drafting Proposals 21/08/23 25/08/23 Sarang Jagdale

5. Proposal submission 25/08/23 29/08/23 Sarang Jagdale

6. Micro project Proposal Presentation 29/08/23 01/09/23 Sarang Jagdale

7. Making Changes in presentation, if suggested 01/09/23 05/09/23 Sarang Jagdale


by concerned teacher

8 Executing Micro-Project: Study from 05/09/23 11/09/23 Sarang Jagdale


different resources

9. Executing Micro-Project: Collect 11/09/23 15/09/23 Sarang Jagdale


information from studied resources

10. Executing Micro-Project: Arrange 15/09/23 20/09/23 Sarang Jagdale


collected information

11. Executing Micro project 20/09/23 22/09/23 Sarang Jagdale

12. Drafting Methodology 22/09/23 26/09/23 Sarang Jagdale

13. Drafting Literature Review 26/09/23 29/09/23 Sarang Jagdale

14. Drafting Result, Discussion 29/09/23 03/10/23 Sarang Jagdale

15. Micro project Presentation 03/10/23 13/10/23 Sarang Jagdale

16. Micro Project final submission 13/10/23 13/10/23 Sarang Jagdale

8
Resources Required

Sr. Name of Specifications Qty Remarks


No. Resources/material .

1. Computer System Computer(i3-i7)preferable RAM minimum 2GB 1


and onwards
2. Operating System Windows-11 1
3. Printer Hp printer 1

Names of Team Members with Roll No. :

Roll No Student Name

2609 Sarang Jagdale

9
Annexure-II

Micro-Project Report

Title of the Project: “ Application of SVM Algorithm for cancer detection ”

Rationale:

Overall, SVM is a powerful algorithm for classification tasks, especially when the data
is high-dimensional or when there's a need for a clear separation between classes.
However, SVM may not perform well on very large datasets due to its computational
complexity, and parameter tuning can be crucial for achieving optimal performance.

Aims/Benefits of the Micro-Project: To make Application of SVM Algorithm for cancer


detection .

Benefits:
Overall, SVM is a powerful tool for cancer detection due to its ability to handle high-dimensional data,
robustness to overfitting, effectiveness in handling non-linear relationships, optimal margin properties,
implicit feature selection, and well-established theoretical foundations. However, like any machine
learning algorithm, its performance depends on the quality of the data, feature selection, and
appropriate parameter tuning.

3.0 Course Outcomes Addressed: C22683

1. Analyze the data using different learning methods.


2. Use different classification and regression techniques.

Literature Review:

1. https://www.geeksforgeeks.org/support-vector-machine-algorithm/
2. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5822181/

1
ABSTRACT

Support Vector Machine (SVM) is a powerful machine learning algorithm used for linear or nonlinear
classification, regression, and even outlier detection tasks. SVMs can be used for a variety of tasks, such as
text classification, image classification, spam detection, handwriting identification, gene expression analysis,
face detection, and anomaly detection. SVMs are adaptable and efficient in a variety of applications because
they can manage high-dimensional data and nonlinear relationships.

SVM algorithms are very effective as we try to find the maximum separating hyperplane between the
different classes available in the target feature.

Support Vector Machine (SVM) is a supervised machine learning algorithm used for both classification and
regression. Though we say regression problems as well it’s best suited for classification. The main objective
of the SVM algorithm is to find the optimal hyperplane in an N-dimensional space that can separate the data
points in different classes in the feature space.

1
Applications


Gene Expression Classification:
SVM can be applied to gene expression data for classifying different types of cancer. By analyzing the
expression levels of thousands of genes, SVM can help identify patterns that distinguish between
cancerous and non-cancerous tissues.


Protein Sequence Analysis:
SVM can be used to analyze protein sequences and predict whether certain mutations or variations are
associated with cancer. This is particularly relevant in understanding the role of specific proteins in cancer
development.


Imaging Data Analysis:
SVM is applied to medical imaging data, such as mammograms, CT scans, or MRI images, to assist in the
early detection and classification of tumors. SVM can help in distinguishing between benign and malignant
tumors based on the features extracted from the images.


Classification of Cancer Subtypes:
SVM can be employed to classify different subtypes of cancers. For instance, it can distinguish between
different types of breast cancer, lung cancer, or leukemia based on various molecular markers or clinical
features.


Identification of Biomarkers:
SVM can be used to identify potential biomarkers associated with cancer. By analyzing high-dimensional
biological data, SVM can identify relevant features that contribute to the classification of cancer samples, a
iding in the discovery of new diagnostic markers.


Drug Response Prediction:
SVM can be utilized to predict the response of cancer patients to specific treatments. By analyzing
molecular and clinical data, SVM models can help identify patients who are likely to respond positively or
negatively to certain therapeutic interventions.


Early Detection of Cancer:
SVM models can be trained on diverse datasets to detect cancer at early stages. Early detection is crucial
for improving treatment outcomes, and SVM's ability to handle complex relationships in the data makes it
valuable in this context.

1
2.0 Actual Methodology Followed:

Sr. No./ Date Work Done


Hour No.
1. 11/08/23 Finalize the Topic
2. 14/08/23 Distribution of Work
3. 21/08/23 Distribution of Topic
4. 25/08/23 Collecting Images
5. 29/08/23 Starting animation
6. 01/09/23 Completing animation
7. 05/09/23 Creating a Word Document
8. 11/09/23 Inserting information
9. 15/09/23 Arranged the Information
10. 20/09/23 Proofread the Information
11. 22/09/23 Editing the Word Document
12. 26/09/23 Review from the Teacher
13. 29/09/23 Editing the Project Report as per Teacher’s suggestion
14. 03/10/23 Proofread and Finalize the Report
15. 13/10/23 Finalize the report
16. 13/10/23 Final submission of the Report

1
INTRODUCTION

3.0 Actual resources Used

Sr. Name of Specifications Qty. Remarks


No. Resources/material

1. Computer System Computer(i3-i7)preferable RAM 1


minimum 2GB and onwards
2. Operating System Windows (7-11) 1
3. Printer Hp printer 1

4.0 Code of the Micro-Project :


import numpy as np
import matplotlib.pyplot as plt
from sklearn import datasets
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

cancer_data = datasets.load_breast_cancer()
X = cancer_data.data
y = cancer_data.target

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)

svm_classifier = SVC(kernel='linear', C=1)


svm_classifier.fit(X_train_scaled, y_train)

y_pred = svm_classifier.predict(X_test_scaled)

accuracy = accuracy_score(y_test, y_pred)


conf_matrix = confusion_matrix(y_test, y_pred)
classification_rep = classification_report(y_test, y_pred)
print(f"Accuracy: {accuracy:.2f}")
print("\nConfusion Matrix:")
print(conf_matrix)
print("\nClassification Report:")
print(classification_rep)

1
Output of the Micro-Project:

1
Skill Developed/Learning outcome of this Micro-Project

1. Demonstrate working as a leader/team member.

2. Communication really go hand in glove with leadership.

3. How to create a project schedule.

4. How to manage time.

5. Decision making skill.

6. Problem solving

7. Planning and strategy skill


.

Applications of this Micro-Project:

 Early Detection of Cancer


 Gene Expression Classification:
 Protein Sequence Analysis
 Imaging Data Analysis

1
Annexure - III

Suggested Rubric for Assessment of Micro Project

(The marks may be allotted to the characteristics of the Micro-Project by considering the
suggested rubrics)

S. Characteristics to Poor Average Good Excellent


No be assessed (Marks 1-3) (Marks 4-5) (Marks 6-8) (Marks 9-10)
1. Relevance to the Related to very Related to some Addressed Addressed more
Course few LOs Los at-least one CO than one CO
2. Literature Not more than At-least 5 relevant At-least 7 About 10
Review/informatio two sources sources, at least 2 relevant relevant sources,
n collection (Primary and latest sources, most most latest
Secondary), very latest
old reference
3. Completion of Completed less Completed 50 to Completed 60 to Completed more
Target as per than 50% 60% 80% than 80%
Project
proposal
4. Analysis of Data Sample Size all, Sufficient and Sufficient and Enough data
and representation data neither appropriate sample, appropriate collected by
organized nor enough data sample, enough sufficient and
presented well generated but not data generated appropriate
organized and not which is sample size.
well presented organized and Proper
well. No or poor presented well. inferences drawn
inferences drawn But poor by organizing
inferences and presenting
drawn data through
tables, charts and
graphs
5. Quality of Incomplete Just assembled/ Well a Just Well a Just
prototype/Model fabrication/ fabricated and parts assembled/ assembled/
assembly are not functioning fabricated with fabricated with
well. Not in proper proper proper
shape, dimensions functioning functioning
beyond tolerance parts. In proper parts. in proper
limit. Appearance/ shape, within shape, within
finish is shabby. tolerance tolerance
dimensions and dimensions and
good finish. But good finish/
no creativity in appearance.
design and use Creativity in
of material design and use
of material.

1
6. Report Very short, poor Nearly sufficient Detailed, correct Very detailed,
Preparation quality sketches, and correct details and clear
Details about about methods, description of correct, clear
methods, materials, methods, description of
materials, precautions and materials, methods,
Precautions and conclusion. But precautions and materials,
Conclusions clarity is not there conclusion. precautions and
omitted, some in presentation. Sufficient conclusion.
details are But not enough graphic Enough tables,
wrong. graphic description description charts and
sketches

7. Presentation of Major Includes major Includes major Well organized,


the Micro-Project information is information but not information but includes major
not included; well organized not not well information,
information is presented well. organized not presented well.
not well presented well.
organized.

8. Viva Could not reply Replied to Replied Replied most of


to considerable considerable properly the questions
number of number of considerable properly
question questions nut not number of
very properly question.

1
Annexure IV
Micro Project Evaluation
Sheet
Name of Student: Sarang Jagdale Enrollment No: 2101410027
Name of Program: AIML Semester: 6th semester
Course Title: Advanced Algorithms in AI & ML Code: 22683
Title of the Micro-project: “Application of SVM Algorithm for cancer detection

3.0 Course Outcomes Achieved : C22683
C22683.a. Immplement processing steps on data to make it ready for analysis
C22683.c. Describe Support Vector Machines and K Nearest Neighbors.

Sr. Characteristic to be Poor Average Good Excellent Sub


No. accessed (Marks (Marks (Marks 6-8) (Marks 9-10) Total
1-3) 4-5)
Process and Product Assessment (Convert above total marks out of 6 Marks)
1 Relevance to the
course
2 Literature
Review/information
collection
3 Completion of the
Target as per
project proposal
4 Analysis of Data and
representation
5 Quality of the
Prototype/Model
6 Report Preparation
Individual Presentation/ Viva (Convert above total marks out of 4 Marks)
7 Presentation
8 Viva
(A) (B) Total Marks
Process and Product Assessment Individual Presentation/ Viva 10
(6 Marks) (4 Marks)

Comments/ suggestions about Team work/ Leadership/Inter-Personal communication (If any)


Name and Designation of the Teacher: …………………………………….
1
Dated Signature………………………………………………………………

2
Annexure IV

Log Book of the Student (Hourly Work Report)


Academic Year: 2023-2024

Title of the Project: “Application of SVM Algorithm for cancer detection ”


Course: Advanced Algorithms in AI & ML Course Code: 22683
Semester: 6th
Sr. No./ Date Time Work Done
Hour No.
1. 11/08/23 05-06 PM Study for selecting Micro project topic

2. 14/08/23 05-06 PM Discussion about selected Micro project topic with


concerned Course Teacher
3. 21/08/23 05-06 PM Finalize and Study for selected topic

4. 25/08/23 05-06 PM Drafting Proposals

5. 29/08/23 05-06 PM Proposal submission

6. 01/09/23 05-06 PM Micro project Proposal Presentation

7. 05/09/23 05-06 PM Making Changes in presentation, if suggested by


concerned teacher
8. 11/09/23 05-06 PM Study from different resources

9. 15/09/23 05-06 PM Collect information from studied resources

10. 20/09/23 05-06 PM Arrange collected information

11. 22/09/23 05-06 PM Executing Micro project

12. 26/09/23 05-06 PM Drafting Methodology

13. 29/09/23 05-06 PM Drafting Literature Review

14. 03/10/23 05-06 PM Drafting Result, Discusser

15. 13/10/23 01-03 PM Micro project Presentation

16. 13/10/23 01-03 PM Micro Project final submission

Under the Guidance of:

Ms.K.G.Jagtap

2
Annexure IV
Rubrics Used for Evaluation of a Micro Project

Program/Semester /Master: AN6I


Course/Course code: Advanced
Algorithms in AI & ML
Course Code: 22683
Title of the Micro project: “Application of SVM Algorithm for cancer detection ”
Assessment of micro project based on rubrics for performance in group activity :( Marks to
be given out of 06)
Assessment of performance in individual presentation/Viva of micro project: (Marks to be
given out of 04)
Scale used for assessment: Poor (1-3), Average (4-5), Good (6-8), Excellent (9-10)
A) Process and Product Assessment (A):

Rubric No. Characteristics to be assessed Marks Obtained out of 10


1 Relevance to course
2 Literature review/information collection
3 Completion of target as per project proposal
4 Analysis of data and representation
5 Quality of prototype/model
6 Report Preparation
Total Out of (60)
Process and Product Assessment (A) :Total Out of (06)

B) Individual Presentation/Viva(B)

Rubric Rubric Individual


Individual
7 8 Presentati
Presentatio
on/Viva
n/Viva
Individ (Convert Total
Individ (Addition
ual out of 08 (A+B)
RollNo. EnrollmentNo. ual ofmarks in
Name of Student Present marks
Viva Rubric 7 to
ation intoout of
8)
4)
(B)
Marks Marks Mark
Marks Marks
out of out of s out
10 10 outof 20 outof 04 of 10
2609 2101410027 Sarang Jagdale

Under the Guidance of

Ms.K.G.Jagtap

2
Evaluation Sheet for the Micro Project

Academic Year: 2023-2024 Name of Faculty Ms.K.G.Jagtap


Course: Advanced Algorithms in AI & ML Course Code: 22683
th
Semester: 6
Title of the Project: “Application of SVM Algorithm for cancer detection ”

CO’s addressed by the Micro Project: C22683


C22683.a. Immplement processing steps on data to make it ready for analysis
C22683.c. Describe Support Vector Machines and K Nearest Neighbors.

Major Learning Outcomes achieved by students by doing the project

Outcomes in Affective Domain :-


1) Follow safety.
2) Demonstrate working as a leader/team member
3) Follow ethical practices.

Comments/Suggestions about team work/leadership/inter-personal communication (if any)

Roll No. Student Name Marks out of (6) Marks out of (4) Total out
for performance for performance in of (10)
in group activity oral / presentation
2609 Sarang Jagdale

(Dated Signature of Faculty)

You might also like