Professional Documents
Culture Documents
• Conclusion
• References
Support Vector Machine- Classification
• Support vector machines (SVMs) are supervised machine learning algorithms which are used
both for classification and regression.
• In 1960s, SVMs were first introduced but later they got refined in 1990.
• Support vector machine is another simple algorithm that every machine learning expert
should have used.
• SVM can be used for both regression and classification tasks. But, it is widely used in
classification objectives.
Cont.
• The main task of the SVM algorithm is to create the best line or
decision boundary that can segregate n-dimensional space into
classes so that we can easily put the new data point in the correct
category in the future. And this best decision boundary is called a
hyperplane.
Types of SVM
•Linear SVM: Linear SVM is used for linearly separable data, which means if a dataset
can be classified into two classes by using a single straight line, then such data is termed
as linearly separable data, and classifier is used called as Linear SVM classifier.
2. Extend the above definition for non-linearly separable problems: have a penalty
term for misclassifications.
3. Map data to high dimensional space where it is easier to classify with linear
decision surfaces: reformulate problem so that data is mapped implicitly to this
space.
Terminology used in SVM
Support Vectors
Hyperplane
• For a classification task with only two features (consider previous slide example), you can
think of a hyperplane as a line that linearly separates and classifies a set of data.
• The hyperplane our data points lie, the more confident we are that they have been correctly
classified.
• We therefore want our data points to be as far away from the hyperplane as possible, while
still being on the correct side of it.
• So when new testing data is added, whatever side of the hyperplane it lands will decide the
class that we assign to it.
Cont.
• In the SVM algorithm, we plot each data item as a point in n-dimensional space
(where n is number of features you have) with the value of each feature being
the value of a particular coordinate.
Speech
Recognition
Pros & Cons of Support Vector Machines
Pros:
•Accuracy
Cons:
•Isn’t suited to larger datasets as the training time with SVMs can be high
• SVM is used for text classification tasks such as category assignment, detecting
spam and sentiment analysis.
• SVM also plays a vital role in many areas of handwritten digit recognition, such
as postal automation services.
Active learning
Query?
References
Han J & Kamber M, “Data Mining: Concepts and Techniques”, Third Edition, Elsevier, 2011.
https://www.analyticsvidhya.com/blog/2017/09/understaing-support-vector-machine-example-code/
https://www.javatpoint.com/machine-learning-support-vector-machine-algorithm
https://www.kdnuggets.com/2016/07/support-vector-machines-simple-explanation.html
https://www.saedsayad.com/support_vector_machine.htm
https://towardsdatascience.com/https-medium-com-pupalerushikesh-svm-f4b42800e989
Next Class Topic
Performance Analysis