Support Vector Machine (SVM) Algorithm
SVM algorithms are effective because they find the maximum-margin separating
hyperplane between the different classes in the target feature.
Consider two independent variables, x1 and x2, and one dependent variable
that is either a blue circle or a red circle.
Linearly Separable Data points
From the figure above it is clear that there are multiple lines (our
hyperplane here is a line because we are considering only two input features,
x1 and x2) that segregate the data points, i.e., classify the red and blue
circles. So how do we choose the best line, or in general the best
hyperplane, that segregates our data points?
One reasonable choice as the best hyperplane is the one that represents the
largest separation or margin between the two classes.
Here we have one blue ball inside the region of the red balls. So how does
SVM classify the data? The blue ball among the red ones is an outlier of the
blue class. The SVM algorithm ignores such outliers and finds the best
hyperplane that maximizes the margin, which makes SVM robust to outliers.
For this type of data, SVM finds the maximum margin as with the previous data
sets, and in addition it adds a penalty each time a point crosses the margin.
The margins in such cases are called soft margins. With a soft margin, SVM
tries to minimize (1/margin) + λ(∑ penalty). Hinge loss is a commonly used
penalty: if there are no violations, the hinge loss is zero; if there are
violations, the hinge loss is proportional to the distance of the violation.
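As a concrete illustration of the hinge loss, here is a minimal sketch for a linear decision function w^T x + b; the particular vectors and labels are made-up values for this example.

Python

import numpy as np

def hinge_loss(w, b, X, y):
    # Average hinge loss of a linear classifier w^T x + b.
    # X: (n_samples, n_features), y: labels in {-1, +1}.
    # A point with y_i * (w^T x_i + b) >= 1 contributes zero loss;
    # otherwise the loss grows linearly with the size of the violation.
    margins = y * (X @ w + b)
    return np.mean(np.maximum(0.0, 1.0 - margins))

# Toy data: the third point violates the margin, so the loss is positive.
X = np.array([[2.0, 1.0], [-1.5, -0.5], [0.2, 0.1]])
y = np.array([1, -1, 1])
w = np.array([1.0, 1.0])
b = 0.0
print(hinge_loss(w, b, X, y))  # ~0.233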
Till now, we have been talking about linearly separable data (the groups of
blue balls and red balls are separable by a straight line). What should we do
if the data are not linearly separable?
Say our data is as shown in the figure above. SVM solves this by creating a
new variable using a kernel. For a point xi on the line, we create a new
variable yi as a function of its distance from the origin o. If we plot this,
we get something like what is shown below.
In this case, the new variable y is created as a function of distance from the
origin. A non-linear function that creates such a new variable is referred to
as a kernel.
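Here is a minimal sketch of that idea, assuming 1D data where one class surrounds the other around the origin (the numbers are made up for illustration): adding y_i = |x_i|, the distance from the origin, makes the two classes separable by a horizontal line in the new (x, y) plane.

Python

import numpy as np

# 1D points: the red class sits near the origin, the blue class lies
# farther out on both sides, so no single threshold on x separates them.
x_red = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])
x_blue = np.array([-4.0, -3.0, -2.5, 2.5, 3.0, 4.0])

def feature_map(x):
    # New variable y_i = distance from the origin.
    return np.column_stack([x, np.abs(x)])

# In the (x, y) plane, the line y = 1.75 now separates the two classes.
print(feature_map(x_red))
print(feature_map(x_blue))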
The equation of the linear hyperplane can be written as:

w^T x + b = 0
The vector w represents the normal vector to the hyperplane, i.e., the
direction perpendicular to the hyperplane. The parameter b represents the
offset, or the distance of the hyperplane from the origin along the normal
vector w.
The distance between a data point x_i and the decision boundary can be
calculated as:
d_i = (w^T x_i + b) / ||w||
where ||w|| represents the Euclidean norm of the normal vector w.
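For example, here is a short sketch of this distance computation; the hyperplane parameters w and b below are made-up illustrative values.

Python

import numpy as np

# Made-up hyperplane parameters for illustration.
w = np.array([2.0, -1.0])
b = 0.5

def distance_to_hyperplane(x, w, b):
    # Signed distance d_i = (w^T x_i + b) / ||w||.
    return (w @ x + b) / np.linalg.norm(w)

x_i = np.array([1.0, 3.0])
print(distance_to_hyperplane(x_i, w, b))  # negative: x_i lies on the -w side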
Optimization:
For the hard margin linear SVM classifier:

minimize_{w, b} (1/2) w^T w = minimize_{w, b} (1/2) ||w||^2
subject to: t_i (w^T x_i + b) ≥ 1 for i = 1, 2, 3, …, m
The target variable or label for the i-th training instance is denoted by
t_i. Here t_i = -1 for negative instances (when y_i = 0) and t_i = 1 for
positive instances (when y_i = 1). This is because we require a decision
boundary that satisfies the constraint: t_i (w^T x_i + b) ≥ 1.
For the soft margin linear SVM classifier, slack variables ζ_i measure how far
each point violates the margin:

minimize_{w, b, ζ} (1/2) w^T w + C ∑_{i=1}^{m} ζ_i
subject to: t_i (w^T x_i + b) ≥ 1 − ζ_i and ζ_i ≥ 0 for i = 1, 2, 3, …, m
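A brief sketch of how the slack penalty weight C appears in practice, using scikit-learn's SVC; the dataset is synthetic and the particular C values are arbitrary.

Python

from sklearn.datasets import make_blobs
from sklearn.svm import SVC

# Synthetic, slightly overlapping blobs, so some slack is unavoidable.
X, y = make_blobs(n_samples=100, centers=2, cluster_std=2.0, random_state=0)

# A small C tolerates more margin violations (softer, wider margin);
# a large C penalizes violations heavily (harder, narrower margin).
for C in (0.01, 1.0, 100.0):
    clf = SVC(kernel="linear", C=C).fit(X, y)
    print(f"C={C}: {len(clf.support_)} support vectors")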
The dual problem of the soft margin SVM is obtained by introducing Lagrange
multipliers α_i, one per training sample:

maximize_α: ∑_{i=1}^{m} α_i − (1/2) ∑_{i=1}^{m} ∑_{j=1}^{m} α_i α_j t_i t_j K(x_i, x_j)
subject to: 0 ≤ α_i ≤ C and ∑_{i=1}^{m} α_i t_i = 0

where K(x_i, x_j) is the kernel function. Once the optimal α_i are found, the
decision function for a new point x is

f(x) = ∑_{i=1}^{m} α_i t_i K(x_i, x) + b

and the offset b can be recovered from any support vector lying exactly on the
margin, for which t_i (w^T x_i + b) = 1, i.e., b = t_i − w^T x_i (since
t_i = ±1).
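To make the dual formulation concrete, the sketch below recovers f(x) = ∑ α_i t_i K(x_i, x) + b from a fitted scikit-learn SVC, whose dual_coef_ attribute stores the products α_i t_i for the support vectors; the dataset and the γ value are arbitrary choices for illustration.

Python

import numpy as np
from sklearn.datasets import make_moons
from sklearn.svm import SVC

# Synthetic two-class data that is not linearly separable.
X, y = make_moons(n_samples=200, noise=0.2, random_state=0)

gamma = 0.5
clf = SVC(kernel="rbf", gamma=gamma, C=1.0).fit(X, y)

def rbf_kernel(A, B, gamma):
    # K(a, b) = exp(-gamma * ||a - b||^2)
    sq_dists = ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-gamma * sq_dists)

# f(x) = sum over support vectors of (alpha_i * t_i) * K(x_i, x) + b.
K = rbf_kernel(clf.support_vectors_, X, gamma)
f_manual = (clf.dual_coef_ @ K + clf.intercept_).ravel()

# Matches scikit-learn's own decision_function up to numerical error.
print(np.allclose(f_manual, clf.decision_function(X)))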
Linear SVM: Linear SVMs use a linear decision boundary to separate the
data points of different classes. When the data can be precisely linearly
separated, linear SVMs are very suitable. This means that a single straight
line (in 2D) or a hyperplane (in higher dimensions) can entirely divide the
data points into their respective classes. A hyperplane that maximizes the
margin between the classes is the decision boundary.
Non-Linear SVM: Non-Linear SVM can be used to classify data when it
cannot be separated into two classes by a straight line (in the case of 2D).
By using kernel functions, nonlinear SVMs can handle nonlinearly
separable data. The original input data is transformed by these kernel
functions into a higher-dimensional feature space, where the data points
can be linearly separated. A linear decision boundary found in this
transformed space corresponds to a nonlinear decision boundary in the
original input space.
The SVM kernel is a function that takes a low-dimensional input space and
transforms it into a higher-dimensional space, i.e., it converts
non-separable problems into separable problems. It is mostly useful in
non-linear separation problems. Simply put, the kernel performs complex data
transformations and then finds the boundary that separates the data based on
the labels or outputs defined.
Linear: K(x_i, x_j) = x_i^T x_j
Polynomial: K(x_i, x_j) = (γ x_i^T x_j + r)^d
Gaussian RBF: K(x_i, x_j) = exp(−γ ||x_i − x_j||^2)
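As a sanity check of these formulas, the sketch below evaluates each kernel by hand and compares the result with scikit-learn's pairwise kernel helpers; the sample points and the γ, r, d values are arbitrary.

Python

import numpy as np
from sklearn.metrics.pairwise import linear_kernel, polynomial_kernel, rbf_kernel

# Two arbitrary sample points and arbitrary kernel parameters.
X = np.array([[1.0, 2.0], [3.0, -1.0]])
gamma, r, d = 0.5, 1.0, 3

K_lin = X @ X.T                                # x_i^T x_j
K_poly = (gamma * (X @ X.T) + r) ** d          # (gamma * x_i^T x_j + r)^d
sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
K_rbf = np.exp(-gamma * sq_dists)              # exp(-gamma * ||x_i - x_j||^2)

print(np.allclose(K_lin, linear_kernel(X)))
print(np.allclose(K_poly, polynomial_kernel(X, gamma=gamma, coef0=r, degree=d)))
print(np.allclose(K_rbf, rbf_kernel(X, gamma=gamma)))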
Advantages of SVM