Support Vector Machine
Classification: Definition
• Given a collection of records (the training set)
  – Each record contains a set of attributes; one of the attributes is the class label.
• Find a model for the class attribute as a function of the values of the other attributes: q : X → Y.
• Goal: previously unseen records should be assigned a class as accurately as possible.
Classification Example
[Figure: training examples plotted in the (weight, height) plane, with the two classes separated by a line]
Training examples: {(x1, y1), …, (xl, yl)}
Linear classifier:
  q(x) = H if (w · x) + b ≥ 0
         J if (w · x) + b < 0
Decision boundary: (w · x) + b = 0
Linear Classifiers
f(x, w, b) = sign(w · x + b)
[Figure: linearly separable data; one marker denotes class +1, the other denotes class −1, with several candidate separating lines drawn through the data]
Any of these would be fine...
...but which is best?
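As an illustrative sketch (not part of the original slides), such a linear classifier can be written directly in NumPy; the weight vector w, bias b, and data below are made-up values for demonstration only.

    import numpy as np

    def linear_classifier(X, w, b):
        """Predict +1 / -1 labels with f(x) = sign(w . x + b)."""
        return np.sign(X @ w + b)

    # Hypothetical parameters and data, chosen only for illustration.
    w = np.array([1.0, -2.0])
    b = 0.5
    X = np.array([[2.0, 0.5], [0.0, 1.0], [1.0, 1.0]])
    print(linear_classifier(X, w, b))   # [ 1. -1. -1.]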
Support Vector Machine
[Figure: maximum-margin separating hyperplane, with x⁺ and x⁻ lying on the two margin boundaries and M = margin width]
Support vectors are the datapoints that the margin pushes up against.
What we know:
• w · x⁺ + b = +1
• w · x⁻ + b = −1
• w · (x⁺ − x⁻) = 2
Hence the margin width is M = (x⁺ − x⁻) · w / ‖w‖ = 2 / ‖w‖.
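A small numerical check of this geometry (an illustrative sketch; the vector w and bias b are arbitrary): two points lying on the boundaries w · x + b = +1 and w · x + b = −1 along the direction of w are exactly 2/‖w‖ apart.

    import numpy as np

    w = np.array([3.0, 4.0])               # ||w|| = 5 (made-up example)
    b = -1.0

    # Points on the two margin boundaries, taken along the direction of w.
    x_plus  = (1 - b) * w / np.dot(w, w)   # satisfies w.x + b = +1
    x_minus = (-1 - b) * w / np.dot(w, w)  # satisfies w.x + b = -1

    margin = np.dot(x_plus - x_minus, w) / np.linalg.norm(w)
    print(margin, 2 / np.linalg.norm(w))   # both print 0.4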
Linear SVM Mathematically
• Goal: 1) Correctly classify all training data:
  w · x_i + b ≥ +1 if y_i = +1
  w · x_i + b ≤ −1 if y_i = −1
  i.e.  y_i (w · x_i + b) ≥ 1 for all i
  2) Maximize the margin M = 2 / ‖w‖,
  which is the same as minimizing (1/2) wᵀw.
• We can formulate a Quadratic Optimization Problem and solve for w and b:
  Minimize  Φ(w) = (1/2) wᵀw
  subject to  y_i (w · x_i + b) ≥ 1  ∀i
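As a sketch of how this quadratic program could be solved numerically (assuming the cvxpy package is available; the toy dataset below is made up and linearly separable):

    import numpy as np
    import cvxpy as cp

    # Toy linearly separable data, for illustration only.
    X = np.array([[2.0, 2.0], [3.0, 3.0], [-1.0, -1.0], [-2.0, -1.0]])
    y = np.array([1.0, 1.0, -1.0, -1.0])

    w = cp.Variable(2)
    b = cp.Variable()

    # Hard-margin primal: minimize (1/2) w^T w  s.t.  y_i (w.x_i + b) >= 1
    objective = cp.Minimize(0.5 * cp.sum_squares(w))
    constraints = [cp.multiply(y, X @ w + b) >= 1]
    cp.Problem(objective, constraints).solve()

    print("w =", w.value, "b =", b.value,
          "margin =", 2 / np.linalg.norm(w.value))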
Linear SVM. Cont.
• Requiring the derivatives with respect to w,b to vanish yields:
  maximize  Σ_{i=1}^{m} α_i − (1/2) Σ_{i=1}^{m} Σ_{j=1}^{m} α_i α_j y_i y_j ⟨x_i, x_j⟩
  subject to:  Σ_{i=1}^{m} α_i y_i = 0
               α_i ≥ 0  ∀i
• KKT conditions yield:  for any α_i > 0,  b = y_i − ⟨w, x_i⟩
• Where:  w = Σ_{i=1}^{m} α_i y_i x_i
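A minimal numerical sketch of this dual (again assuming cvxpy and the same kind of toy data); the quadratic term α_i α_j y_i y_j ⟨x_i, x_j⟩ is written as ‖Σ_i α_i y_i x_i‖² so that cvxpy accepts it directly.

    import numpy as np
    import cvxpy as cp

    # Toy linearly separable data, for illustration only.
    X = np.array([[2.0, 2.0], [3.0, 3.0], [-1.0, -1.0], [-2.0, -1.0]])
    y = np.array([1.0, 1.0, -1.0, -1.0])
    m = len(y)

    alpha = cp.Variable(m)
    Xy = X * y[:, None]                    # row i is y_i * x_i

    # Dual: maximize sum_i alpha_i - (1/2) || sum_i alpha_i y_i x_i ||^2
    objective = cp.Maximize(cp.sum(alpha) - 0.5 * cp.sum_squares(Xy.T @ alpha))
    constraints = [alpha >= 0, y @ alpha == 0]
    cp.Problem(objective, constraints).solve()

    a = alpha.value
    w = Xy.T @ a                           # w = sum_i alpha_i y_i x_i
    i = int(np.argmax(a))                  # any index with alpha_i > 0
    b = y[i] - X[i] @ w                    # b = y_i - <w, x_i>
    print("alpha =", a, "w =", w, "b =", b)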
Linear SVM. Cont.
• The resulting separating function is:
  f(x) = Σ_{i=1}^{m} α_i y_i ⟨x_i, x⟩ + b
• If f(x) > 0, x is assigned the class label +1; otherwise −1.
Linear SVM. Cont.
• The resulting separating function is:
  f(x) = Σ_{i=1}^{m} α_i y_i ⟨x_i, x⟩ + b
• Notes:
  – The points with α_i = 0 do not affect the solution.
  – The points with α_i ≠ 0 are called support vectors.
  – The constraints y_i (w · x_i + b) ≥ 1 hold with equality only for the support vectors.
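Continuing the toy dual sketch above, the separating function can be evaluated using only the points with non-negligible α_i, i.e. the support vectors (the tolerance 1e-6 is an arbitrary numerical cutoff):

    import numpy as np

    def svm_decision(x_new, X, y, a, b, tol=1e-6):
        """f(x) = sum_i alpha_i y_i <x_i, x> + b, summing only over support vectors."""
        sv = a > tol                       # points with alpha_i != 0
        return float(np.sum(a[sv] * y[sv] * (X[sv] @ x_new)) + b)

    # Label assignment: +1 if f(x) > 0, else -1, e.g.
    # print(np.sign(svm_decision(np.array([1.5, 2.0]), X, y, a, b)))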
Non-linear SVMs: Feature spaces
• General idea: the original feature space can always be
mapped to some higher-dimensional feature space
where the training set is separable:
Φ: x → φ(x)
Non-linear SVM
• Note that the training data appears in the solution only through inner products.
• If we pre-map the data into a higher-dimensional (and sparser) space, we can get more separability and a richer family of separating functions.
• Explicitly computing such a pre-mapping, however, might make the problem computationally infeasible.
• We want to avoid the explicit pre-mapping and still have the same separation ability.
• If we have a simple function that operates on two training points and computes the inner product of their pre-mappings, then we achieve this better separation with no added cost.
The “Kernel Trick”
• The linear classifier relies on inner products between vectors: K(x_i, x_j) = x_iᵀx_j.
• If every datapoint is mapped into a high-dimensional space via some transformation Φ: x → φ(x), the inner product becomes:
  K(x_i, x_j) = φ(x_i)ᵀφ(x_j)
• A kernel function is a function that is equivalent to an inner product in some feature space.
• Example:
  2-dimensional vectors x = [x1 x2]; let K(x_i, x_j) = (1 + x_iᵀx_j)².
  Need to show that K(x_i, x_j) = φ(x_i)ᵀφ(x_j):
  K(x_i, x_j) = (1 + x_iᵀx_j)² = 1 + x_i1²x_j1² + 2 x_i1x_j1 x_i2x_j2 + x_i2²x_j2² + 2 x_i1x_j1 + 2 x_i2x_j2
  = [1  x_i1²  √2 x_i1x_i2  x_i2²  √2 x_i1  √2 x_i2]ᵀ [1  x_j1²  √2 x_j1x_j2  x_j2²  √2 x_j1  √2 x_j2]
  = φ(x_i)ᵀφ(x_j),  where φ(x) = [1  x1²  √2 x1x2  x2²  √2 x1  √2 x2]
• Thus, a kernel function implicitly maps data to a high-dimensional space (without the need to compute each φ(x) explicitly).
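A quick numerical check of this identity (an illustrative sketch, not part of the slides; the two test vectors are arbitrary):

    import numpy as np

    def phi(x):
        """Explicit feature map matching the kernel (1 + x.y)^2 in two dimensions."""
        x1, x2 = x
        return np.array([1, x1**2, np.sqrt(2) * x1 * x2, x2**2,
                         np.sqrt(2) * x1, np.sqrt(2) * x2])

    def K(x, y):
        return (1 + x @ y) ** 2

    xi = np.array([0.7, -1.2])
    xj = np.array([2.0, 0.5])
    print(K(xi, xj), phi(xi) @ phi(xj))    # the two values agree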
Mercer Kernels
• A Mercer kernel is a function k : X^d × X^d → R for which there exists a function φ : X^d → H such that:
  ∀ x, y ∈ X^d:  k(x, y) = ⟨φ(x), φ(y)⟩
• A function k(·,·) is a Mercer kernel if, for any function g(·) such that ∫ g(x)² dx < ∞, the following holds:
  ∫∫ g(x) g(y) k(x, y) dx dy ≥ 0
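A practical, finite-sample analogue of the Mercer condition is that the Gram matrix G_ij = k(x_i, x_j) on any finite set of points is positive semidefinite. A small illustrative check (the RBF kernel and random points below are only for demonstration):

    import numpy as np

    def rbf(x, y, gamma=0.5):
        return np.exp(-gamma * np.sum((x - y) ** 2))

    rng = np.random.default_rng(0)
    X = rng.normal(size=(20, 3))           # random sample points

    G = np.array([[rbf(xi, xj) for xj in X] for xi in X])
    print(np.linalg.eigvalsh(G).min() >= -1e-10)   # eigenvalues are (numerically) non-negative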
Commonly used Mercer Kernels
• Homogeneous Polynomial Kernels:  k(x, y) = ⟨x, y⟩^p
• Non-homogeneous Polynomial Kernels:  k(x, y) = (⟨x, y⟩ + 1)^p
• Radial Basis Function (RBF) Kernels:  k(x, y) = exp(−γ ‖x − y‖²)
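Illustrative implementations of these three kernels (the parameters p and gamma correspond to the symbols above; the default values are arbitrary):

    import numpy as np

    def poly_homogeneous(x, y, p=2):
        """k(x, y) = <x, y>^p"""
        return (x @ y) ** p

    def poly_nonhomogeneous(x, y, p=2):
        """k(x, y) = (<x, y> + 1)^p"""
        return (x @ y + 1) ** p

    def rbf(x, y, gamma=0.5):
        """k(x, y) = exp(-gamma * ||x - y||^2)"""
        return np.exp(-gamma * np.sum((x - y) ** 2))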
Solution of non-linear SVM
• The problem:
  maximize  Σ_{i=1}^{m} α_i − (1/2) Σ_{i=1}^{m} Σ_{j=1}^{m} α_i α_j y_i y_j k(x_i, x_j)
  subject to:  Σ_{i=1}^{m} α_i y_i = 0
               0 ≤ α_i ≤ C  ∀i
• The separating function:
  sgn(f(x)) = sgn( Σ_{i=1}^{m} α_i y_i k(x_i, x) + b )
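For completeness, a minimal sketch of training a non-linear (RBF-kernel) soft-margin SVM with scikit-learn, assuming that library is available; the dataset and parameter values are made up. Here C is the box constraint 0 ≤ α_i ≤ C and gamma is the RBF parameter.

    import numpy as np
    from sklearn.svm import SVC

    # Toy data that is not linearly separable: +1 inside the unit circle, -1 outside.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 2))
    y = np.where(np.linalg.norm(X, axis=1) < 1.0, 1, -1)

    clf = SVC(kernel="rbf", C=1.0, gamma=0.5)
    clf.fit(X, y)

    print("number of support vectors:", len(clf.support_))
    print("predictions:", clf.predict([[0.0, 0.0], [2.0, 2.0]]))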