Hinge Loss

Hinge loss is a loss function primarily used in support vector machines (SVMs) that penalizes incorrect predictions as well as correct predictions that lack confidence, with labels represented as -1 or 1. The loss is calculated as max(0, 1 - y * ŷ), where y is the actual label and ŷ is the prediction; it is zero whenever y * ŷ ≥ 1, i.e., when the prediction has the correct sign and clears the margin. Hinge loss is faster to compute than cross-entropy but may lead to degraded accuracy.

Hinge Loss

The 0/1 Loss
Consider y to be the actual label (-1 or 1) and ŷ to be the prediction.
Let's try to multiply the two together: y * ŷ
If the label is -1 and the prediction is -1:
(-1)(-1) = +1 → Positive
If we follow the graph, any positive will give us 0 loss.
If the label is +1 and the prediction is +1:
(+1)(+1) = +1 → Positive
If we follow the graph, any positive will give us 0 loss.

If the label is -1 and the prediction is +1:
(-1)(+1) = -1 → Negative
If we follow the graph, any negative will give us 1 loss.
If the label is +1 and the prediction is -1:
(+1)(-1) = -1 → Negative
If we follow the graph, any negative will give us 1 loss.
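As a minimal sketch of this 0/1 rule in Python (the function name zero_one_loss is illustrative, not from the slides), only the sign of the product y * ŷ matters:

def zero_one_loss(y, y_hat):
    # Positive product: signs match -> no loss. Negative product: signs differ -> loss of 1.
    return 0.0 if y * y_hat > 0 else 1.0

print(zero_one_loss(-1, -1))  # 0.0
print(zero_one_loss(+1, +1))  # 0.0
print(zero_one_loss(-1, +1))  # 1.0
print(zero_one_loss(+1, -1))  # 1.0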
Rather than penalizing every mistake with a flat 1, we make the penalization linear, proportional to the error.
What if we include a margin of 1? We can introduce confidence into the model: we keep optimizing until the prediction clears the margin, rather than accepting any positive product without penalty.

Margin
When signs match → (-)(-) = (+)(+) = + → correct classification and no loss
When signs don't match → (-)(+) = (+)(-) = - → wrong classification and loss
Consider the plot of 1 - x, where x = y * ŷ: the hinge clips this line at 0, so the loss shrinks linearly until x reaches 1 and stays at 0 beyond it.
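In place of the missing plot, this small Python sketch (the x values are chosen for illustration) tabulates max(0, 1 - x) for a few values of x = y * ŷ:

# The hinge is the line 1 - x clipped at 0.
for x in [-1.0, 0.0, 0.5, 1.0, 1.5]:
    print(x, max(0.0, 1.0 - x))
# x < 1  -> positive loss that grows linearly as x decreases
# x >= 1 -> loss is 0: the prediction has the correct sign and is past the margin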
Hinge Loss
• A margin-based loss, usually used for SVMs
• Used when labels are in {-1, 1}
• It penalizes not only wrong predictions, but also correct predictions which are not confident enough.
• Faster than cross-entropy, but accuracy may be degraded
For all samples: Σ max(0, 1 - y * ŷ)

Where y is the actual label (-1 or 1) and ŷ is the prediction.


The loss is 0 when y * ŷ ≥ 1, i.e., when the prediction has the correct sign and clears the margin of 1.
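A minimal NumPy sketch of the summed loss, assuming y holds labels in {-1, 1} and y_hat holds real-valued predictions (the function and variable names are illustrative):

import numpy as np

def hinge_loss(y, y_hat):
    # Element-wise max(0, 1 - y * ŷ), summed over all samples.
    return np.maximum(0.0, 1.0 - y * y_hat).sum()

y = np.array([-1.0, -1.0, 1.0, 1.0])
y_hat = np.array([-0.8, 0.3, 1.1, -1.0])
print(hinge_loss(y, y_hat))  # 0.2 + 1.3 + 0 + 2 = 3.5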

Consider the prediction when the actual label is -1:
max[0, 1 - (-1 * 0.3)] = max[0, 1.3] = 1.3 → Loss is high
max[0, 1 - (-1 * -0.8)] = max[0, 0.2] = 0.2 → Loss is low
max[0, 1 - (-1 * -1.1)] = max[0, -0.1] = 0 → No loss!
max[0, 1 - (-1 * -1)] = max[0, 0] = 0 → No loss!
max[0, 1 - (-1 * 1)] = max[0, 2] = 2 → Loss is very high
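These five cases can be reproduced with one line of Python per prediction (the prediction values are the ones from the slide):

y = -1
for y_hat in [0.3, -0.8, -1.1, -1.0, 1.0]:
    print(y_hat, max(0.0, 1.0 - y * y_hat))  # 1.3, ~0.2, 0.0, 0.0, 2.0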
For all samples: Σ max(0, 1 - y * ŷ)

Where y is the actual label (-1 or 1) and ŷ is the prediction.


The loss is 0 when y * ŷ ≥ 1, i.e., when the prediction has the correct sign and clears the margin of 1.

Consider the prediction when the actual label is +1:
max[0, 1 - (1 * -0.3)] = max[0, 1.3] = 1.3 → Loss is high
max[0, 1 - (1 * 0.8)] = max[0, 0.2] = 0.2 → Loss is low
max[0, 1 - (1 * 1.1)] = max[0, -0.1] = 0 → No loss!
max[0, 1 - (1 * 1)] = max[0, 0] = 0 → No loss!
max[0, 1 - (1 * -1)] = max[0, 2] = 2 → Loss is very high
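The same check for the +1 label, mirroring the sketch above (prediction values taken from the slide):

y = +1
for y_hat in [-0.3, 0.8, 1.1, 1.0, -1.0]:
    print(y_hat, max(0.0, 1.0 - y * y_hat))  # 1.3, ~0.2, 0.0, 0.0, 2.0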
