Loss Functions
Neural networks use optimization strategies like stochastic gradient descent to
minimize the error of the model. The way we actually compute this error is
by using a Loss Function, which quantifies how well or how poorly the model is
performing.
Loss functions can be classified into two major categories depending upon
the type of learning task we are dealing with — Regression
losses and Classification losses.
In classification, we are trying to predict an output from a finite set of categorical
values, e.g. given a large dataset of images of handwritten digits, categorize
each one as one of the digits 0–9.
Regression, on the other hand, deals with predicting a continuous value: for
example, given the floor area, the number of rooms, and the size of the rooms,
predict the price of the room.
NOTE
n - Number of training examples.
i - ith training example in a data set.
y(i) - Ground truth label for ith training example.
y_hat(i) - Prediction for ith training example.
Regression Losses
1. Mean Square Error/Quadratic Loss/L2 Loss
Mathematical formulation:
MSE = (1/n) * Σ (y(i) - y_hat(i))^2, where the sum runs over the n training examples.
As the name suggests, mean square error is measured as the average of the squared
differences between the predictions and the actual observations.
It is only concerned with the average magnitude of the error, irrespective of its
direction. However, due to the squaring, predictions that are far away from the actual
values are penalized much more heavily than less deviated predictions.
# calculate mean squared error
def mean_squared_error(actual, predicted):
    sum_square_error = 0.0
    for i in range(len(actual)):
        sum_square_error += (actual[i] - predicted[i])**2.0
    mean_square_error = 1.0 / len(actual) * sum_square_error
    return mean_square_error
>>> from sklearn.metrics import mean_squared_error
>>> y_true = [3, -0.5, 2, 7]
>>> y_pred = [2.5, 0.0, 2, 8]
>>> mean_squared_error(y_true, y_pred)
0.375
2. Mean Absolute Error/L1 Loss
Mean absolute error, on the other hand, is measured as the average of sum of
absolute differences between predictions and actual observations. Like MSE, this as
well measures the magnitude of error without considering their direction.
MAE is more robust to outliers since it does not make use of square.
Mathematical formulation:
MAE = (1/n) * Σ |y(i) - y_hat(i)|, where the sum runs over the n training examples.
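A minimal from-scratch sketch, mirroring the mean_squared_error implementation above; the sklearn helper imported below computes the same quantity.

# calculate mean absolute error
def mean_absolute_error(actual, predicted):
    sum_abs_error = 0.0
    for i in range(len(actual)):
        sum_abs_error += abs(actual[i] - predicted[i])
    mean_abs_error = 1.0 / len(actual) * sum_abs_error
    return mean_abs_error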
>>> from sklearn.metrics import mean_absolute_error
>>> y_true = [3, -0.5, 2, 7]
>>> y_pred = [2.5, 0.0, 2, 8]
>>> mean_absolute_error(y_true, y_pred)
0.5
MAE loss is useful if the training data is corrupted with outliers (i.e. we
erroneously receive unrealistically huge negative/positive values in our
training environment, but not our testing environment).
Deciding which loss function to use
If the outliers represent anomalies that are important for business and should
be detected, then we should use MSE. On the other hand, if we believe that
the outliers just represent corrupted data, then we should choose MAE as
loss.
L1 loss is more robust to outliers, but its derivative is not continuous,
which makes finding the solution less efficient. L2 loss is sensitive to outliers, but
it gives a more stable, closed-form solution (obtained by setting its derivative to
zero). The short example below illustrates how differently the two react to a single
outlier.
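A minimal comparison on made-up data, where one target value is corrupted by an outlier:

import numpy as np

y_true = np.array([1.0, 2.0, 3.0, 4.0, 100.0])   # 100.0 is the corrupted outlier
y_pred = np.array([1.1, 1.9, 3.2, 3.9, 4.0])

mse = np.mean((y_true - y_pred) ** 2)      # ~1843.2, dominated by the single outlier
mae = np.mean(np.abs(y_true - y_pred))     # ~19.3, grows only linearly with it

print(mse, mae)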
3. Huber Loss
Mean Square Error (MSE) is useful when we want the model to learn from the
outliers in the dataset; Mean Absolute Error (MAE), on the other hand, is good at
ignoring them.
But in some cases, points that look like outliers should not be ignored, and yet
they should not be given high priority either. This is where Huber loss comes in.
Huber Loss = Combination of both MSE and MAE
Huber loss is quadratic (like MSE) when the error is small and linear (like MAE)
when the error is large:

L_delta(y, y_hat) = 0.5 * (y - y_hat)^2                   if |y - y_hat| <= delta
                    delta * |y - y_hat| - 0.5 * delta^2   otherwise

Here delta is a hyperparameter that defines the boundary between the MSE-like and
MAE-like regions; it is typically tuned iteratively to find the value that fits the
data best.
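A minimal NumPy sketch following the piecewise definition above (the function name and the default delta are just illustrative choices):

import numpy as np

def huber_loss(actual, predicted, delta=1.0):
    # quadratic (MSE-like) where |error| <= delta, linear (MAE-like) elsewhere
    error = np.asarray(actual, dtype=float) - np.asarray(predicted, dtype=float)
    quadratic = 0.5 * error ** 2
    linear = delta * (np.abs(error) - 0.5 * delta)
    return np.mean(np.where(np.abs(error) <= delta, quadratic, linear))

# With the values from the earlier examples, all errors fall in the quadratic region:
# huber_loss([3, -0.5, 2, 7], [2.5, 0.0, 2, 8])  ->  0.1875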
Classification Losses
1. Cross Entropy Loss
Cross-entropy loss is often simply referred to as “cross-entropy,”
“logarithmic loss,” “logistic loss,” or “log loss” for short.
It operates on predicted probabilities between 0 and 1 for a classification task.
Cross-entropy calculates the average difference between the predicted and actual
probability distributions.
Each predicted probability is compared to the actual class output value (0 or
1), and a score is calculated that penalizes the probability based on its
distance from the expected value. The penalty is logarithmic, giving a small
score for small differences (0.1 or 0.2) and a huge score for large
differences (0.9 or 1.0).
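For instance, if the true class is 1, predicting a probability of 0.9 costs
-log(0.9) ≈ 0.105, whereas predicting 0.1 costs -log(0.1) ≈ 2.303, so the penalty
grows sharply as the prediction moves away from the true label.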
This is the most common setting for classification problems. Cross-entropy
loss increases as the predicted probability diverges from the actual label.
Consider a 4-class classification task where an image is classified as either a
dog, cat, horse or cheetah.
Let us calculate the probability generated by the first logit after Softmax is
applied. Softmax maps a vector of logits z to probabilities:

p(i) = e^(z(i)) / Σ e^(z(j)), with the sum running over all classes j and e ≈ 2.718.
Softmax therefore converts the logits into probabilities. The purpose of the
cross-entropy is then to take these output probabilities (P) and measure their
distance from the truth values.
Cross-entropy is defined as

CE = - Σ y(c) * log(y_hat(c)), summed over all classes c,

where y(c) is the ground-truth probability for class c (1 for the correct class and
0 otherwise) and y_hat(c) is the predicted probability for class c.
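A rough numeric sketch of the dog/cat/horse/cheetah example; the logits here are made up, since the original figure values are not reproduced.

import numpy as np

# Made-up logits for one image, ordered as [dog, cat, horse, cheetah]
logits = np.array([3.2, 1.3, 0.2, 0.8])
y_true = np.array([1, 0, 0, 0])          # ground truth: dog (one-hot)

# Softmax turns the logits into probabilities that sum to 1
probs = np.exp(logits) / np.sum(np.exp(logits))
print(probs)           # approximately [0.78, 0.12, 0.04, 0.07]

# Cross-entropy: only the true class contributes to the sum
loss = -np.sum(y_true * np.log(probs))
print(loss)            # approximately 0.25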
Binary Cross-Entropy Loss
For binary classification, we have binary cross-entropy defined as

BCE = -(1/n) * Σ [ y(i) * log(y_hat(i)) + (1 - y(i)) * log(1 - y_hat(i)) ],
with the sum running over the n training examples.

Or it can be written per example as follows:

loss = -log(y_hat(i))        if y(i) = 1
loss = -log(1 - y_hat(i))    if y(i) = 0
Multi-class cross-entropy / categorical cross-entropy
We use multi-class cross-entropy for multi-class classification problems. Let’s
say we need to create a model that predicts the type/class of fruit. We have
three types of fruit (oranges, apples, lemons) in different containers.
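A small sketch of the idea, with hypothetical one-hot labels and predicted probabilities for the three fruit classes (the numbers are purely illustrative):

import numpy as np

# Classes ordered as [orange, apple, lemon]; labels are one-hot encoded
y_true = np.array([[1, 0, 0],                 # orange
                   [0, 1, 0],                 # apple
                   [0, 0, 1]])                # lemon
y_pred = np.array([[0.7, 0.2, 0.1],
                   [0.1, 0.8, 0.1],
                   [0.2, 0.2, 0.6]])

# Multi-class cross-entropy: average of -sum(y * log(y_hat)) over the examples
loss = -np.mean(np.sum(y_true * np.log(y_pred), axis=1))
print(loss)   # approximately 0.36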
>>> from sklearn.metrics import log_loss
>>> log_loss(["spam", "ham", "ham", "spam"],
... [[.1, .9], [.9, .1], [.8, .2], [.35, .65]])
0.21616...
from math import log

# calculate binary cross entropy
def binary_cross_entropy(actual, predicted):
    sum_score = 0.0
    for i in range(len(actual)):
        # term for the positive class (y = 1) plus term for the negative class (y = 0);
        # the small 1e-15 offset avoids taking log(0)
        sum_score += actual[i] * log(1e-15 + predicted[i]) \
            + (1 - actual[i]) * log(1e-15 + 1 - predicted[i])
    mean_sum_score = 1.0 / len(actual) * sum_score
    return -mean_sum_score
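Assuming spam is encoded as 1 and ham as 0, and using the predicted spam probabilities from the sklearn snippet above, this reproduces the same result:

>>> binary_cross_entropy([1, 0, 0, 1], [0.9, 0.1, 0.2, 0.65])
0.21616...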