How Neural Networks and Backpropagation Work
We have this input data:

    Feature 1    Feature 2
    0.5          -0.5
    0.3          0.4
    0.7          0.9

We wish to map it to:

    Feature 1    Feature 2
    0.9          0.1
    0.9          0.9
    0.1          0.1
Let’s take our first sample: input (0.5, −0.5), with target output (0.9, 0.1).
Consider this neural network (example taken from Neural Networks: A Classroom Approach by Satish Kumar):

[Network diagram: a 2-2-2 fully connected network.
Inputs: X1 = 0.5, X2 = −0.5.
Hidden layer: weights from (X1, X2) to hidden neuron 1 are (0.1, −0.2) with bias 0.01; weights to hidden neuron 2 are (0.3, 0.55) with bias −0.02.
Output layer: weights from the hidden neurons to output neuron 1 are (0.37, 0.9) with bias 0.31; weights to output neuron 2 are (−0.22, −0.12) with bias 0.27.
Desired outputs: d1 = 0.9, d2 = 0.1.]
Let’s start by moving forward (the forward pass)
The net value is the total input coming to the neuron.

Net value of the first neuron in the hidden layer:

z1 = x1(0.1) + x2(−0.2) + bias
z1 = 0.5(0.1) + (−0.5)(−0.2) + 0.01
z1 = 0.16
Net value of the second neuron in the hidden layer:

z2 = x1(0.3) + x2(0.55) + bias
z2 = 0.5(0.3) + (−0.5)(0.55) + (−0.02)
z2 = −0.145
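As a quick sanity check, here is a minimal Python sketch of these two net-value computations (variable names such as x1 and z1 are just for illustration):

```python
# Net values of the two hidden neurons for the first sample (x1, x2) = (0.5, -0.5).
x1, x2 = 0.5, -0.5

z1 = x1 * 0.1 + x2 * (-0.2) + 0.01     # weights 0.1 and -0.2, bias 0.01  -> 0.16
z2 = x1 * 0.3 + x2 * 0.55 + (-0.02)    # weights 0.3 and 0.55, bias -0.02 -> -0.145

print(round(z1, 3), round(z2, 3))      # 0.16 -0.145
```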
The activation of the neuron

Activation scales the input value (the net value) to a value from 0 to 1. For example, the sigmoidal function:

δ(z) = 1 / (1 + e^(−λz)), where z is the input (net) value. For simplicity, we will consider the slope λ = 1.

Activating the two neurons at the hidden layer:

δ(z1) = 1 / (1 + e^(−0.16)) = 0.5399
δ(z2) = 1 / (1 + e^(0.145)) = 0.4638
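A small sketch of this activation step; the function name sigmoid and the slope argument lam are my own labels, with lam fixed to 1 as assumed above:

```python
import math

def sigmoid(z, lam=1.0):
    # Sigmoid activation: squashes the net value into the range (0, 1); lam is the slope.
    return 1.0 / (1.0 + math.exp(-lam * z))

a1 = sigmoid(0.16)      # activation of the first hidden neuron,  ~0.5399
a2 = sigmoid(-0.145)    # activation of the second hidden neuron, ~0.4638
print(round(a1, 4), round(a2, 4))
```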
Let’s continue with the output neurons. Now, the hidden neurons’ outputs become the inputs to the neurons of the output layer.
Net value of the first output neuron:

y1 = 0.5399(0.37) + 0.4638(0.9) + 0.31
y1 = 0.9271

Similarly, for the second output neuron:

y2 = 0.5399(−0.22) + 0.4638(−0.12) + 0.27
y2 = 0.0955
Now, activating the output neurons:

δ(y1) = 1 / (1 + e^(−0.9271)) = 0.7164
δ(y2) = 1 / (1 + e^(−0.0955)) = 0.5238

So the actual outputs of the network for this sample are 0.7164 and 0.5238.
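Putting the whole forward pass for this sample into one sketch (all weights and biases are the ones from the diagram; the helper names are illustrative, and tiny differences from 0.7164 / 0.5238 come only from rounding):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# First sample and the hidden-layer activations computed above.
x1, x2 = 0.5, -0.5
a1 = sigmoid(x1 * 0.1 + x2 * (-0.2) + 0.01)    # ~0.5399
a2 = sigmoid(x1 * 0.3 + x2 * 0.55 - 0.02)      # ~0.4638

# Output layer: the hidden activations are the inputs of the output neurons.
y1 = a1 * 0.37 + a2 * 0.9 + 0.31               # net value ~0.9271
y2 = a1 * (-0.22) + a2 * (-0.12) + 0.27        # net value ~0.0955

# Approximately 0.716 and 0.524, matching the values above up to rounding.
print(round(sigmoid(y1), 4), round(sigmoid(y2), 4))
```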
Definition of Backpropagation
• A method to train the neural network by adjusting the weights of its neurons in order to reduce the output error.
Gradient Descent
• The base algorithm used to minimize the error with respect to the weights of the neural network. The learning rate determines the step size of the update used to reach the minimum.
• An epoch is one complete pass through all the samples.
https://www.learnopencv.com/understanding-activation-functions-in-deep-learning/
https://sebastianraschka.com/faq/docs/closed-form-vs-gd.html
The Backpropagation

Remember, our objective is to minimize the error by changing the weights.

Gradient descent: we move in the direction opposite to the derivative (opposite to the slope).

Negative slope: when we increase w, the loss is decreasing. −(−) = +, so the weight increases (moving right).

Positive slope: when we increase w, the loss is increasing. −(+) = −, so the weight decreases (moving left).

Weight Update Rule:

w ← w − η (dE/dw)

where w is the old weight, η is the learning rate (how fast we update the weights; in other words, the step size of the update), dE/dw is the gradient, and the minus sign makes the step go against the slope.

https://towardsdatascience.com/gradient-descent-in-a-nutshell-eaf8c18212f0
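To see the sign behaviour concretely, here is a minimal one-dimensional sketch. The loss E(w) = (w − 3)² is an illustrative choice of mine, not from the slides: starting left of the minimum the slope is negative and w increases, starting right of it the slope is positive and w decreases, and both runs settle near the minimum.

```python
def dE_dw(w):
    # Derivative of the illustrative loss E(w) = (w - 3)**2, whose minimum is at w = 3.
    return 2.0 * (w - 3.0)

eta = 0.1  # learning rate: the step size of the update

for w in (1.0, 5.0):               # start left, then right, of the minimum
    for _ in range(50):
        w = w - eta * dE_dw(w)     # w <- w - eta * dE/dw
    print(round(w, 4))             # both runs approach 3.0
```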
Local Minimum and Global Minimum

Convex optimization: one minimum, which is both the local and the global minimum.
Non-convex optimization: one or more local minima in addition to the global minimum (multiple local minima).

Image credits: https://www.oreilly.com/radar/the-hard-thing-about-deep-learning/
We need dE/dw, but along the chain w → net (z) → activation (a) → error (E) there is no w term in E itself, so the derivative cannot be taken directly.

Consider a simple example:

y = z + 2, i.e. y = f(z)
z = w + 4, i.e. z = g(w)

There is no w term in y, so dy/dw cannot be computed directly. Instead we use the chain rule:

dy/dw = (dy/dz) · (dz/dw)
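A quick numerical check of this toy example: dy/dz = 1 and dz/dw = 1, so the chain rule gives dy/dw = 1, and a finite-difference estimate agrees (the point w = 0.5 is arbitrary):

```python
def g(w):          # z = w + 4
    return w + 4.0

def f(z):          # y = z + 2
    return z + 2.0

# Chain rule: dy/dw = (dy/dz) * (dz/dw) = 1 * 1 = 1
chain = 1.0 * 1.0

# Central finite-difference estimate of dy/dw at w = 0.5.
h, w = 1e-6, 0.5
numeric = (f(g(w + h)) - f(g(w - h))) / (2 * h)

print(chain, round(numeric, 6))    # 1.0 1.0
```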
What should be done is to follow the chain w → net (z) → activation (a) → error (E), taking the local derivatives dz/dw, da/dz, and dE/da along the way.

The Chain Rule:

dE/dw = (dE/da) · (da/dz) · (dz/dw)
More Complex

With two layers, x → z1 → a1 → z2 → a2 → E (with weights w1 and w2), the chain becomes longer:

dE/dw1 = (dE/da2) · (da2/dz2) · (dz2/da1) · (da1/dz1) · (dz1/dw1)
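Here is a sketch of this longer chain on a tiny two-layer network with one neuron per layer and no biases; the numbers x = 0.5, w1 = 0.4, w2 = 0.7 and target d = 0.9 are made up for illustration, and the five-factor product is checked against a finite-difference estimate:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

x, w1, w2, d = 0.5, 0.4, 0.7, 0.9   # illustrative values, not from the slides

def error(w1):
    a1 = sigmoid(w1 * x)            # z1 = w1*x, a1 = sigmoid(z1)
    a2 = sigmoid(w2 * a1)           # z2 = w2*a1, a2 = sigmoid(z2)
    return 0.5 * (d - a2) ** 2      # E

# Local derivatives along the chain x -> z1 -> a1 -> z2 -> a2 -> E.
a1 = sigmoid(w1 * x)
a2 = sigmoid(w2 * a1)
dE_da2  = -(d - a2)
da2_dz2 = a2 * (1 - a2)
dz2_da1 = w2
da1_dz1 = a1 * (1 - a1)
dz1_dw1 = x

chain = dE_da2 * da2_dz2 * dz2_da1 * da1_dz1 * dz1_dw1

# Central finite-difference estimate of dE/dw1 for comparison.
h = 1e-6
numeric = (error(w1 + h) - error(w1 - h)) / (2 * h)
print(round(chain, 6), round(numeric, 6))   # the two estimates agree
```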
Consider these neurons to work with: the same network, weights, and targets used in the forward pass above.
Adjusting the weight of the output neuron

We start with the weight 0.37, which connects the first hidden neuron to the first output neuron.
How much is the error changing with respect to the output?

Assuming one training sample per iteration (batch size of 1), the error is

E = ½ [{d1 − δ(y1)}² + {d2 − δ(y2)}²]    (d = expected, δ(y) = actual)

∂E/∂δ(y1) = −(d1 − δ(y1)) = −(0.9 − 0.7164) = −0.1836

How much is the output changing with respect to the input (net value)?

∂δ(y1)/∂y1 = δ(y1)[1 − δ(y1)] = 0.7164(1 − 0.7164) = 0.2031

How much is the input (net value) changing with respect to the weight?

∂y1/∂w = δ(z1) = 0.5399

All together:

∂E/∂w = (−0.1836)(0.2031)(0.5399) = −0.0201
Weight Update for the neuron

w_new = w_old − η (∂E/∂w)

where −0.0201 was found from the chain rule, 0.37 is the old weight, and η is the learning rate (how fast you are moving); assume it to be 1.2.

w_new = 0.37 − 1.2(−0.0201) = 0.37 + 1.2(0.0201) = 0.3941

0.3941 is the new weight.
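The same gradient and update for the output weight 0.37, written as a short sketch using the rounded forward-pass values from above (small differences from 0.3941 come only from rounding):

```python
d1, out1 = 0.9, 0.7164     # expected and actual output of the first output neuron
a1 = 0.5399                # activation of the first hidden neuron (input to this weight)
w, eta = 0.37, 1.2         # the weight being updated and the assumed learning rate

dE_dout = -(d1 - out1)          # error w.r.t. the output          -> -0.1836
dout_dy = out1 * (1 - out1)     # output w.r.t. the net value      -> ~0.2031
dy_dw   = a1                    # net value w.r.t. the weight      ->  0.5399

grad  = dE_dout * dout_dy * dy_dw   # ~ -0.0201
w_new = w - eta * grad              # ~  0.394 (0.3941 in the text, up to rounding)
print(round(grad, 4), round(w_new, 4))
```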
Adjusting the weight for the Hidden Layer

Next, adjust the weight w1 = 0.1, which connects x1 to the first hidden neuron. Because δ(z1) feeds into every output neuron, its gradient sums the contributions of all p output neurons (in our case, p = 2):

∂E/∂w1 = [ Σ over the p output neurons of (∂E/∂δ(yk)) (∂δ(yk)/∂yk) (∂yk/∂δ(z1)) ] · (∂δ(z1)/∂z1) · (∂z1/∂w1)

How much is the net value changing with respect to the weight?

z1 = x1(0.1) + x2(−0.2) + bias, so ∂z1/∂w1 = x1 = 0.5

How much is the hidden activation changing with respect to the net value?

δ(z1) = 1 / (1 + e^(−z1)), so ∂(δ(z1))/∂z1 = δ(z1)[1 − δ(z1)] = 0.5399(1 − 0.5399) = 0.2484
How much is the error changing with respect to δ(z1)? Since δ(z1) feeds both output neurons (in our case, p = 2), we add their contributions:

∂E/∂δ(z1) = (∂E/∂δ(y1))(∂δ(y1)/∂y1)(∂y1/∂δ(z1)) + (∂E/∂δ(y2))(∂δ(y2)/∂y2)(∂y2/∂δ(z1))

For the first output neuron (from before): ∂E/∂δ(y1) = −0.1836 and ∂δ(y1)/∂y1 = 0.2031.
Since y1 = δ(z1)(0.37) + δ(z2)(0.9) + 0.31, we have ∂y1/∂δ(z1) = 0.37.

For the second output neuron:
∂E/∂δ(y2) = −[d2 − δ(y2)] = −(0.1 − 0.5238) = 0.4238
∂δ(y2)/∂y2 = δ(y2)[1 − δ(y2)] = 0.5238(1 − 0.5238) = 0.2494
Since y2 = δ(z1)(−0.22) + δ(z2)(−0.12) + 0.27, we have ∂y2/∂δ(z1) = −0.22.

∂E/∂δ(z1) = (−0.1836)(0.2031)(0.37) + (0.4238)(0.2494)(−0.22) = −0.0370

Putting all three factors together:

∂E/∂w1 = (−0.0370)(0.2484)(0.5) = −0.0045954
Weight Update for the hidden neuron

w_new = w_old − η (∂E/∂w1)

where −0.0045954 was found from the chain rule, 0.1 is the old weight, and the learning rate is again assumed to be 1.2.

w_new = 0.1 − 1.2(−0.0045954) = 0.1 + 1.2(0.0045954) = 0.1055

0.1055 is the new weight.
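And the corresponding sketch for the hidden-layer weight 0.1, again using the rounded values carried over from the slides:

```python
x1, eta = 0.5, 1.2                 # input feeding this weight; assumed learning rate
a1 = 0.5399                        # activation of the first hidden neuron
d1, d2 = 0.9, 0.1                  # targets
out1, out2 = 0.7164, 0.5238        # actual outputs
w11, w12 = 0.37, -0.22             # weights from hidden neuron 1 to output neurons 1 and 2

# Error signal reaching delta(z1): sum over the two output neurons (p = 2).
dE_da1 = (-(d1 - out1)) * out1 * (1 - out1) * w11 \
       + (-(d2 - out2)) * out2 * (1 - out2) * w12        # -> about -0.0370

da1_dz1 = a1 * (1 - a1)                                  # -> about 0.2484
dz1_dw  = x1                                             # -> 0.5

grad  = dE_da1 * da1_dz1 * dz1_dw                        # -> about -0.0046
w_new = 0.1 - eta * grad                                 # -> about 0.1055
print(round(grad, 6), round(w_new, 4))
```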
A Final Diagram to Wrap it up

See the training figures at https://www.jeremyjordan.me/neural-networks-training/ for a diagram of the weight updates across the whole network: the blue path and the orange path through the network are combined to form the full gradient.
Continue
• A similar procedure is applied to all the other weights in the network.
Take the second sample (iteration 2): input (0.3, 0.4), with target output (0.9, 0.9), and repeat the same forward and backward passes with the updated weights.
Take the third sample (iteration 3): input (0.7, 0.9), with target output (0.1, 0.1).
• That was ONE EPOCH. An epoch is one complete pass through all the samples. After repeating that for many epochs (e.g. 25), our neural network is expected to reach the minimum error and be considered trained. We'll learn about optimization later!
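To wrap up, here is a compact end-to-end sketch of the whole procedure: forward pass, backpropagation, and weight updates, looping over the three samples for a number of epochs. The data, initial weights, learning rate, and epoch count match the slides, but the vectorised NumPy form is my own framing of the same per-sample updates.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Data: the three input samples and their targets, as in the tables above.
X = np.array([[0.5, -0.5], [0.3, 0.4], [0.7, 0.9]])
D = np.array([[0.9,  0.1], [0.9, 0.9], [0.1, 0.1]])

# Initial weights and biases from the diagram: W1[i, j] connects input i to hidden neuron j,
# and W2[j, k] connects hidden neuron j to output neuron k.
W1 = np.array([[0.1, 0.3], [-0.2, 0.55]]); b1 = np.array([0.01, -0.02])
W2 = np.array([[0.37, -0.22], [0.9, -0.12]]); b2 = np.array([0.31, 0.27])
eta = 1.2                                   # learning rate assumed in the slides

for epoch in range(25):                     # one epoch = one pass through all the samples
    for x, d in zip(X, D):
        # Forward pass.
        a_hid = sigmoid(x @ W1 + b1)        # hidden activations
        a_out = sigmoid(a_hid @ W2 + b2)    # network outputs

        # Backward pass (chain rule, one sample per weight update).
        delta_out = -(d - a_out) * a_out * (1 - a_out)          # dE/d(net) at the output layer
        delta_hid = (delta_out @ W2.T) * a_hid * (1 - a_hid)    # dE/d(net) at the hidden layer

        # Weight update rule: w <- w - eta * dE/dw.
        W2 -= eta * np.outer(a_hid, delta_out); b2 -= eta * delta_out
        W1 -= eta * np.outer(x, delta_hid);     b1 -= eta * delta_hid

# Outputs after training; compare with the target table above.
print(sigmoid(sigmoid(X @ W1 + b1) @ W2 + b2))
```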