CS-671: Deep Learning and its Applications
Lecture: 06
Computational Graphs
Aditya Nigam, Assistant Professor
School of Computing and Electrical Engineering (SCEE)
Indian Institute of Technology, Mandi
http://faculty.iitmandi.ac.in/~aditya/
[email protected] Presentation for CS-671@IIT Mandi (6 March, 2019)
(Slides credit: Calculus on Computational Graphs: Backpropagation, colah's blog)
http://colah.github.io/posts/2015-08-Backprop/
February - May, 2019
Calculus on Computational Graphs: Backpropagation
Introduction
Backpropagation is the key algorithm that makes training deep
models computationally tractable.
Beyond its use in deep learning, backpropagation is a powerful
computational tool in many other areas, ranging from weather
forecasting to analyzing numerical stability; it just goes by different
names.
The general, application-independent name is reverse-mode
differentiation.
Fundamentally, it's a technique for calculating derivatives quickly.
Computational Graphs
Computational graphs are a nice way to think about mathematical
expressions.
For example, consider the expression e = (a + b)(b + 1). There are three
operations: two additions and one multiplication.
To create a computational graph, we make each of these operations,
along with the input variables, into nodes.
When one node's value is the input to another node, an arrow goes
from one to another.
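As a minimal sketch, the same graph can be evaluated node by node in code.
The values a = 2, b = 1 below are the ones used in the source blog post's
example; the variable names simply mirror the node names.

    # Evaluate the computational graph for e = (a + b) * (b + 1), node by node.
    a, b = 2.0, 1.0   # input nodes (example values from the source blog post)
    c = a + b         # addition node
    d = b + 1         # addition node
    e = c * d         # multiplication node
    print(c, d, e)    # 3.0 2.0 6.0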
Derivatives on Computational Graphs
The key is to understand derivatives on the edges.
If a directly affects c, then we want to know how it affects c.
If a changes a little bit, how does c change? We call this the partial
derivative of c with respect to a.
To evaluate the partial derivatives in this graph, we need the sum rule and
the product rule:
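Following the source blog post, and writing c = a + b and d = b + 1 for the
two addition nodes, the two rules are

    ∂(a + b)/∂a = ∂a/∂a + ∂b/∂a = 1
    ∂(uv)/∂u = u ∂v/∂u + v ∂u/∂u = v

Applied edge by edge, they label the graph with ∂c/∂a = 1, ∂c/∂b = 1,
∂d/∂b = 1, ∂e/∂c = d and ∂e/∂d = c.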
What if we want to understand how nodes that aren't directly
connected affect each other?
Let's consider how e is affected by a. If we change a at a speed of 1, c
also changes at a speed of 1.
In turn, c changing at a speed of 1 causes e to change at a speed of
2. So e changes at a rate of 1 × 2 with respect to a.
The general rule is to sum over all possible paths from one node to the
other, multiplying the derivatives on each edge of the path together.
For example, to get the derivative of e with respect to b we get:
∂e/∂b = 1 × 2 + 1 × 3 (1)
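A short sketch of this sum-over-paths rule on the example graph, with edge
derivatives taken at the example values a = 2, b = 1 (so ∂e/∂c = 2 and
∂e/∂d = 3):

    # Edge derivatives of the example graph at a = 2, b = 1.
    edge = {('a', 'c'): 1.0, ('b', 'c'): 1.0, ('b', 'd'): 1.0,
            ('c', 'e'): 2.0, ('d', 'e'): 3.0}
    # The two paths from b to e.
    paths = [('b', 'c', 'e'), ('b', 'd', 'e')]

    de_db = 0.0
    for path in paths:
        prod = 1.0
        for u, v in zip(path, path[1:]):  # multiply derivatives along the path
            prod *= edge[(u, v)]
        de_db += prod                     # sum over all paths
    print(de_db)                          # 1*2 + 1*3 = 5.0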
Factoring Paths
The problem with just summing over the paths is that it's very easy to
get a combinatorial explosion in the number of possible paths.
If we want to get the derivative of Z with respect to X by summing over
all paths, we need to sum over 3 × 3 = 9 paths:
Instead of just naively summing over the paths, it would be much
better to factor them:
∂Z/∂X = (α + β + γ)(δ + ε + ζ) (2)
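A small numerical illustration of the saving (the edge-derivative values
below are made up purely for illustration): the naive sum forms 9 products,
while the factored form needs only two sums and one product.

    # alpha, beta, gamma are the three X->Y edge derivatives;
    # delta, epsilon, zeta the three Y->Z ones.
    alpha, beta, gamma = 0.1, 0.2, 0.3
    delta, epsilon, zeta = 1.0, 2.0, 3.0

    naive = sum(p * q for p in (alpha, beta, gamma)
                      for q in (delta, epsilon, zeta))            # 9 products
    factored = (alpha + beta + gamma) * (delta + epsilon + zeta)  # 1 product
    print(naive, factored)  # both = 3.6 (up to floating-point rounding)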
This is where forward-mode differentiation and reverse-mode
differentiation come in.
Forward-mode differentiation
Instead of summing over all of the paths explicitly, forward-mode and
reverse-mode differentiation compute the same sum more efficiently by
merging paths back together at every node.
In fact, both algorithms touch each edge exactly once!
Forward-mode differentiation starts at an input to the graph and moves
towards the end. At every node, it sums all the paths feeding in.
Each of those paths represents one way in which the input affects
that node.
By adding them up, we get the total way in which the node is
affected by the input: its derivative.
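A minimal sketch of forward-mode differentiation on the running example,
propagating ∂(node)/∂b from the input b towards the output (values a = 2,
b = 1 assumed as before):

    a, b = 2.0, 1.0
    c, d = a + b, b + 1
    e = c * d

    # Seed: the derivative of the chosen input with respect to itself.
    da_db, db_db = 0.0, 1.0
    # Each node sums the contributions flowing in along its incoming edges.
    dc_db = 1.0 * da_db + 1.0 * db_db   # c = a + b
    dd_db = 1.0 * db_db                 # d = b + 1
    de_db = d * dc_db + c * dd_db       # e = c * d (product rule)
    print(dc_db, dd_db, de_db)          # 1.0 1.0 5.0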
Reverse-mode differentiation, on the other hand, starts at an output of
the graph and moves towards the beginning. At each node, it merges all
paths which originated at that node.
Forward-mode differentiation tracks how one input affects every node.
Reverse-mode differentiation tracks how every node affects one
output.
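A matching sketch of reverse-mode differentiation on the same graph,
propagating ∂e/∂(node) from the output e back towards the inputs (same
assumed values):

    a, b = 2.0, 1.0
    c, d = a + b, b + 1
    e = c * d

    # Seed: the derivative of the output with respect to itself.
    de_de = 1.0
    # Walk from the output towards the inputs, merging paths at each node.
    de_dc = d * de_de                   # e = c * d
    de_dd = c * de_de
    de_db = 1.0 * de_dc + 1.0 * de_dd   # b feeds both c and d
    de_da = 1.0 * de_dc                 # a feeds only c
    print(de_da, de_db)                 # 2.0 5.0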
Forward-mode differentiation from the input b gives us the derivative of
every node with respect to b; in particular, it gives us ∂e/∂b, the
derivative of our output with respect to one of our inputs.
Reverse-mode differentiation from the output e gives us the derivative of
e with respect to every node.
Forward-mode differentiation gave us the derivative of our output
with respect to a single input, but reverse-mode differentiation gives
us all of them.
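Concretely, on the running example (taking a = 2, b = 1 as before), a
single reverse pass from e yields ∂e/∂e = 1, ∂e/∂d = 3, ∂e/∂c = 2,
∂e/∂b = 5 and ∂e/∂a = 2, whereas a single forward pass from b yields the
derivative of every node with respect to b but only one output-input
derivative, ∂e/∂b = 5.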
Computational Victories
Reverse-mode differentiation looks like a strange way of doing the
same thing as forward-mode differentiation.
Is there some advantage?
Forward-mode differentiation tracks how one input affects every node.
Reverse-mode differentiation tracks how every node affects one
output.