Deep Learning: Theory and Practice
Recurrent Neural Networks 28-03-2019
Introduction
❖ The standard DNN/CNN paradigms
❖ (x, y) - an ordered pair of a data vector/image (x) and a target (y)
❖ Moving to sequence data
❖ (x(t), y(t)) - a sequence-to-sequence mapping task, with a target at every time step (see the shape sketch below).
❖ (x(t), y) - a sequence-to-vector mapping task, with a single target for the whole sequence.
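As a concrete illustration of the two settings (not from the slides), here is a minimal sketch of the array shapes involved; the dimensions are arbitrary:

```python
import numpy as np

T, d_in, d_out = 20, 40, 10           # sequence length, input dim, output dim (arbitrary)

# Sequence-to-sequence: one target per time step.
x_seq = np.random.randn(T, d_in)      # x(t), t = 1..T
y_seq = np.random.randn(T, d_out)     # y(t), t = 1..T

# Sequence-to-vector: a single target for the whole sequence.
x_seq2 = np.random.randn(T, d_in)     # x(t), t = 1..T
y_vec = np.random.randn(d_out)        # one y for the entire sequence
```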
Introduction
❖ Differences from standard DNNs/CNNs
❖ (x(t), y(t)) - a sequence-to-sequence mapping task.
❖ Input features / output targets are correlated in time.
❖ Unlike standard models, where each (x, y) pair is assumed independent.
❖ Need to model dependencies in the sequence over time.
Introduction to Recurrent Networks
[Figure from “Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville]
Recurrent Networks
[Figure from “Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville]
Recurrent Networks
[Figure from “Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville]
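A minimal NumPy sketch of the standard recurrence behind these figures: the hidden state is updated from the current input and the previous hidden state, and an output is read out at every step. Symbol names and dimensions are illustrative.

```python
import numpy as np

d_in, d_h, d_out, T = 40, 64, 10, 20
Wx = np.random.randn(d_h, d_in) * 0.1   # input-to-hidden weights
Wh = np.random.randn(d_h, d_h) * 0.1    # hidden-to-hidden (recurrent) weights
Wo = np.random.randn(d_out, d_h) * 0.1  # hidden-to-output weights
b, c = np.zeros(d_h), np.zeros(d_out)

x = np.random.randn(T, d_in)            # input sequence x(1..T)
h = np.zeros(d_h)                       # initial hidden state h(0)

for t in range(T):
    h = np.tanh(Wx @ x[t] + Wh @ h + b)  # hidden state carries past context
    o = Wo @ h + c                       # output at time t
```

Note that the same parameters (Wx, Wh, Wo, b, c) are shared across all time steps.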
Back Propagation in RNNs
Model Parameters
Gradient Descent
Recurrent Networks
Back Propagation Through Time
Back Propagation Through Time
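Back-propagation through time treats the network unrolled over all T steps as one deep feed-forward graph with shared parameters, and then applies ordinary gradient descent. A minimal PyTorch sketch (names and sizes are illustrative, not from the slides):

```python
import torch
import torch.nn as nn

T, d_in, d_h, d_out = 20, 40, 64, 10
rnn = nn.RNN(input_size=d_in, hidden_size=d_h, batch_first=True)
readout = nn.Linear(d_h, d_out)
optim = torch.optim.SGD(list(rnn.parameters()) + list(readout.parameters()), lr=0.1)

x = torch.randn(1, T, d_in)                 # one input sequence
y = torch.randint(0, d_out, (1, T))         # a target at every time step

h_seq, _ = rnn(x)                           # unrolled hidden states, shape (1, T, d_h)
logits = readout(h_seq)                     # per-step outputs, shape (1, T, d_out)
loss = nn.functional.cross_entropy(logits.reshape(T, d_out), y.reshape(T))

optim.zero_grad()
loss.backward()                             # gradients flow back through all T steps
optim.step()
```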
Standard Recurrent Networks
[Figure from “Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville]
Other Recurrent Networks
Teacher Forcing Networks
[Figure from “Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville]
Recurrent Networks
Teacher Forcing Networks
[Figure from “Deep Learning”, Ian Goodfellow, Yoshua Bengio, Aaron Courville]
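In teacher forcing, the ground-truth output y(t-1) is fed as the recurrent input at step t during training, instead of the model's own previous prediction. A schematic PyTorch sketch with made-up dimensions:

```python
import torch
import torch.nn as nn

d_y, d_h, T = 10, 64, 20
cell = nn.RNNCell(input_size=d_y, hidden_size=d_h)
readout = nn.Linear(d_h, d_y)

y_true = torch.randn(T, 1, d_y)   # ground-truth output sequence (time, batch, feature)
h = torch.zeros(1, d_h)
loss = 0.0
for t in range(T):
    # Teacher forcing: the input at step t is the *true* y(t-1), not the prediction.
    prev = y_true[t - 1] if t > 0 else torch.zeros(1, d_y)
    h = cell(prev, h)
    y_hat = readout(h)
    loss = loss + nn.functional.mse_loss(y_hat, y_true[t])

# At test time the true outputs are unavailable, so the model's own
# prediction y_hat(t-1) is fed back instead.
```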
Recurrent Networks
Multiple Input, Single Output
Recurrent Networks
Single Input, Multiple Output
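A sketch of the two input/output configurations above (many-to-one and one-to-many), using an assumed GRU backbone with arbitrary dimensions:

```python
import torch
import torch.nn as nn

d_in, d_h, n_cls, T = 40, 64, 5, 20
gru = nn.GRU(input_size=d_in, hidden_size=d_h, batch_first=True)
classify = nn.Linear(d_h, n_cls)
init_from_z = nn.Linear(d_in, d_h)

# Multiple input, single output: classify a whole sequence from its final state.
x = torch.randn(1, T, d_in)
_, h_last = gru(x)                       # h_last: (1, 1, d_h)
seq_logits = classify(h_last[-1])        # one prediction for the whole sequence

# Single input, multiple output: generate a sequence from a single vector z.
z = torch.randn(1, d_in)
h0 = torch.tanh(init_from_z(z)).unsqueeze(0)   # initial hidden state derived from z
steps = torch.zeros(1, T, d_in)                # dummy per-step inputs
y_seq, _ = gru(steps, h0)                      # T outputs from a single input
```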
Recurrent Networks
Bi-directional Networks
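A bi-directional network runs one recurrence forward in time and another backward, concatenating the two states at each step so every time step sees both past and future context. A minimal PyTorch sketch:

```python
import torch
import torch.nn as nn

d_in, d_h, T = 40, 64, 20
bigru = nn.GRU(input_size=d_in, hidden_size=d_h, batch_first=True, bidirectional=True)

x = torch.randn(1, T, d_in)
h_seq, _ = bigru(x)
# h_seq[:, t, :d_h] is the forward state, h_seq[:, t, d_h:] the backward state.
print(h_seq.shape)   # torch.Size([1, 20, 128])
```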
Recurrent Networks
Sequence-to-Sequence Mapping Networks
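When input and output lengths differ, a common arrangement is an encoder that summarizes the input sequence into a context vector and a decoder that generates the output sequence from it. An illustrative sketch (not the exact diagram from the slides):

```python
import torch
import torch.nn as nn

d_in, d_h, d_out, T_in, T_out = 40, 64, 10, 20, 7
encoder = nn.GRU(input_size=d_in, hidden_size=d_h, batch_first=True)
decoder = nn.GRUCell(input_size=d_out, hidden_size=d_h)
readout = nn.Linear(d_h, d_out)

x = torch.randn(1, T_in, d_in)
_, h_enc = encoder(x)            # context: final encoder state, shape (1, 1, d_h)

h = h_enc[-1]                    # (1, d_h): decoder starts from the context
y_prev = torch.zeros(1, d_out)   # start symbol
outputs = []
for t in range(T_out):
    h = decoder(y_prev, h)
    y_prev = readout(h)          # feed the prediction back as the next input
    outputs.append(y_prev)
```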
Long-term Dependency Issues
Vanishing/Exploding Gradients
❖ Back-propagated gradients either vanish or explode as the sequence gets longer.
❖ Initial frames may then contribute either too little or too much to the gradient computation (see the numerical sketch below).
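A small numerical illustration (not from the slides): the gradient with respect to early time steps involves a product of T Jacobians of the recurrence, so its norm shrinks or grows geometrically with T.

```python
import numpy as np

d_h, T = 64, 100
h_grad = np.ones(d_h)

for scale in (0.5, 1.5):            # scale of the recurrent weight matrix
    U = np.eye(d_h) * scale
    g = h_grad.copy()
    for _ in range(T):              # back-propagate through T steps
        g = U.T @ g                 # (ignoring the tanh derivative here)
    print(scale, np.linalg.norm(g)) # ~6e-30 (vanishing) vs ~3e+18 (exploding)
```

In practice, exploding gradients are commonly handled by gradient clipping (e.g. torch.nn.utils.clip_grad_norm_), while vanishing gradients motivate the gated architectures discussed next.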
Long Short-Term Memory
LSTM Cell
❖ Input gate
❖ Forget gate
❖ Cell state
❖ Output gate
❖ LSTM output
❖ f - sigmoid function; g, h - tanh functions
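The labels above correspond to the standard LSTM cell update, with sigmoid gates and tanh nonlinearities. A minimal NumPy sketch of a single time step (weight names are illustrative):

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def lstm_step(x_t, h_prev, c_prev, p):
    """One LSTM step; p holds weight matrices W*, U* and biases b*."""
    i = sigmoid(p["Wi"] @ x_t + p["Ui"] @ h_prev + p["bi"])   # input gate
    f = sigmoid(p["Wf"] @ x_t + p["Uf"] @ h_prev + p["bf"])   # forget gate
    o = sigmoid(p["Wo"] @ x_t + p["Uo"] @ h_prev + p["bo"])   # output gate
    g = np.tanh(p["Wg"] @ x_t + p["Ug"] @ h_prev + p["bg"])   # candidate cell update
    c = f * c_prev + i * g                                    # cell state
    h = o * np.tanh(c)                                        # LSTM output
    return h, c

# Tiny usage example with random parameters.
d_in, d_h = 40, 64
rng = np.random.default_rng(0)
p = {}
for gate in "ifog":
    p["W" + gate] = rng.normal(scale=0.1, size=(d_h, d_in))
    p["U" + gate] = rng.normal(scale=0.1, size=(d_h, d_h))
    p["b" + gate] = np.zeros(d_h)
h, c = lstm_step(rng.normal(size=d_in), np.zeros(d_h), np.zeros(d_h), p)
```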
Long Short Term Memory Networks
Gated Recurrent Units (GRU)
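The GRU simplifies the LSTM by using only update and reset gates and dropping the separate cell state. A NumPy sketch of one step, following one common formulation (weight names are illustrative):

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def gru_step(x_t, h_prev, p):
    z = sigmoid(p["Wz"] @ x_t + p["Uz"] @ h_prev + p["bz"])              # update gate
    r = sigmoid(p["Wr"] @ x_t + p["Ur"] @ h_prev + p["br"])              # reset gate
    h_tilde = np.tanh(p["Wh"] @ x_t + p["Uh"] @ (r * h_prev) + p["bh"])  # candidate state
    return (1.0 - z) * h_prev + z * h_tilde                              # new hidden state
```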
Attention in LSTM Networks
❖ Attention provides a mechanism to weight inputs by their relevance (a minimal sketch follows below).
❖ Certain regions of the audio carry more importance than the rest for the task at hand.
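A minimal sketch of the attention computation: score each time step, normalize the scores with a softmax, and form a weighted sum so the more relevant regions dominate the summary. Names and sizes are illustrative:

```python
import torch
import torch.nn as nn

T, d_h = 20, 64
h_seq = torch.randn(1, T, d_h)          # hidden states from an RNN/LSTM over the audio

scorer = nn.Linear(d_h, 1)              # learned relevance score per time step
scores = scorer(h_seq).squeeze(-1)      # (1, T)
alpha = torch.softmax(scores, dim=-1)   # attention weights, sum to 1 over time
context = (alpha.unsqueeze(-1) * h_seq).sum(dim=1)   # (1, d_h) weighted summary
```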
Encoder-Decoder Networks with Attention
Attention Models
Attention - Speech Example
From our lab [part of ICASSP 2019 paper].
Language Recognition Evaluation
End-to-end model using GRUs and Attention
Proposed End-to-End Language Recognition Model
Language Recognition Evaluation
State-of-the-art models use the input sequence directly. We proposed an attention model: attention weighs the importance of each short-term segment feature for the task (a generic sketch follows below).
Attention weight over time for an example utterance:
❖ 0-3s: "O...One muscle at all, it was terrible"
❖ 3s-4s: ".... ah .... ah ...."
❖ 4s-9s: "I couldn't scream, I couldn't shout, I couldn't even move my arms up, or my legs"
❖ 9s-11s: "I was trying me hardest, I was really really panicking."
Bharat Padi, et al., “End-to-end language recognition using hierarchical gated recurrent networks”, under review, 2018.
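For illustration only (this is a generic sketch, not the exact architecture of the cited paper): attention pooling over GRU outputs yields a single utterance-level embedding for language classification and, as a by-product, the per-segment attention weights shown in the example above.

```python
import torch
import torch.nn as nn

class AttentivePoolingClassifier(nn.Module):
    """Generic GRU + attention-pooling classifier (illustrative, hypothetical names)."""
    def __init__(self, d_feat=40, d_h=64, n_langs=8):
        super().__init__()
        self.gru = nn.GRU(d_feat, d_h, batch_first=True, bidirectional=True)
        self.scorer = nn.Linear(2 * d_h, 1)
        self.classifier = nn.Linear(2 * d_h, n_langs)

    def forward(self, x):                      # x: (batch, T, d_feat) short-term features
        h, _ = self.gru(x)                     # (batch, T, 2*d_h)
        alpha = torch.softmax(self.scorer(h).squeeze(-1), dim=-1)   # weight per segment
        utt = (alpha.unsqueeze(-1) * h).sum(dim=1)                  # utterance embedding
        return self.classifier(utt), alpha     # language logits + attention weights

logits, alpha = AttentivePoolingClassifier()(torch.randn(2, 100, 40))
```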