Various Structures of RNN

Content
• Bidirectional RNN
• LSTM (Long Short-Term Memory)
Bidirectional RNN
• In a standard RNN, information flows in one direction only
• What if we need both past and future information?
Bidirectional RNN
• Let’s use a reverse information flow
• Not bad... but each pass still sees only one direction
Bidirectional RNN
• Let’s combine both: a forward RNN and a backward RNN over the same sequence
Bidirectional RNN
• Simpler representation
Bidirectional RNN
• Deep bidirectional RNN: stack several bidirectional layers (a minimal forward-pass sketch follows below)
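A minimal NumPy sketch of a single-layer bidirectional RNN forward pass, assuming plain tanh cells; the parameter names (W, U, b) and sizes are illustrative, not from the slides. One RNN reads the sequence left to right, a second reads it right to left, and their hidden states are concatenated at each time step.

import numpy as np

def rnn_pass(xs, W, U, b):
    """Run a simple tanh RNN over a list of input vectors, returning all hidden states."""
    h = np.zeros(W.shape[0])
    states = []
    for x in xs:
        h = np.tanh(W @ h + U @ x + b)
        states.append(h)
    return states

def birnn_forward(xs, fwd_params, bwd_params):
    """Bidirectional pass: forward states and backward states, concatenated per time step."""
    h_fwd = rnn_pass(xs, *fwd_params)              # reads x1 ... xT
    h_bwd = rnn_pass(xs[::-1], *bwd_params)[::-1]  # reads xT ... x1, then re-aligned
    return [np.concatenate([hf, hb]) for hf, hb in zip(h_fwd, h_bwd)]

# Illustrative usage: input size 3, hidden size 4, sequence length 5.
rng = np.random.default_rng(0)
def make_params():
    return (0.1 * rng.standard_normal((4, 4)),   # W: hidden -> hidden
            0.1 * rng.standard_normal((4, 3)),   # U: input  -> hidden
            np.zeros(4))                         # b: bias
xs = [rng.standard_normal(3) for _ in range(5)]
outputs = birnn_forward(xs, make_params(), make_params())  # each output has size 8 (4 forward + 4 backward)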
Long Short-Term Memory (LSTM)
• Long-term dependency in a standard RNN
  • x1 ~ xt-1 are encoded into ht-1
  • ht-1 carries the information about the past
  • It serves as the context for processing xt
Long Short-Term Memory (LSTM)
• Long-term dependency in a standard RNN
  • However, the signal from the past may exponentially decay or grow (a short derivation sketch follows below)
  • In practice, the usable context is limited to about 10 steps
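A standard way to see the decay or growth, sketched here under the assumption of a plain tanh RNN update h_t = tanh(W h_{t-1} + U x_t); this derivation is not on the slides:

\frac{\partial h_t}{\partial h_k}
  = \prod_{j=k+1}^{t} \frac{\partial h_j}{\partial h_{j-1}}
  = \prod_{j=k+1}^{t} \operatorname{diag}\!\big(\tanh'(a_j)\big)\, W ,
\qquad
\Big\lVert \frac{\partial h_t}{\partial h_k} \Big\rVert \le \big(\gamma\,\lVert W \rVert\big)^{\,t-k}

where a_j = W h_{j-1} + U x_j and γ bounds |tanh′|. The factor (γ‖W‖)^(t−k) vanishes when γ‖W‖ < 1 and explodes when it exceeds 1, which is why the usable context is so short.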
Long Short-Term Memory (LSTM)
• Capable of learning long-term dependencies
• LSTM networks introduce a new structure called a memory cell
• An LSTM can learn to bridge time intervals in excess of 1,000 steps
• Gate units learn to open and close access to the past:
  • Input gate
  • Forget gate
  • Output gate
• A neuron with a self-recurrent connection (a minimal one-step sketch follows below)
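A minimal NumPy sketch of a single LSTM time step, assuming the common per-gate parameterization; the names U_i, W_i, b_i, etc. are illustrative, not from the slides.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, s_prev, c_prev, p):
    # p holds one input matrix U_*, one recurrent matrix W_*, and a bias b_* per gate.
    i = sigmoid(p["U_i"] @ x + p["W_i"] @ s_prev + p["b_i"])   # input gate: accept the new
    f = sigmoid(p["U_f"] @ x + p["W_f"] @ s_prev + p["b_f"])   # forget gate: forget the past
    o = sigmoid(p["U_o"] @ x + p["W_o"] @ s_prev + p["b_o"])   # output gate: expose to next step
    g = np.tanh(p["U_g"] @ x + p["W_g"] @ s_prev + p["b_g"])   # candidate, like a standard RNN
    c = f * c_prev + i * g        # internal memory cell
    s = o * np.tanh(c)            # hidden state
    return s, c

# Illustrative usage: hidden size 4, input size 3, random parameters.
rng = np.random.default_rng(0)
p = {}
for gate in "ifog":
    p["U_" + gate] = 0.1 * rng.standard_normal((4, 3))
    p["W_" + gate] = 0.1 * rng.standard_normal((4, 4))
    p["b_" + gate] = np.zeros(4)
s, c = lstm_step(rng.standard_normal(3), np.zeros(4), np.zeros(4), p)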
Long Short-Term Memory (LSTM)
• Equations (shown as a figure on the slide; reconstructed below)
  • 𝒊: input gate, how much of the new input to accept
  • 𝒇: forget gate, how much of the past to forget
  • 𝒐: output gate, how much of the memory to expose to the next time step
  • 𝒈: self-recurrent candidate, computed the same way as a standard RNN
  • 𝒄𝒕: internal memory
  • 𝒔𝒕: hidden state
  • 𝐲: final output
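In this notation, the standard LSTM equations read as follows (a reconstruction, since the slide shows them only as a figure; U, W, V denote the learned weight matrices and ∘ the element-wise product):

i   &= \sigma\big(x_t U^{i} + s_{t-1} W^{i}\big) \\
f   &= \sigma\big(x_t U^{f} + s_{t-1} W^{f}\big) \\
o   &= \sigma\big(x_t U^{o} + s_{t-1} W^{o}\big) \\
g   &= \tanh\big(x_t U^{g} + s_{t-1} W^{g}\big) \\
c_t &= c_{t-1} \circ f + g \circ i \\
s_t &= \tanh(c_t) \circ o \\
y_t &= \operatorname{softmax}(V s_t)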
Long Short-Term Memory (LSTM)
• How it works: the same equations traced gate by gate (𝒊, 𝒇, 𝒐, 𝒈 → 𝒄𝒕 → 𝒔𝒕 → 𝐲) over a single time step
Long Short-Term Memory (LSTM)
• Preserving sequence information
  • O : gate entirely open
  • — : gate entirely closed
• Traditional RNNs are a special case of LSTMs (a substitution check follows below):
  • Input gate set to 1 (passing all new information)
  • Forget gate set to 0 (forgetting all of the past)
  • Output gate set to 1 (exposing the entire memory)
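Plugging these gate values into the LSTM equations above confirms the claim (a quick check, not spelled out on the slides):

i = 1,\; f = 0,\; o = 1 \;\Longrightarrow\;
c_t = 0 \circ c_{t-1} + g \circ 1 = \tanh\big(x_t U^{g} + s_{t-1} W^{g}\big),
\qquad
s_t = \tanh(c_t) \circ 1 = \tanh(g),

which matches the standard RNN update up to the extra tanh on the output.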
Long Short-Term Memory (LSTM)
• RNN vs. LSTM
Long Short-Term Memory (LSTM)
• Gradient Flow
Long Short-Term Memory (LSTM)
• Gradient Flow
• Uninterrupted gradient flow along the chain of cell states c0 → c1 → c2 → c3 (a one-line argument follows below)
• Similar to ResNet! (figure: the ResNet stack from the input 7x7 convolution through repeated 3x3 convolutions to the softmax)
• In between: Highway Networks (Srivastava et al., “Highway Networks”, ICML DL Workshop 2015)
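Why the flow is uninterrupted (a standard argument, sketched here rather than taken from the slide figures): along the direct path through the memory cell,

c_t = f \circ c_{t-1} + i \circ g
\;\Longrightarrow\;
\frac{\partial c_t}{\partial c_{t-1}} = \operatorname{diag}(f),

so the backward signal is only rescaled element-wise by the forget gate, with no repeated multiplication by a weight matrix and no squashing nonlinearity, much like the identity shortcut in ResNet.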
Other Variants: GRU
• Gated Recurrent Unit (GRU) is a type of recurrent neural network (RNN)
• In certain cases, it has advantages over LSTM
• GRU uses less memory and is faster than LSTM (the standard GRU update is sketched below)
• LSTM tends to be more accurate on datasets with longer sequences
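For reference, the standard GRU update written in the same notation as the LSTM equations above (a reconstruction, not taken from the slides). The GRU keeps only two gates, an update gate z and a reset gate r, and merges the memory cell into the hidden state, which is why it needs fewer parameters and less memory:

z   &= \sigma\big(x_t U^{z} + s_{t-1} W^{z}\big) && \text{update gate} \\
r   &= \sigma\big(x_t U^{r} + s_{t-1} W^{r}\big) && \text{reset gate} \\
h   &= \tanh\!\big(x_t U^{h} + (s_{t-1} \circ r)\, W^{h}\big) && \text{candidate} \\
s_t &= (1 - z) \circ h + z \circ s_{t-1}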
Other Variants
• GRU
Comparison
• RNN vs. LSTM vs. GRU
Question and Answer