RNN
A Recurrent Neural Network (RNN) is one type of ANN.
Applications:
• Speech recognition
• Language translation
• Stock market prediction
• Named entity recognition
Different types of RNN
• One-to-one:
• This is also called a plain neural network. It deals with a fixed
size of input and a fixed size of output, where they are
independent of previous information/output.
• Example: Image classification.
• One-to-Many:
• It takes a fixed size of information as input and gives a
sequence of data as output.
• Example: Image captioning takes an image as input and
outputs a sentence of words.
• Many-to-One:
• It takes a sequence of information as input and outputs a fixed
size of output.
• Example: Sentiment analysis, where a sentence is classified as
expressing positive or negative sentiment.
• Many-to-Many:
• It takes a sequence of information as input and recurrently
processes it to output a sequence of data.
• Example: Machine translation, where the RNN reads a sentence
in English and then outputs the sentence in French.
• Bidirectional Many-to-Many:
• Synced sequence input and output. Notice
that in every case there are no pre-specified
constraints on the sequence lengths, because
the recurrent transformation is fixed
and can be applied as many times as we like.
• Example: Video classification, where we wish
to label every frame of the video.
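• As a quick illustration of these input/output patterns, here is a minimal sketch with
made-up toy shapes: a Keras recurrent layer can emit either one output per sequence
(many-to-one) or one output per time step (many-to-many), depending on return_sequences.
import numpy as np
from tensorflow.keras.layers import SimpleRNN
# A batch of 2 toy sequences, each with 10 time steps of 8 features (illustrative shapes).
x = np.random.rand(2, 10, 8).astype("float32")
many_to_one = SimpleRNN(16)(x)                           # shape (2, 16): only the final state
many_to_many = SimpleRNN(16, return_sequences=True)(x)   # shape (2, 10, 16): one output per step
print(many_to_one.shape, many_to_many.shape)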
How RNN works
Consider an unfolded RNN
• The formula for the current state can be
written as
ht = f(ht-1, xt)
Here, ht is the new state, ht-1 is the previous state and xt is the current input.
We now have a state of the previous input instead of the input itself, because
the input neuron would have applied the transformations on our previous
input. So each successive input is called a time step.
• In this case, we have four inputs to be given to the
network. During the recurrence, the same
function and the same weights are applied to the
network at each time step.
• Taking the simplest form of a recurrent neural
network, let’s say that the activation function is
tanh, the weight at the recurrent neuron is Whh,
and the weight at the input neuron is Wxh. We
can then write the equation for the state at time t
as
ht = tanh(Whh·ht-1 + Wxh·xt)
• The Recurrent neuron, in this case, is just
considering the immediately previous state.
For longer sequences, the equation can
involve multiple such states. Once the final
state is calculated we can go on to produce
the output.
Now, once the current state is calculated, we can
calculate the output state as
yt = Why·ht
where Why is the weight at the output neuron.
• Let me summarize the steps in a recurrent neuron
• A single time step of the input is supplied to the network i.e. xt is supplied to the
network
• We then calculate its current state using a combination of the current input and the
previous state i.e. we calculate ht
• The current ht becomes ht-1 for the next time step
• We can go through as many time steps as the problem demands and combine the
information from all the previous states
• Once all the time steps are completed the final current state is used to calculate the
output yt
• The output is then compared to the actual output and the error is generated
• The error is then backpropagated through the network to update the weights (we shall go
into the details of backpropagation in further sections), and the network is trained.
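• The steps above can be written down in a few lines of NumPy. This is only a sketch with
made-up sizes and the weight names Wxh, Whh, and Why from the equations above; it shows
the forward pass and the error, and omits backpropagation.
import numpy as np
rng = np.random.default_rng(7)
input_size, hidden_size, output_size = 3, 5, 2
Wxh = rng.standard_normal((hidden_size, input_size)) * 0.1   # weight at the input neuron
Whh = rng.standard_normal((hidden_size, hidden_size)) * 0.1  # weight at the recurrent neuron (shared across steps)
Why = rng.standard_normal((output_size, hidden_size)) * 0.1  # weight at the output neuron
xs = rng.standard_normal((4, input_size))  # four inputs, one per time step
ht = np.zeros(hidden_size)                 # initial state
for xt in xs:
    ht = np.tanh(Whh @ ht + Wxh @ xt)      # ht = tanh(Whh·ht-1 + Wxh·xt); current ht becomes ht-1 next step
yt = Why @ ht                              # output from the final state: yt = Why·ht
target = np.array([1.0, 0.0])              # made-up actual output
error = 0.5 * np.sum((yt - target) ** 2)   # error that would be backpropagated through time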
• Long Short-Term Memory(LSTM):
• LSTM is an improved version of the regular RNN which was
designed to make it easy to capture long-term dependencies
in sequence data. A regular RNN functions in such a way that
the hidden state activation is influenced by the local
activations nearest to it, which corresponds to a “short-
term memory”, while the network weights are influenced by
the computations that take place over entire long sequences,
which corresponds to “long-term memory”. Hence the RNN
was redesigned so that it has an activation state that can also
act as weights and preserve information over long distances,
hence the name “Long Short-Term Memory”.
• LSTMs are explicitly designed to avoid the
long-term dependency problem.
Remembering information for long periods is
practically their default behavior
• Architecture
[Figure: LSTM cell, with the previous cell state Ct-1 and previous hidden state ht-1
flowing through the forget, input, and output gates.]
• The core idea is the cell state Ct: it is changed slowly, with only minor linear
interactions, so it is very easy for information to flow along it unchanged.
• Forget gate: a sigmoid gate that determines how much of the previous cell state
is let through.
• Input gate: decides what components are to be updated and what information is
to be added to the cell state; the candidate C't provides the change contents.
• Updating the cell state: combine the retained part of Ct-1 with the new candidate.
• Output gate: controls what information goes into the output, i.e. decides what
part of the cell state to output.
• Why sigmoid or tanh? Sigmoid outputs values between 0 and 1, so it acts as a
gating switch. The vanishing gradient problem is already handled in the LSTM;
could ReLU replace tanh?
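• A minimal NumPy sketch of one LSTM step following the gates above. The weight names
(Wf, Wi, Wc, Wo) and the sizes are illustrative assumptions; biases are omitted for brevity,
and real implementations fuse the matrices and handle batching.
import numpy as np
rng = np.random.default_rng(0)
input_size, hidden_size = 3, 4
def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))
# each gate sees the concatenation [ht-1, xt]
Wf = rng.standard_normal((hidden_size, hidden_size + input_size)) * 0.1  # forget gate weights
Wi = rng.standard_normal((hidden_size, hidden_size + input_size)) * 0.1  # input gate weights
Wc = rng.standard_normal((hidden_size, hidden_size + input_size)) * 0.1  # candidate weights
Wo = rng.standard_normal((hidden_size, hidden_size + input_size)) * 0.1  # output gate weights
def lstm_step(xt, h_prev, c_prev):
    z = np.concatenate([h_prev, xt])
    ft = sigmoid(Wf @ z)            # forget gate: how much of Ct-1 to keep
    it = sigmoid(Wi @ z)            # input gate: which components to update
    c_hat = np.tanh(Wc @ z)         # candidate C't: the change contents
    ct = ft * c_prev + it * c_hat   # updating the cell state
    ot = sigmoid(Wo @ z)            # output gate: what part of the state to output
    ht = ot * np.tanh(ct)
    return ht, ct
h, c = np.zeros(hidden_size), np.zeros(hidden_size)
h, c = lstm_step(rng.standard_normal(input_size), h, c)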
RNN vs LSTM
Implementation
• Let’s start by importing the classes and functions required for this model and initializing the
random number generator to a constant value to ensure you can easily reproduce the results.
import tensorflow as tf
from tensorflow.keras.datasets import imdb
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense
from tensorflow.keras.layers import LSTM
from tensorflow.keras.layers import Embedding
from tensorflow.keras.preprocessing import sequence
# fix random seed for reproducibility
tf.random.set_seed(7)
• You need to load the IMDB dataset. You are constraining the dataset to the top 5,000 words.
You will also split the dataset into train (50%) and test (50%) sets
# load the dataset but only keep the top n words, zero the rest
top_words = 5000
(X_train, y_train), (X_test, y_test) = imdb.load_data(num_words=top_words)
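• As a quick check on what was loaded (a sketch; the printed values are indicative), each
review comes back as a variable-length list of word indices below top_words, and the
labels are 0 (negative) or 1 (positive), with 25,000 reviews in each split.
import numpy as np
print(len(X_train), len(X_test))   # 25000 25000
print(X_train[0][:10])             # first ten word indices of the first review
print(np.unique(y_train))          # [0 1]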
Next, you need to truncate and pad the input sequences, so they are all the same length for
modeling. The model will learn that the zero values carry no information. The sequences are
not the same length in terms of content, but same-length vectors are required to perform the
computation in Keras
# truncate and pad input sequences
max_review_length = 500
X_train = sequence.pad_sequences(X_train, maxlen=max_review_length)
X_test = sequence.pad_sequences(X_test, maxlen=max_review_length)
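• To see what pad_sequences does, here is a tiny illustrative example with made-up values:
by default Keras pads with zeros and truncates at the front of each sequence, so every row
ends up with the same length.
demo = sequence.pad_sequences([[1, 2, 3], [4, 5, 6, 7, 8, 9]], maxlen=5)
print(demo)
# [[0 0 1 2 3]
#  [5 6 7 8 9]]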
• You can now define, compile and fit your LSTM model.
• The first layer is the Embedding layer that uses 32-length vectors to represent each word.
The next layer is the LSTM layer with 100 memory units (LSTM cells). Finally, because
this is a classification problem, you will use a Dense output layer with a single neuron
and a sigmoid activation function to make 0 or 1 predictions for the two classes (good
and bad) in the problem.
• Because it is a binary classification problem, log loss is used as the loss function
(binary_crossentropy in Keras). The efficient ADAM optimization algorithm is used. The
model is fit for only three epochs because it quickly overfits the problem. A large batch
size of 64 reviews is used to space out weight updates.
# create the model
embedding_vector_length = 32
model = Sequential()
model.add(Embedding(top_words, embedding_vector_length, input_length=max_review_length))
model.add(LSTM(100))
model.add(Dense(1, activation='sigmoid'))
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
print(model.summary())
model.fit(X_train, y_train, validation_data=(X_test, y_test), epochs=3, batch_size=64)
# Final evaluation of the model
scores = model.evaluate(X_test, y_test, verbose=0)
print("Accuracy: %.2f%%" % (scores[1]*100))