IMDB Sentiment Analysis with Keras

This notebook loads and preprocesses movie-review data from the IMDB dataset to build and evaluate a simple neural-network sentiment classifier. It loads and multi-hot vectorizes the training and test data, defines a model with two hidden Dense layers and a sigmoid output, trains it for 20 epochs, and evaluates it on the test set. Because the data is loaded with num_words=1 (rather than the intended 10,000-word vocabulary), every review collapses to reserved indices and the model achieves only chance-level 50% accuracy; a confusion matrix at the end makes this failure mode visible.


from keras.datasets import imdb

# Load the data. NOTE: num_words=1 keeps no real word indices, so every token
# is replaced by the out-of-vocabulary marker; the original comment intended
# the 10,000 most frequently occurring words (num_words=10000)
(train_data, train_labels), (test_data, test_labels) = imdb.load_data(num_words = 1)

Downloading data from https://storage.googleapis.com/tensorflow/tf-keras-datasets/imdb.npz


17464789/17464789 [==============================] - 0s 0us/step
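Everything downstream degenerates because of that argument. A minimal sketch of the load matching the stated intent (top 10,000 words); note that all the outputs captured below reflect the num_words=1 run, not this one:

# Keep only the 10,000 most frequently occurring words
(train_data, train_labels), (test_data, test_labels) = imdb.load_data(num_words=10000)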

%matplotlib inline
import matplotlib
import matplotlib.pyplot as plt

import numpy as np
from keras.utils import to_categorical
from keras import models
from keras import layers

# With num_words=1 no real word index survives: every token is replaced by the
# out-of-vocabulary marker (2), so no index should exceed 2. (With the intended
# num_words=10000, the maximum would be 9999.) We verify this below.

# Here is a list of the maximum index in every review; we then take the
# maximum over that list
print(type([max(sequence) for sequence in train_data]))

# Find the maximum of all max indexes
max([max(sequence) for sequence in train_data])

<class 'list'>
2

# Let's quickly decode a review

# step 1: load the dictionary mappings from word to integer index
word_index = imdb.get_word_index()

# step 2: reverse word index to map integer indexes to their respective words
reverse_word_index = dict([(value, key) for (key, value) in word_index.items()])

# Step 3: decode the review, mapping integer indices to words
#
# indices are off by 3 because 0, 1, and 2 are reserved indices for "padding",
# "start of sequence", and "unknown"
decoded_review = ' '.join([reverse_word_index.get(i-3, '?') for i in train_data[0]])

decoded_review

Downloading data from https://storage.googleapis.com/tensorflow/tf-keras-datasets/imdb_word_i


1641221/1641221 [==============================] - 0s 0us/step
'? ? ? ? ... ?'   (the entire review decodes to '?': with num_words=1 every token was replaced by the out-of-vocabulary index 2, and reverse_word_index has no entry for 2 - 3 = -1)
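For reference, here is a decoder that keeps the reserved tokens visible instead of folding them into '?'. This is a minimal sketch; the <PAD>/<START>/<UNK> names are illustrative, while the index convention (real words start at 3) is the standard Keras one:

reserved = {0: '<PAD>', 1: '<START>', 2: '<UNK>'}

def decode_review(sequence):
    # Reserved control indices get names; real words are shifted back by 3
    return ' '.join(reserved.get(i, reverse_word_index.get(i - 3, '?'))
                    for i in sequence)

decode_review(train_data[0])   # with this notebook's num_words=1 load: a run of '<UNK>'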

import numpy as np
def vectorize_sequences(sequences, dimension=10000):
    results = np.zeros((len(sequences), dimension))    # Creates an all-zero matrix of shape (len(sequences), dimension)
    for i,sequence in enumerate(sequences):
        results[i,sequence] = 1                        # Sets specific indices of results[i] to 1s
    return results

# Vectorize training Data
X_train = vectorize_sequences(train_data)

# Vectorize testing Data
X_test = vectorize_sequences(test_data)

X_train[0]

array([0., 0., 1., ..., 0., 0., 0.])

X_train.shape

(25000, 10000)
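To make the encoding concrete, here is a toy illustration with a hypothetical 8-word vocabulary (indices invented for the example):

vectorize_sequences([[1, 3, 3, 5]], dimension=8)
# -> array([[0., 1., 0., 1., 0., 1., 0., 0.]])
# Each review becomes a fixed-length multi-hot vector: 1 wherever a word index
# occurs (duplicates collapse into a single 1), 0 elsewhere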

# Vectorize labels
y_train = np.asarray(train_labels).astype('float32')
y_test  = np.asarray(test_labels).astype('float32')

Building the Neural Network

Our input data is vectors that must be mapped to scalar labels (0s and 1s). This is one of the simplest
setups, and a plain stack of fully connected (Dense) layers with relu activations performs quite well.

Hidden layers: this network uses two hidden layers of 16 units each, defined as follows.


model = models.Sequential()
model.add(layers.Dense(16, activation='relu', input_shape=(10000,)))
model.add(layers.Dense(16, activation='relu'))
model.add(layers.Dense(1, activation='sigmoid'))

from keras import optimizers
from keras import losses
from keras import metrics
model.compile(optimizer=optimizers.RMSprop(learning_rate=0.001),
              loss = losses.binary_crossentropy,
              metrics = [metrics.binary_accuracy])
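The same configuration can also be written with Keras string shortcuts (equivalent here, since rmsprop's default learning rate is 0.001; shown only as an alternative):

model.compile(optimizer='rmsprop',
              loss='binary_crossentropy',
              metrics=['binary_accuracy'])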

Setting up Validation

# Input for Validation
X_val = X_train[:10000]
partial_X_train = X_train[10000:]

# Labels for validation
y_val = y_train[:10000]
partial_y_train = y_train[10000:]
     

history = model.fit(partial_X_train,
                   partial_y_train,
                   epochs=20,
                   batch_size=512,
                   validation_data=(X_val, y_val))

Epoch 1/20
30/30 [==============================] - 3s 68ms/step - loss: 0.6932 - binary_accuracy: 0.503
Epoch 2/20
30/30 [==============================] - 1s 40ms/step - loss: 0.6932 - binary_accuracy: 0.503
Epoch 3/20
30/30 [==============================] - 1s 39ms/step - loss: 0.6932 - binary_accuracy: 0.503
Epoch 4/20
30/30 [==============================] - 1s 37ms/step - loss: 0.6931 - binary_accuracy: 0.503
Epoch 5/20
30/30 [==============================] - 1s 37ms/step - loss: 0.6932 - binary_accuracy: 0.500
Epoch 6/20
30/30 [==============================] - 1s 36ms/step - loss: 0.6932 - binary_accuracy: 0.503
Epoch 7/20
30/30 [==============================] - 2s 51ms/step - loss: 0.6932 - binary_accuracy: 0.503
Epoch 8/20
30/30 [==============================] - 2s 64ms/step - loss: 0.6932 - binary_accuracy: 0.503
Epoch 9/20
30/30 [==============================] - 1s 40ms/step - loss: 0.6932 - binary_accuracy: 0.503
Epoch 10/20
30/30 [==============================] - 1s 38ms/step - loss: 0.6932 - binary_accuracy: 0.503
Epoch 11/20
30/30 [==============================] - 1s 39ms/step - loss: 0.6932 - binary_accuracy: 0.503
Epoch 12/20
30/30 [==============================] - 1s 50ms/step - loss: 0.6932 - binary_accuracy: 0.503
Epoch 13/20
30/30 [==============================] - 1s 46ms/step - loss: 0.6931 - binary_accuracy: 0.500
Epoch 14/20
30/30 [==============================] - 1s 47ms/step - loss: 0.6932 - binary_accuracy: 0.497
Epoch 15/20
30/30 [==============================] - 1s 38ms/step - loss: 0.6931 - binary_accuracy: 0.501
Epoch 16/20
30/30 [==============================] - 1s 39ms/step - loss: 0.6931 - binary_accuracy: 0.503
Epoch 17/20
30/30 [==============================] - 2s 68ms/step - loss: 0.6931 - binary_accuracy: 0.503
Epoch 18/20
30/30 [==============================] - 1s 47ms/step - loss: 0.6931 - binary_accuracy: 0.503
Epoch 19/20
30/30 [==============================] - 1s 38ms/step - loss: 0.6931 - binary_accuracy: 0.503
Epoch 20/20
30/30 [==============================] - 1s 38ms/step - loss: 0.6931 - binary_accuracy: 0.503

history_dict = history.history
history_dict.keys()

dict_keys(['loss', 'binary_accuracy', 'val_loss', 'val_binary_accuracy'])
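The stored history makes the failure visible: the loss sits pinned near 0.693, which is ln 2, the binary cross-entropy of a coin flip, so the network extracts nothing from its (effectively empty) inputs. A minimal sketch for plotting the curves from history_dict:

loss = history_dict['loss']
val_loss = history_dict['val_loss']
epochs = range(1, len(loss) + 1)

plt.plot(epochs, loss, 'bo', label='Training loss')        # dots
plt.plot(epochs, val_loss, 'b', label='Validation loss')   # solid line
plt.xlabel('Epochs')
plt.ylabel('Loss')
plt.legend()
plt.show()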

# Making Predictions for testing data
np.set_printoptions(suppress=True)
result = model.predict(X_test)

782/782 [==============================] - 2s 3ms/step

result

array([[0.4968544],
[0.4968544],
[0.4968544],
...,
[0.4968544],
[0.4968544],
[0.4968544]], dtype=float32)

y_pred = np.zeros(len(result))
for i, score in enumerate(result):
    y_pred[i] = 1 if score > 0.5 else 0

y_pred

array([0., 0., 0., ..., 0., 0., 0.])
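The loop works, but NumPy can apply the threshold to the whole prediction array in one vectorized step (equivalent result):

y_pred = (result.ravel() > 0.5).astype('float32')   # 1.0 where score > 0.5, else 0.0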

from sklearn.metrics import accuracy_score
accuracy_score(y_test, y_pred)

0.5
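This 50% is exactly the majority-class baseline: every predicted score (about 0.497) falls below the 0.5 threshold, so all 25,000 test reviews are labeled 0, and the balanced IMDB test set contains 12,500 negatives. A quick sanity check against an all-negative predictor:

accuracy_score(y_test, np.zeros_like(y_test))   # also 0.5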

import matplotlib.pyplot as plt
from sklearn import metrics   # note: rebinds the name `metrics` (previously keras.metrics)

confusion_matrix = metrics.confusion_matrix(y_test, y_pred)

# display_labels was truncated in the source; [0, 1] matches the encoded classes
cm_display = metrics.ConfusionMatrixDisplay(confusion_matrix=confusion_matrix,
                                            display_labels=[0, 1])
cm_display.plot()
plt.show()

With every prediction equal to 0, the plot puts all 12,500 negatives and all 12,500 positives in the predicted-0 column: the classifier never predicts the positive class.
