AML Assignment 2
Team Members:
Nalla Janardhana Rao – MDS202426
Pranav Pothan – MDS202429
Raja S – MDS202430
Task 1
Aim: Build a classifier using an RNN and an LSTM model to classify SMS messages as spam or ham.
Data Processing & Visualization: URLs, symbols, and numbers are removed. Messages are
lemmatised, tokenised, and padded to a uniform length.
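A minimal sketch of this pipeline, assuming NLTK for lemmatisation and the Keras tokenizer; sms_texts is a hypothetical list of raw messages, not a name from the submitted code:

```python
import re
import nltk
from nltk.stem import WordNetLemmatizer
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences

nltk.download("wordnet", quiet=True)            # data for the lemmatiser
lemmatizer = WordNetLemmatizer()

def clean_sms(text):
    text = re.sub(r"http\S+|www\.\S+", " ", text)   # drop URLs
    text = re.sub(r"[^a-z\s]", " ", text.lower())   # drop symbols and numbers
    return " ".join(lemmatizer.lemmatize(w) for w in text.split())

cleaned = [clean_sms(s) for s in sms_texts]      # sms_texts: hypothetical raw messages
tokenizer = Tokenizer()
tokenizer.fit_on_texts(cleaned)
padded = pad_sequences(tokenizer.texts_to_sequences(cleaned), padding="post")
```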
Architecture: Embedding layer → RNN/LSTM → Dropout → Dense → Dropout → Dense
(1 output).
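A Keras sketch of this stack (the report's "Dense" and "Early Stopping" terms suggest Keras; layer widths and dropout rates are illustrative guesses):

```python
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Embedding, SimpleRNN, LSTM, Dropout, Dense

vocab_size = len(tokenizer.word_index) + 1           # from the fitted tokenizer above
model = Sequential([
    Embedding(input_dim=vocab_size, output_dim=64),  # embedding width is a guess
    LSTM(64),                                        # swap in SimpleRNN(64) for the RNN variant
    Dropout(0.3),
    Dense(32, activation="relu"),
    Dropout(0.3),
    Dense(1, activation="sigmoid"),                  # single spam/ham output
])
```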
Training: Adam optimizer (learning rate = 0.0001), Binary Cross-Entropy loss, and Early
Stopping (patience = 5) to prevent overfitting.
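A matching training call; the epoch cap, batch size, and split variable names (X_train, etc.) are assumptions, while the learning rate, loss, and patience come from the report:

```python
from tensorflow.keras.optimizers import Adam
from tensorflow.keras.callbacks import EarlyStopping

model.compile(optimizer=Adam(learning_rate=1e-4),
              loss="binary_crossentropy", metrics=["accuracy"])
model.fit(X_train, y_train, validation_data=(X_val, y_val),
          epochs=50, batch_size=32,                  # epoch cap and batch size are guesses
          callbacks=[EarlyStopping(patience=5, restore_best_weights=True)])
```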
Accuracy: The RNN model achieves 97.74% validation accuracy and the LSTM model achieves
98.03%.
Task 2
Aim: Build an SMS generator that takes the first half of an SMS as input and predicts the
latter half.
Data Processing & Visualization: All SMS are tokenised. Messages longer than 41 tokens are
rare, so the maximum length is set to 50. All SMS are padded; the first half is given as input
and the second half, with an <END> token appended, as the target.
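A hedged sketch of this split, reusing the tokenizer from Task 1; the half-length of 25 and the order of appending <END> before padding are assumptions:

```python
import numpy as np

MAX_LEN, HALF_LEN = 50, 25                      # 50 from the report; the half-split is assumed
END_ID = len(tokenizer.word_index) + 1          # reserve an id for the <END> token

def pad_to(seq, length):
    seq = list(seq)[:length]
    return np.array(seq + [0] * (length - len(seq)))    # 0 acts as PAD

def make_pair(seq):
    half = len(seq) // 2
    src = pad_to(seq[:half], HALF_LEN)                  # first half: model input
    tgt = pad_to(seq[half:] + [END_ID], HALF_LEN)       # second half + <END>: target
    return src, tgt

pairs = [make_pair(seq) for seq in tokenizer.texts_to_sequences(cleaned)]
```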
Architecture: Embedding layer, followed by 3 RNN layers or 1 LSTM layer (with the same number
of parameters), and a final Linear layer with vocab-size output.
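A minimal PyTorch sketch of this model; the embedding and hidden widths are assumptions, while the layer counts and vocab-size output follow the description:

```python
import torch.nn as nn

class SMSGenerator(nn.Module):
    def __init__(self, vocab_size, emb_dim=128, hidden=256, use_lstm=True):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        # 1 LSTM layer, or a 3-layer vanilla RNN for the other variant
        self.rnn = (nn.LSTM(emb_dim, hidden, num_layers=1, batch_first=True)
                    if use_lstm else
                    nn.RNN(emb_dim, hidden, num_layers=3, batch_first=True))
        self.out = nn.Linear(hidden, vocab_size)   # per-step logits over the vocabulary

    def forward(self, x):
        h, _ = self.rnn(self.emb(x))
        return self.out(h)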
Training: Adam optimizer (learning rate = 0.001) and Cross-Entropy loss over the vocabulary,
trained for 100 epochs.
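A training-loop sketch under these settings; the data loader yielding (input, target) half pairs as LongTensors is an assumption:

```python
import torch
import torch.nn as nn

model = SMSGenerator(vocab_size)                      # vocab_size from the tokenizer
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss(ignore_index=0)         # skip PAD positions in the target

for epoch in range(100):
    for src, tgt in loader:                           # first-half inputs, second-half targets
        logits = model(src)                           # (batch, steps, vocab)
        loss = loss_fn(logits.reshape(-1, logits.size(-1)), tgt.reshape(-1))
        opt.zero_grad(); loss.backward(); opt.step()
```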
Performance: Both the RNN and LSTM models successfully generate most of the second half of
an SMS after sufficient training.
Task 3
Aim: Create a Variational Autoencoder (VAE) for notMNIST to demonstrate reconstruction,
generation, and latent space interpolation, and a Conditional Generative Adversarial Network
(CGAN) to generate 4 distinct, class-specific images for each of the 10 notMNIST classes (A–J).
Data Processing & Normalization: The anubhavmaity/notMNIST dataset is loaded as
28 × 28 grayscale images. Two data loaders are used: one scaled to [0, 1] for the VAE (BCE
loss), and another scaled to [−1, 1] for the CGAN (Generator uses Tanh output).
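A sketch of the two normalizations using torchvision transforms (how each is attached to its DataLoader is omitted):

```python
from torchvision import transforms

vae_tf = transforms.ToTensor()                    # [0, 255] -> [0, 1], matching BCE reconstruction
gan_tf = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize((0.5,), (0.5,)),         # [0, 1] -> [-1, 1], matching the Tanh generator
])
```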
VAE Architecture: The VAE uses a convolutional encoder mapping the 28 × 28 input to a
20-dimensional latent distribution (µ, log σ²). Using the reparameterization trick, a vector z is
sampled and decoded back into an image through transposed convolutions.
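A compact PyTorch sketch consistent with this description; only the 20-dimensional latent and the 28 × 28 input/output come from the report, the channel widths and kernel sizes are assumptions:

```python
import torch
import torch.nn as nn

class VAE(nn.Module):
    def __init__(self, z_dim=20):
        super().__init__()
        self.enc = nn.Sequential(
            nn.Conv2d(1, 32, 4, stride=2, padding=1), nn.ReLU(),    # 28 -> 14
            nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.ReLU(),   # 14 -> 7
            nn.Flatten(),
        )
        self.mu = nn.Linear(64 * 7 * 7, z_dim)
        self.logvar = nn.Linear(64 * 7 * 7, z_dim)
        self.fc = nn.Linear(z_dim, 64 * 7 * 7)
        self.dec = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),   # 7 -> 14
            nn.ConvTranspose2d(32, 1, 4, stride=2, padding=1), nn.Sigmoid()  # 14 -> 28, in [0, 1]
        )

    def forward(self, x):
        h = self.enc(x)
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)   # reparameterization trick
        x_hat = self.dec(self.fc(z).view(-1, 64, 7, 7))
        return x_hat, mu, logvar
```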
VAE Training and Loss: Trained for 10 epochs using Adam. The total loss = Reconstruction
Loss (Binary Cross-Entropy) + KL Divergence (to enforce latent space ∼ N(0, 1)).
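The stated objective, written out using the standard closed-form KL divergence against N(0, 1):

```python
import torch
import torch.nn.functional as F

def vae_loss(x_hat, x, mu, logvar):
    recon = F.binary_cross_entropy(x_hat, x, reduction="sum")        # reconstruction term
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())     # KL(q(z|x) || N(0, 1))
    return recon + kl
```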
VAE Results: After training, the VAE demonstrates successful image reconstruction (real vs
decoded), random generation from N(0, 1), and smooth latent interpolations.
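A hedged sketch of producing such interpolations with the VAE class above; vae is a trained instance and img1, img2 are hypothetical test images:

```python
import torch

with torch.no_grad():
    _, mu1, _ = vae(img1.unsqueeze(0))            # encode the two endpoints
    _, mu2, _ = vae(img2.unsqueeze(0))
    frames = [vae.dec(vae.fc((1 - a) * mu1 + a * mu2).view(-1, 64, 7, 7))
              for a in torch.linspace(0, 1, 10)]  # 10 evenly spaced blends
```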
CGAN Generator: Takes a random noise vector concatenated with a one-hot class label,
passes through a dense projection and three ConvTranspose2d layers with BatchNorm + ReLU,
ending with a Tanh activation output in [−1, 1].
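A sketch of this Generator; the noise dimension of 100 and the channel widths are assumptions, while the dense projection, the three ConvTranspose2d layers with BatchNorm + ReLU, and the Tanh output follow the description:

```python
import torch
import torch.nn as nn

class Generator(nn.Module):
    def __init__(self, z_dim=100, n_classes=10):
        super().__init__()
        self.fc = nn.Linear(z_dim + n_classes, 128 * 7 * 7)      # dense projection
        self.net = nn.Sequential(
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1),  # 7 -> 14
            nn.BatchNorm2d(64), nn.ReLU(),
            nn.ConvTranspose2d(64, 32, 3, stride=1, padding=1),   # 14 -> 14
            nn.BatchNorm2d(32), nn.ReLU(),
            nn.ConvTranspose2d(32, 1, 4, stride=2, padding=1),    # 14 -> 28
            nn.Tanh(),                                            # output in [-1, 1]
        )

    def forward(self, z, onehot):
        h = self.fc(torch.cat([z, onehot], dim=1)).view(-1, 128, 7, 7)
        return self.net(h)
```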
CGAN Discriminator: A convolutional network taking the image concatenated with a label
embedding, using Conv2d layers, LeakyReLU, BatchNorm, and a final Linear + Sigmoid output
for real/fake classification.
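A matching Discriminator sketch; mapping the label embedding to a 28 × 28 plane concatenated as a second channel is one common realisation of "image concatenated with a label embedding", and the channel widths are assumptions:

```python
import torch
import torch.nn as nn

class Discriminator(nn.Module):
    def __init__(self, n_classes=10):
        super().__init__()
        self.label_map = nn.Embedding(n_classes, 28 * 28)   # label -> image-shaped plane
        self.net = nn.Sequential(
            nn.Conv2d(2, 32, 4, stride=2, padding=1), nn.LeakyReLU(0.2),   # 28 -> 14
            nn.Conv2d(32, 64, 4, stride=2, padding=1),
            nn.BatchNorm2d(64), nn.LeakyReLU(0.2),                         # 14 -> 7
            nn.Flatten(),
            nn.Linear(64 * 7 * 7, 1), nn.Sigmoid(),         # real/fake probability
        )

    def forward(self, x, labels):
        plane = self.label_map(labels).view(-1, 1, 28, 28)
        return self.net(torch.cat([x, plane], dim=1))
```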
CGAN Training Setup: Trained for 15 epochs using Adam (β = (0.5, 0.999)). Both Generator
and Discriminator use Binary Cross-Entropy Loss (BCELoss).
CGAN Training Procedure: The Discriminator is trained on real and fake (Image, Label)
pairs, followed by training the Generator to fool the Discriminator.
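The alternating updates, sketched with the classes above; the betas come from the setup, while the learning rate of 2e-4 and the data loader are assumptions:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

G, D = Generator(), Discriminator()
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4, betas=(0.5, 0.999))
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4, betas=(0.5, 0.999))
bce = nn.BCELoss()

for epoch in range(15):
    for real, labels in loader:                      # (image, label) batches scaled to [-1, 1]
        b = real.size(0)
        onehot = F.one_hot(labels, 10).float()
        fake = G(torch.randn(b, 100), onehot)

        # Discriminator step: real pairs -> 1, fake pairs -> 0
        d_loss = (bce(D(real, labels), torch.ones(b, 1)) +
                  bce(D(fake.detach(), labels), torch.zeros(b, 1)))
        opt_d.zero_grad(); d_loss.backward(); opt_d.step()

        # Generator step: push D to call the fakes real
        g_loss = bce(D(fake, labels), torch.ones(b, 1))
        opt_g.zero_grad(); g_loss.backward(); opt_g.step()
```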
Conditional Generation: After training, the Generator produces 4 examples per class (A–J),
showing successful label-controlled notMNIST image generation.
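For reference, sampling 4 images per class with the trained Generator could look like:

```python
import torch
import torch.nn.functional as F

with torch.no_grad():
    labels = torch.arange(10).repeat_interleave(4)   # classes A-J, 4 samples each
    samples = G(torch.randn(40, 100), F.one_hot(labels, 10).float())
```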