
Variational Autoencoders (VAEs)

Module 2
What is an Autoencoder?
An autoencoder is a type of artificial neural network used to learn efficient
representations of data, typically for the purpose of dimensionality reduction, data
compression, or unsupervised learning. Autoencoders are unsupervised learning
models because they don’t require labeled data; instead, they rely on reconstructing
their input data.
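As a minimal illustration (a PyTorch sketch, not part of the original slides; layer and latent sizes are arbitrary assumptions for flattened 28x28 images), an autoencoder is simply an encoder that compresses the input into a small latent vector and a decoder that reconstructs the input from it:

```python
import torch
import torch.nn as nn

class Autoencoder(nn.Module):
    """Minimal dense autoencoder: 784 -> 32 -> 784 (sizes are illustrative)."""
    def __init__(self, input_dim=784, latent_dim=32):
        super().__init__()
        # Encoder compresses the input to a low-dimensional code.
        self.encoder = nn.Sequential(
            nn.Linear(input_dim, 128),
            nn.ReLU(),
            nn.Linear(128, latent_dim),
        )
        # Decoder reconstructs the input from the code.
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, 128),
            nn.ReLU(),
            nn.Linear(128, input_dim),
            nn.Sigmoid(),  # assumes inputs scaled to [0, 1]
        )

    def forward(self, x):
        z = self.encoder(x)
        return self.decoder(z)
```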
Introduction to Autoencoders

Autoencoders: The Sneaky Idea
Necessary conditions to learn a representation

• Data should have dependencies across dimensions.
• If the dimensions are all independent, it is impossible to learn a lower-dimensional representation.
PCA vs Encoders

• Both perform dimensionality reduction.
• PCA learns only linear relationships.
• Encoders can learn non-linear relationships.
• An encoder is equivalent to PCA if it uses only linear activation functions (see the sketch below).
• Decoders decompress the representation back to the original domain.
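A small sketch of these two points (not from the slides; the synthetic data, layer sizes, and training settings are assumptions): data with dependencies across dimensions (rank-2 data embedded in 10 dimensions) can be compressed to 2 dimensions by PCA, and a purely linear autoencoder trained with MSE learns the same 2-dimensional subspace.

```python
import numpy as np
import torch
import torch.nn as nn
from sklearn.decomposition import PCA

# Synthetic correlated data: rank-2 structure embedded in 10 dimensions.
rng = np.random.default_rng(0)
X = (rng.normal(size=(1000, 2)) @ rng.normal(size=(2, 10))).astype(np.float32)

# PCA with 2 components reconstructs this rank-2 data almost perfectly.
pca = PCA(n_components=2).fit(X)
X_pca = pca.inverse_transform(pca.transform(X))

# Linear autoencoder: no nonlinearities, same bottleneck size of 2.
model = nn.Sequential(nn.Linear(10, 2), nn.Linear(2, 10))
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
Xt = torch.from_numpy(X)
for _ in range(2000):
    opt.zero_grad()
    loss = nn.functional.mse_loss(model(Xt), Xt)
    loss.backward()
    opt.step()

# Both should reach low reconstruction error on this rank-2 data.
print(np.mean((X - X_pca) ** 2), loss.item())
```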
How can we train an Autoencoder?

• By backpropagation.
• Minimize the reconstruction error (a minimal training sketch follows below).

What do we ask of an Autoencoder?

• Sensitive enough to the input data to reconstruct it.
• Insensitive enough to the input data not to overfit it.
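A minimal training-loop sketch (assuming the Autoencoder class sketched earlier, a standard PyTorch data loader, and images flattened to vectors in [0, 1]): the network is trained by backpropagation to minimize a reconstruction error such as mean squared error.

```python
import torch
import torch.nn as nn

def train_autoencoder(model, data_loader, epochs=10, lr=1e-3):
    """Train by backpropagation to minimize reconstruction error (MSE)."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.MSELoss()
    for epoch in range(epochs):
        for x, _ in data_loader:        # labels are ignored: unsupervised
            x = x.view(x.size(0), -1)   # flatten images to vectors
            x_hat = model(x)            # reconstruct the input
            loss = loss_fn(x_hat, x)    # reconstruction error
            opt.zero_grad()
            loss.backward()             # backpropagation
            opt.step()
        print(f"epoch {epoch}: loss={loss.item():.4f}")
```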
Deep Autoencoder
Deep Convolutional Autoencoder

• Similar architecture to a standard AE
• Uses convolutional layers
• Encoder: Convolution + Leaky ReLU + Batch Normalization
• Decoder: Transposed convolution + Leaky ReLU + Batch Normalization
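A sketch of that convolutional variant (channel counts and the 1x28x28 input size are assumptions, not from the slides): convolution + Leaky ReLU + batch normalization in the encoder, transposed convolution + Leaky ReLU + batch normalization in the decoder.

```python
import torch.nn as nn

class ConvAutoencoder(nn.Module):
    """Convolutional autoencoder for 1x28x28 inputs (sizes are illustrative)."""
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 16, 3, stride=2, padding=1),   # 28x28 -> 14x14
            nn.LeakyReLU(0.2),
            nn.BatchNorm2d(16),
            nn.Conv2d(16, 32, 3, stride=2, padding=1),  # 14x14 -> 7x7
            nn.LeakyReLU(0.2),
            nn.BatchNorm2d(32),
        )
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(32, 16, 3, stride=2, padding=1, output_padding=1),  # 7x7 -> 14x14
            nn.LeakyReLU(0.2),
            nn.BatchNorm2d(16),
            nn.ConvTranspose2d(16, 1, 3, stride=2, padding=1, output_padding=1),   # 14x14 -> 28x28
            nn.Sigmoid(),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))
```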
Autoencoder Applications

• Generation
• Denoising
• Anomaly Detection
Generation with AEs
Denoising with AEs
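For denoising, a minimal sketch (assuming additive Gaussian noise and the dense autoencoder sketched earlier): the model receives a corrupted input but the reconstruction loss is computed against the clean original, so it learns to remove the noise.

```python
import torch
import torch.nn as nn

def denoising_step(model, x, optimizer, noise_std=0.3):
    """One training step of a denoising autoencoder."""
    x = x.view(x.size(0), -1)
    x_noisy = (x + noise_std * torch.randn_like(x)).clamp(0.0, 1.0)  # corrupt the input
    x_hat = model(x_noisy)                   # reconstruct from the noisy version
    loss = nn.functional.mse_loss(x_hat, x)  # compare against the clean input
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```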
Generative AI models work with different types of data
Variational Autoencoders
What is a Variational Autoencoder?
• The variational autoencoder was proposed in 2013 by Diederik P. Kingma and Max Welling.
• A variational autoencoder (VAE) provides a probabilistic manner for describing an observation in latent space. Thus, rather than building an encoder that outputs a single value to describe each latent state attribute, we formulate the encoder to describe a probability distribution for each latent attribute.
• It has many applications, such as data compression and synthetic data creation.
Variational Autoencoders

• A variational autoencoder differs from a plain autoencoder in that it provides a statistical manner for describing the samples of the dataset in latent space.
• In a variational autoencoder, the encoder outputs a probability distribution in the bottleneck layer instead of a single output value (see the sketch below).
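A sketch of that difference in the bottleneck (layer and latent sizes are assumptions): instead of a single latent vector, the VAE encoder outputs the mean and log-variance of a Gaussian distribution for each latent attribute.

```python
import torch
import torch.nn as nn

class VAEEncoder(nn.Module):
    """Encoder that outputs a distribution (mu, log_var) rather than a point."""
    def __init__(self, input_dim=784, hidden_dim=256, latent_dim=20):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Linear(input_dim, hidden_dim),
            nn.ReLU(),
        )
        self.fc_mu = nn.Linear(hidden_dim, latent_dim)       # mean of q(z|x)
        self.fc_log_var = nn.Linear(hidden_dim, latent_dim)  # log-variance of q(z|x)

    def forward(self, x):
        h = self.backbone(x)
        return self.fc_mu(h), self.fc_log_var(h)
```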
Architecture of Variational Autoencoders

• The encoder-decoder architecture lies at the heart of Variational Autoencoders (VAEs), distinguishing them from traditional autoencoders. The encoder network takes raw input data and transforms it into a probability distribution within the latent space.
• The latent code generated by the encoder is a probabilistic encoding, allowing the VAE to express not just a single point in the latent space but a distribution of potential representations.
• The decoder network, in turn, takes a sampled point from the latent distribution and reconstructs it back into data space.
• During training, the model refines both the encoder and decoder parameters to minimize the reconstruction loss, that is, the disparity between the input data and the decoded output. The goal is not just to achieve accurate reconstruction but also to regularize the latent space, ensuring that it conforms to a specified distribution.
Architecture of Variational Autoencoders
• The process involves a delicate balance between two essential components: the reconstruction loss and the regularization term, often represented by the Kullback-Leibler (KL) divergence.
• The reconstruction loss compels the model to accurately reconstruct the input, while
the regularization term encourages the latent space to adhere to the chosen
distribution, preventing overfitting and promoting generalization.
• By iteratively adjusting these parameters during training, the VAE learns to encode
input data into a meaningful latent space representation.
• This optimized latent code encapsulates the underlying features and structures of
the data, facilitating precise reconstruction. The probabilistic nature of the latent
space also enables the generation of novel samples by drawing random points from
the learned distribution.
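A sketch of the sampling step described above (assuming the VAEEncoder from the earlier sketch and a matching decoder): a point is drawn from the predicted distribution using the reparameterization trick z = mu + sigma * eps, decoded back into data space, and novel samples are generated by decoding random points drawn from the prior.

```python
import torch

def reparameterize(mu, log_var):
    """Draw z ~ N(mu, sigma^2) in a way that keeps gradients flowing."""
    std = torch.exp(0.5 * log_var)
    eps = torch.randn_like(std)
    return mu + eps * std

def reconstruct(encoder, decoder, x):
    """Encode to a distribution, sample a latent point, and decode it."""
    mu, log_var = encoder(x)
    z = reparameterize(mu, log_var)   # sample from the latent distribution
    return decoder(z), mu, log_var

def generate(decoder, num_samples, latent_dim=20):
    """Generate novel samples by decoding random points from the prior N(0, I)."""
    z = torch.randn(num_samples, latent_dim)
    return decoder(z)
```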
Mathematics behind Variational Autoencoders
• The variational autoencoder uses the KL-divergence in its loss function; the goal is to minimize the difference between an approximate distribution and the original distribution of the dataset.
• Suppose we have a latent variable z and we want to generate an observation x from it. In other words, we want to calculate the posterior p(z|x).
• By Bayes' rule,

    p(z|x) = p(x|z) p(z) / p(x),   where   p(x) = ∫ p(x|z) p(z) dz.

  However, computing p(x) requires integrating over all possible latent values, which can be quite difficult. This usually makes p(z|x) an intractable distribution. Hence, we need to approximate p(z|x) with a simpler distribution q(z|x) to make it tractable.
Mathematics behind Variational Autoencoders
• To make q(z|x) a good approximation of p(z|x), we minimize the KL-divergence, which measures how different the two distributions are:

    min  KL( q(z|x) || p(z|x) ).

• By simplifying, the above minimization problem is equivalent to the following maximization problem:

    max  E_{z ~ q(z|x)} [ log p(x|z) ] - KL( q(z|x) || p(z) ).

  The first term represents the reconstruction likelihood, and the second term ensures that our learned distribution q(z|x) is similar to the true prior distribution p(z).
• Thus, our total loss consists of two terms, one being the reconstruction error and the other the KL-divergence loss:

    Loss = -E_{z ~ q(z|x)} [ log p(x|z) ] + KL( q(z|x) || p(z) ).
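A minimal sketch of this loss in code (a PyTorch sketch, not from the slides; it assumes the decoder outputs values in [0, 1], so binary cross-entropy serves as the reconstruction term, and a standard normal prior N(0, I)):

```python
import torch
import torch.nn.functional as F

def vae_loss(x_hat, x, mu, log_var):
    """Reconstruction error + KL( q(z|x) || N(0, I) )."""
    recon = F.binary_cross_entropy(x_hat, x, reduction="sum")
    # Closed-form KL divergence between N(mu, sigma^2) and N(0, 1), summed over latent dims.
    kl = -0.5 * torch.sum(1 + log_var - mu.pow(2) - log_var.exp())
    return recon + kl
```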
Limitations of VAEs

• Blurry Image Generation: VAEs generate blurry images because they optimize a likelihood-based reconstruction loss, which often leads to averaging pixel values. The output images lack sharp details.
• Trade-off Between Reconstruction Quality and Latent Space Regularization: The KL divergence term forces the latent space to follow a specific prior distribution (e.g., Gaussian), which can reduce the quality of reconstructions by making them less specific.
• Mode Averaging: VAEs tend to average different data modes instead of capturing distinct variations, resulting in unrealistic or less diverse outputs.
• Poorly Structured Latent Space for Complex Data: For highly complex data (e.g., realistic human faces), VAEs struggle to capture meaningful latent representations due to their probabilistic nature.
• Inefficient Sampling: Since VAEs rely on a fixed prior (like a Gaussian), samples from the latent space may not align well with real-world data distributions, leading to unrealistic generations.
• KL Divergence Instability: Tuning the weight of the KL divergence term is tricky. Too much regularization leads to poor reconstructions, while too little reduces generative diversity.
• Overly Smoothed Outputs: VAEs encourage smooth and continuous latent spaces, which can lead to overly smooth generations that lack fine detail.
• Difficulties in High-Resolution Image Synthesis: Standard VAEs struggle with generating high-resolution images due to the inherent limitations of their decoder structure.
• Difficulty Handling Discrete Data: VAEs perform well on continuous data but face challenges when working with discrete data (e.g., text generation, categorical data) because of the limitations of the reparameterization trick.
• Lack of Sharp Latent Representations: The latent variables in VAEs often encode blurry or ambiguous representations due to the variational posterior approximation.
• Higher Computational Cost than Standard Autoencoders: The extra KL divergence computation and sampling steps make VAEs computationally more expensive than standard autoencoders.
• Variability in Training Stability: While VAEs are generally more stable than GANs, balancing their loss terms requires careful tuning of hyperparameters.
• Suboptimal Representation Learning: Although VAEs enforce structured representations, they might not always capture the most meaningful features for downstream tasks like classification.
• Unclear Evaluation Metrics: Evaluating VAEs is challenging because likelihood-based metrics don't always correlate well with perceptual quality.
• Need for Post-Processing: Extra steps, such as adversarial training or other post-processing techniques, are often needed to sharpen the generated outputs.
Why choose GANs over VAEs

• High-Quality, Sharp Images: GANs use an adversarial loss, which encourages the generator to produce high-quality, sharp images rather than blurry reconstructions.
• No Explicit Latent Space Regularization: GANs do not enforce a predefined prior on the latent space, allowing for more flexible and diverse data generation.
• Better Mode Coverage: Advanced GAN variants (e.g., WGAN, BigGAN) mitigate mode collapse and capture multiple data modes effectively.
• More Realistic Generations: GANs directly learn the data distribution and refine outputs using the discriminator, resulting in more realistic images.
• Strong Performance in High-Resolution Image Generation: Models like StyleGAN can generate ultra-high-resolution, photorealistic images.
• Better Suitability for Image Translation and Super-Resolution: GANs are widely used for style transfer, face aging, and super-resolution tasks where VAEs struggle.
• More Flexible Latent Representations: Unlike VAEs, GANs do not enforce smooth distributions, allowing for sharper and more structured feature representations.
• Works Well with Discrete Data: GANs have been successfully adapted for text generation (e.g., SeqGAN).
• More Popular in Creative Applications: GANs are used extensively in AI art, deepfake generation, and media content creation due to their high fidelity.
• No Need for KL Divergence Optimization: GANs do not suffer from the KL divergence balancing problem, making them more effective at capturing fine details.
