Unit 5 - Autoencoders and Generative Models

The document discusses autoencoders, a type of neural network used for data compression and reconstruction, detailing their architecture, properties, and various types including undercomplete, regularized, and denoising autoencoders. It highlights the importance of hyperparameters in training, applications such as image coloring and denoising, and provides implementation examples using the MNIST dataset. Additionally, it explains the advantages and disadvantages of different autoencoder types and the role of regularization in enhancing their performance.

Uploaded by

ajishaanilkumar560

UNIT V

AUTOENCODERS AND GENERATIVE MODELS

Autoencoders: Undercomplete autoencoders -- Regularized autoencoders -- Stochastic encoders and decoders -- Learning with autoencoders; Deep Generative Models: Variational autoencoders -- Generative adversarial networks.
=====
1. Introduction to Autoencoders
 Autoencoders are a specific type of feedforward neural network in which the target output is
the same as the input.
 They compress the input into a lower-dimensional code and then reconstruct the output
from this representation.
 The code is a compact “summary” or “compression” of the input, also called the latent-
space representation.
Components
 An autoencoder consists of 3 components: encoder, code and decoder.
 The encoder compresses the input and produces the code.
 The decoder then reconstructs the input using only this code.

To build an autoencoder we need 3 things:

 an encoding method, decoding method, and a loss function to compare the output with
the target.
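The three ingredients listed above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the weights `W_ENC` and `W_DEC` are hand-picked, hypothetical values (a real autoencoder learns them), and both encoder and decoder are single linear layers.

```python
# Toy autoencoder with hypothetical, hand-picked weights (no training involved).

def encode(x, w_enc):
    """Encoding method: map the input vector to a shorter code (one linear layer)."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in w_enc]

def decode(code, w_dec):
    """Decoding method: map the code back to the input dimensionality."""
    return [sum(w * c for w, c in zip(row, code)) for row in w_dec]

def mse(x, x_hat):
    """Loss function: mean squared error between the input and its reconstruction."""
    return sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)

# A 4-dimensional input is compressed to a 2-dimensional code.
W_ENC = [[0.5, 0.5, 0.0, 0.0],   # code[0] = average of the first two inputs
         [0.0, 0.0, 0.5, 0.5]]   # code[1] = average of the last two inputs
W_DEC = [[1.0, 0.0],
         [1.0, 0.0],
         [0.0, 1.0],
         [0.0, 1.0]]             # each code value is copied back to two outputs

x = [0.2, 0.4, 0.9, 0.7]
code = encode(x, W_ENC)       # 2 numbers
x_hat = decode(code, W_DEC)   # 4 numbers, a lossy reconstruction of x
loss = mse(x, x_hat)          # the target is the input itself
```

The loss is non-zero here because the code keeps only pairwise averages, which is exactly the "lossy" property described in the next section.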
Properties of Autoencoders
Autoencoders are mainly a dimensionality reduction (or compression) algorithm with a couple of
important properties:
 Data-specific: Autoencoders are only able to meaningfully compress data similar to what
they have been trained on. Since they learn features specific to the given training data,
they are different from a standard data compression algorithm like gzip. So we can’t
expect an autoencoder trained on handwritten digits to compress landscape photos.
 Lossy: The output of the autoencoder will not be exactly the same as the input; it will be
a close but degraded representation. If you want lossless compression, they are not the
way to go.
 Unsupervised: To train an autoencoder we don’t need to do anything fancy, just throw
the raw input data at it. Autoencoders are considered an unsupervised learning technique
since they don’t need explicit labels to train on. But to be more precise they are
self-supervised because they generate their own labels from the training data.

Autoencoder Architecture:
The network architecture for autoencoders can vary between a simple FeedForward network,
LSTM network or Convolutional Neural Network depending on the use case.

[Figure: a more detailed visualization of an autoencoder]

 First the input passes through the encoder, which is a fully-connected ANN, to produce
the code.
 The decoder, which has a similar ANN structure, then produces the output using only
the code.
 The goal is to get an output identical to the input.
 Note that the decoder architecture is the mirror image of the encoder.
 This is not a requirement but it’s typically the case.
 The only requirement is that the dimensionality of the input and output be the same.
Anything in the middle can be played with.

 The layer between the encoder and decoder, i.e. the code, is also known as the bottleneck or
latent-space representation.
 The bottleneck is a well-designed approach for deciding which aspects of the observed data are
relevant information and which aspects can be discarded.
It does this by balancing two criteria:
o Compactness of the representation, measured as its compressibility.
o Retention of some behaviourally relevant variables from the input.

There are 4 hyperparameters that we need to set before training an autoencoder:

 Code size: number of nodes in the middle layer. Smaller size results in more
compression.
 Number of layers: the autoencoder can be as deep as we like. In the figure above we have
2 layers in both the encoder and decoder, without considering the input and output.
 Number of nodes per layer: the autoencoder architecture we’re working on is called a
stacked autoencoder since the layers are stacked one after another. Usually stacked
autoencoders look like a “sandwich”. The number of nodes per layer decreases with each
subsequent layer of the encoder, and increases back in the decoder. Also, the decoder is
symmetric to the encoder in terms of layer structure. As noted above, this is not necessary
and we have total control over these parameters.
 Loss function: we either use mean squared error (mse) or binary crossentropy. If the
input values are in the range [0, 1] then we typically use crossentropy, otherwise we use
the mean squared error.
Autoencoders are trained the same way as ANNs, via backpropagation.
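To make the loss-function hyperparameter concrete, the sketch below computes both candidate losses for a tiny, made-up vector of pixel values in [0, 1]. The vectors `pixels`, `good` and `bad` are hypothetical, and the clipping constant `eps` is an assumption added only to keep the logarithm finite.

```python
import math

def mse(x, x_hat):
    """Mean squared error, usable for any real-valued input."""
    return sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)

def binary_crossentropy(x, x_hat, eps=1e-7):
    """Per-element binary crossentropy, intended for inputs in the range [0, 1]."""
    total = 0.0
    for a, b in zip(x, x_hat):
        b = min(max(b, eps), 1.0 - eps)   # keep log() away from 0
        total += -(a * math.log(b) + (1.0 - a) * math.log(1.0 - b))
    return total / len(x)

pixels = [0.0, 1.0, 1.0, 0.0]   # hypothetical input
good = [0.1, 0.9, 0.8, 0.2]     # reconstruction close to the input
bad = [0.9, 0.1, 0.2, 0.8]      # reconstruction far from the input

good_mse, bad_mse = mse(pixels, good), mse(pixels, bad)
good_bce = binary_crossentropy(pixels, good)
bad_bce = binary_crossentropy(pixels, bad)
```

Both losses rank the good reconstruction below the bad one; crossentropy simply punishes confident mistakes more heavily, which is one reason it is a common choice for [0, 1] data.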

Applications of Autoencoders

Image Coloring

Autoencoders are used for converting any black and white picture into a colored image.
Depending on what is in the picture, it is possible to tell what the color should be.
Feature variation
It extracts only the required features of an image and generates the output by removing any noise
or unnecessary interruption.

Dimensionality Reduction
The encoder maps the input to a code with fewer dimensions than the input, and the decoder
reconstructs an approximation of the input from that code. The code can therefore serve as a
reduced-dimension representation of the image.

Denoising Image
The input seen by the autoencoder is not the raw input but a stochastically corrupted version. A
denoising autoencoder is thus trained to reconstruct the original input from the noisy version.

Watermark Removal
It is also used for removing watermarks from images or to remove any object while filming a
video or a movie.

Implementation
Now let’s implement an autoencoder with the simplest possible architecture: a single fully
connected layer as the encoder and a single fully connected layer as the decoder.
We will use the extremely popular MNIST dataset as input. It contains black-and-white images
of handwritten digits.

They’re of size 28x28 and we use them as a vector of 784 numbers in the range [0, 1].
We will now implement the autoencoder with Keras. The hyperparameters are: a code size of 32
and binary crossentropy as the loss function.
Code:
Let’s import the required libraries
import numpy as np
from keras.layers import Input, Dense
from keras.models import Model
from keras.datasets import mnist
import matplotlib.pyplot as plt
Declaration of Hidden Layers and Variables
# this is the size of our encoded representations
encoding_dim = 32 # 32 floats -> compression of factor 24.5, assuming the input is 784 floats

# this is our input placeholder

input_img = Input(shape=(784,))

# "encoded" is the encoded representation of the input

encoded = Dense(encoding_dim, activation='relu')(input_img)

# "decoded" is the lossy reconstruction of the input

decoded = Dense(784, activation='sigmoid')(encoded)

# this model maps an input to its reconstruction

autoencoder = Model(input_img, decoded)

# this model maps an input to its encoded representation

encoder = Model(input_img, encoded)

# create a placeholder for an encoded (32-dimensional) input

encoded_input = Input(shape=(encoding_dim,))

# retrieve the last layer of the autoencoder model

decoder_layer = autoencoder.layers[-1]

# create the decoder model
decoder = Model(encoded_input, decoder_layer(encoded_input))

# configure our model to use a per-pixel binary crossentropy loss, and the Adadelta optimizer:
autoencoder.compile(optimizer='adadelta', loss='binary_crossentropy')

Preparing the input data (MNIST Dataset)

(x_train, _), (x_test, _) = mnist.load_data()
# normalize all values between 0 and 1 and flatten the 28x28 images
# into vectors of size 784

x_train = x_train.astype('float32') / 255.
x_test = x_test.astype('float32') / 255.

x_train = x_train.reshape((len(x_train), np.prod(x_train.shape[1:])))
x_test = x_test.reshape((len(x_test), np.prod(x_test.shape[1:])))

print(x_train.shape)  # (60000, 784)
print(x_test.shape)   # (10000, 784)
Training Autoencoders for 50 epochs
autoencoder.fit(x_train, x_train,
                epochs=50,
                batch_size=256,
                shuffle=True,
                validation_data=(x_test, x_test))
# encode and decode some digits
# note that we take them from the *test* set
encoded_imgs = encoder.predict(x_test)
decoded_imgs = decoder.predict(encoded_imgs)
Visualizing the reconstructed inputs and the encoded representations using Matplotlib

n = 20  # how many digits we will display
plt.figure(figsize=(20, 4))
for i in range(n):
    # display original
    ax = plt.subplot(2, n, i + 1)
    plt.imshow(x_test[i].reshape(28, 28))
    plt.gray()
    ax.get_xaxis().set_visible(False)
    ax.get_yaxis().set_visible(False)

    # display reconstruction
    ax = plt.subplot(2, n, i + 1 + n)
    plt.imshow(decoded_imgs[i].reshape(28, 28))
    plt.gray()
    ax.get_xaxis().set_visible(False)
    ax.get_yaxis().set_visible(False)
plt.show()
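As a quick sanity check on the model built above, the arithmetic behind its size can be worked out by hand. The helper `dense_params` is a hypothetical name introduced here; it only counts the weights and biases of one fully connected layer.

```python
def dense_params(n_in, n_out):
    """Trainable parameters of a fully connected layer: weight matrix plus biases."""
    return n_in * n_out + n_out

input_dim = 784      # flattened 28x28 image
encoding_dim = 32    # code size used in the example above

encoder_params = dense_params(input_dim, encoding_dim)   # 784*32 + 32
decoder_params = dense_params(encoding_dim, input_dim)   # 32*784 + 784
total_params = encoder_params + decoder_params
compression_factor = input_dim / encoding_dim            # 784 / 32

print(encoder_params, decoder_params, total_params, compression_factor)
```

The compression factor of 24.5 matches the comment next to `encoding_dim` in the listing above.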

====
2. Undercomplete autoencoders

 An undercomplete autoencoder is an unsupervised neural network that you can use to generate
a compressed version of the input data.
 It is done by taking in an image and trying to predict the same image as output, thus
reconstructing the image from its compressed bottleneck region.
 The primary use for autoencoders like these is generating a latent space or bottleneck, which
forms a compressed substitute of the input data and can be easily decompressed back with
the help of the network when needed.
Undercomplete autoencoders learn features by minimizing a loss function of the form:

L(x, g(f(x)))

where f is the encoder, g is the decoder, and L is a loss function penalizing g(f(x)) for
diverging from the original input x. L can be a mean squared error or even a mean absolute error.

 The goal of the autoencoder is to capture the most important features present in the data.
 Undercomplete autoencoders have a smaller dimension for the hidden layer than for the input
layer. This helps to obtain important features from the data.
 The objective is to minimize the loss function by penalizing g(f(x)) for being different from
the input x.
 When the decoder is linear and we use a mean squared error loss function, the undercomplete
autoencoder learns a reduced feature space similar to PCA.
 We get a powerful nonlinear generalization of PCA when the encoder function f and decoder
function g are nonlinear.
 Undercomplete autoencoders are usually trained without an explicit regularization term: the
narrow code itself keeps the network from simply copying the input to the output, provided the
encoder and decoder are not given too much capacity.
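The PCA connection mentioned in the list above can be checked numerically on a toy problem. The sketch below is a simplified illustration, not a trained network: it assumes a linear autoencoder with a 1-dimensional code and tied weights (encoder f(x) = u . x, decoder g(h) = h * u, with u a unit vector), searches all directions by brute force for the lowest mean squared error, and compares the winner with the first principal direction. The data points are made up.

```python
import math

# Hypothetical 2-D data lying close to the line y = 2x.
xs = [-2.0, -1.0, 0.0, 1.0, 2.0]
noise = [0.1, -0.1, 0.0, 0.1, -0.1]
points = [(x, 2.0 * x + n) for x, n in zip(xs, noise)]

# Center the data, as PCA does.
mean_x = sum(p[0] for p in points) / len(points)
mean_y = sum(p[1] for p in points) / len(points)
centered = [(px - mean_x, py - mean_y) for px, py in points]

def reconstruction_mse(angle):
    """MSE of a tied-weight linear autoencoder whose code direction has this angle."""
    ux, uy = math.cos(angle), math.sin(angle)
    total = 0.0
    for px, py in centered:
        h = px * ux + py * uy        # encoder: 1-D code
        rx, ry = h * ux, h * uy      # decoder: back to 2-D
        total += (px - rx) ** 2 + (py - ry) ** 2
    return total / len(centered)

# Brute-force "training": try every whole-degree direction.
best_deg = min(range(180), key=lambda d: reconstruction_mse(math.radians(d)))

# First principal direction from the 2x2 scatter matrix (closed form).
sxx = sum(px * px for px, _ in centered)
syy = sum(py * py for _, py in centered)
sxy = sum(px * py for px, py in centered)
pca_deg = math.degrees(0.5 * math.atan2(2.0 * sxy, sxx - syy)) % 180.0
```

The direction that minimizes the reconstruction error agrees with the principal direction to within the one-degree search resolution.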

Advantages
 Undercomplete autoencoders, with code dimension less than the input dimension, can learn
the most salient features of the data distribution.
Disadvantages
 These autoencoders fail to learn anything useful if the encoder and decoder
are given too much capacity.
 A similar problem occurs if the hidden code is allowed to have dimension equal to the input,
and in the overcomplete case in which the hidden code has dimension greater than the input.
In these cases, even a linear encoder and a linear decoder can learn to copy the input to the
output without learning anything useful about the data distribution.
 Ideally, one could train any architecture of autoencoder successfully, choosing the code
dimension and the capacity of the encoder and decoder based on the complexity of
distribution to be modelled.
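The copying failure described above is easy to reproduce. In this minimal sketch (hypothetical names, linear layers only), the code dimension equals the input dimension and both weight matrices are the identity, so the reconstruction loss is exactly zero although nothing about the data has been learned.

```python
def linear_layer(x, weights):
    """Apply a linear layer without bias: one output per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]

def identity(n):
    """n x n identity matrix as a list of rows."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

x = [0.3, -1.2, 4.0]
W_enc = identity(3)   # code dimension equals input dimension
W_dec = identity(3)

code = linear_layer(x, W_enc)        # the code is just a copy of x
x_hat = linear_layer(code, W_dec)    # and so is the reconstruction
loss = sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)
```

A perfect loss here says nothing about the quality of the representation, which is why a small code or a regularizer is needed.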
===

3. Regularized autoencoders

Regularized autoencoders provide the ability to do so. Rather than limiting the model capacity by
keeping the encoder and decoder shallow and the code size small, regularized autoencoders use a
loss function that encourages the model to have other properties besides the ability to copy its
input to its output. These other properties include sparsity of the representation, smallness of the
derivative of the representation, and robustness to noise or to missing inputs. A regularized
autoencoder can be nonlinear and overcomplete but still learn something useful about the data
distribution, even if the model capacity is great enough to learn a trivial identity function.

In addition to the methods described here, which are most naturally interpreted as regularized
autoencoders, nearly any generative model with latent variables and equipped with an inference
procedure (for computing latent representations given input) may be viewed as a particular form
of autoencoder.
In practice, we usually find two types of regularized autoencoder: the sparse autoencoder and
the denoising autoencoder.

(i) Sparse autoencoder : Sparse autoencoders are typically used to learn features for another task
such as classification. An autoencoder that has been regularized to be sparse must respond to
unique statistical features of the dataset it has been trained on, rather than simply acting as an
identity function. In this way, training to perform the copying task with a sparsity penalty can
yield a model that has learned useful features as a byproduct.
Another way we can constrain the reconstruction of an autoencoder is to impose a constraint on its
loss. We could, for example, add a regularization term to the loss function. Doing this will make
our autoencoder learn a sparse representation of the data.

There are actually two different ways to construct our sparsity penalty: L1
regularization and KL-divergence.

Why L1 Regularization Leads to Sparsity
L1 regularization and L2 regularization are widely used in machine learning and deep learning.
L1 regularization adds “absolute value of magnitude” of coefficients as penalty term while L2
regularization adds “squared magnitude” of coefficient as a penalty term.
Although L1 and L2 can both be used as regularization terms, the key difference between them is
that L1 regularization tends to shrink coefficients exactly to zero, while L2 regularization
moves coefficients towards zero without ever reaching it. Thus L1 regularization is often
used as a method of feature selection. But why does L1 regularization lead to sparsity?
Consider two loss functions, L1 and L2, which represent L1 regularization and L2
regularization of a single weight w respectively.

Gradient descent is the standard way of optimizing neural networks. If we plot these two loss
functions and their derivatives, it looks like this:

[Figure: L1 regularization and its derivative]

[Figure: L2 regularization and its derivative]

We can notice that for L1 regularization, the gradient is either 1 or -1 except when w=0, which
means that L1 regularization will always move w towards zero with the same step size
regardless of the value of w. And when w=0, the gradient becomes zero and no further update is
made. However, for L2 regularization things are different. L2 regularization will also
move w towards zero, but the step size becomes smaller and smaller as w shrinks, which means
that w will never reach zero.

Loss Function
Finally, after the above analysis, we get the idea of using L1 regularization in a sparse
autoencoder. In its loss function, in addition to the first two terms, we add a third term which
penalizes the absolute value of the vector of activations a in layer h for sample i. A
hyperparameter controls its effect on the whole loss function. In this way, we build a sparse
autoencoder.
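The gradient behaviour of the two penalties described in this section can be sketched with a single weight. The exact definitions of the two losses did not survive in the text, so this example assumes L1(w) = |w| and L2(w) = w^2 / 2 (gradients sign(w) and w). The clipping in `l1_step` is also an assumption, added so that a finite step size lands on zero instead of overshooting it.

```python
def sign(w):
    """Return -1, 0 or 1 depending on the sign of w."""
    return (w > 0) - (w < 0)

def l1_step(w, lr):
    """One gradient step on |w|; the step is clipped so that w stops at zero."""
    if abs(w) <= lr:
        return 0.0
    return w - lr * sign(w)      # constant-size step, whatever the value of w

def l2_step(w, lr):
    """One gradient step on w**2 / 2, whose gradient is w itself."""
    return w - lr * w            # the step shrinks together with w

w_l1 = w_l2 = 1.0
lr = 0.1
for _ in range(50):
    w_l1 = l1_step(w_l1, lr)
    w_l2 = l2_step(w_l2, lr)

print(w_l1, w_l2)
```

After 50 steps the L1-penalized weight is exactly zero, while the L2-penalized weight has only decayed geometrically and is still positive.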
Visualization
We tried to build a deep autoencoder and train it on MNIST dataset without L1 regularization
and with regularization. The structure of this deep autoencoder is plotted as below:

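Code:
from keras import regularizers
from keras.layers import Input, Dense
from keras.models import Model

input_size = 784
hidden_size = 64
output_size = 784

x = Input(shape=(input_size,))

# Encoder: the L1 activity regularizer penalizes the absolute values of the hidden activations
h = Dense(hidden_size, activation='relu', activity_regularizer=regularizers.l1(10e-5))(x)

# Decoder
r = Dense(output_size, activation='sigmoid')(h)

# Keras 2 uses the keyword arguments `inputs` and `outputs`
autoencoder = Model(inputs=x, outputs=r)

autoencoder.compile(optimizer='adam', loss='mse')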

Notice that in our hidden layer we added an L1 activity regularizer, which applies a penalty to the
loss function during the optimization phase. As a result, the representation is now sparser
compared to the vanilla autoencoder.
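What the activity regularizer contributes to the loss can be written out by hand. The sketch below is an approximation for illustration only: `sparse_loss`, `lam` and the example vectors are hypothetical, and the way a framework averages the penalty over a batch may differ from this single-sample version.

```python
def sparse_loss(x, x_hat, activations, lam=0.01):
    """Reconstruction error plus an L1 penalty on the hidden activations."""
    reconstruction = sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)
    sparsity_penalty = lam * sum(abs(a) for a in activations)
    return reconstruction + sparsity_penalty

x = [1.0, 0.0]
x_hat = [0.9, 0.1]                      # same reconstruction in both cases

dense_code = [0.5, 0.5, 0.5, 0.5]       # every hidden unit is active
sparse_code = [1.0, 0.0, 0.0, 0.0]      # a single hidden unit is active

dense_total = sparse_loss(x, x_hat, dense_code)
sparse_total = sparse_loss(x, x_hat, sparse_code)
```

With identical reconstruction quality, the code with fewer and smaller activations gets the lower total loss, so the optimizer is pushed towards sparse representations.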
And after 100 epochs of training using 128 batch size and Adam as the optimizer, we got below
results:

(ii) Denoising autoencoder :

Denoising autoencoders are a specific type of neural network that enables unsupervised learning
of data representations or encodings. Their primary objective is to reconstruct the original
version of the input signal corrupted by noise. This capability proves valuable in problems such
as image recognition or fraud detection, where the goal is to recover the original signal from its
noisy form.
An autoencoder consists of two main components:
 Encoder: This component maps the input data into a low-dimensional representation or
encoding.

 Decoder: This component returns the encoding to the original data space.
During the training phase, the autoencoder is presented with a set of clean input examples along
with their corresponding noisy versions. The objective is to learn, using an encoder-decoder
architecture, a mapping that efficiently transforms noisy input into clean output.
Architecture of DAE
The denoising autoencoder (DAE) architecture is similar to a standard autoencoder. It consists of
two main components:
Encoder
 The encoder is a neural network equipped with one or more hidden layers.
 Its purpose is to receive noisy input data and generate an encoding, which is a
low-dimensional representation of the data.
 The encoder can be understood as a compression function because the encoding has fewer
dimensions than the input data.
Decoder
 Decoder acts as an expansion function, which is responsible for reconstructing the
original data from the compressed encoding.
 It takes as input the encoding generated by the encoder and reconstructs the original data.
 Like encoders, decoders are implemented as neural networks featuring one or more
hidden layers.

During the training phase, the denoising autoencoder (DAE) is presented with a collection of clean
input examples along with their respective noisy counterparts. The objective is to acquire a
function that maps a noisy input to a relatively clean output using an encoder-decoder
architecture. To achieve this, a reconstruction loss function is typically employed to evaluate the
disparity between the clean input and the reconstructed output. A DAE is trained by minimizing
this loss through the use of backpropagation, which involves updating the weights of both
encoder and decoder components.
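The training pairs described above can be produced by corrupting clean inputs. The sketch below assumes additive Gaussian noise clipped back to [0, 1]; the function name, the `noise_factor` value and the sample vector are hypothetical. An identity "model" stands in for the network, which gives the baseline loss a trained DAE is expected to beat.

```python
import random

def add_gaussian_noise(x, noise_factor, rng):
    """Return a corrupted copy of x: add Gaussian noise, then clip to [0, 1]."""
    return [min(1.0, max(0.0, v + noise_factor * rng.gauss(0.0, 1.0))) for v in x]

def mse(target, output):
    """Reconstruction loss, measured against the CLEAN input."""
    return sum((t - o) ** 2 for t, o in zip(target, output)) / len(target)

rng = random.Random(0)                 # fixed seed for repeatability
clean = [0.0, 0.2, 0.8, 1.0, 0.5]      # hypothetical clean input
noisy = add_gaussian_noise(clean, 0.3, rng)

# Training pair for a denoising autoencoder: input = noisy, target = clean.
# With an identity stand-in model the output is just the noisy input.
baseline_loss = mse(clean, noisy)
```

In a Keras setting this would correspond to fitting on the noisy array as input and the clean array as target, instead of using the same array for both.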

Code:
from keras.layers import Input, Conv2D, MaxPooling2D, UpSampling2D
from keras.models import Model

x = Input(shape=(28, 28, 1))

# Encoder: two conv + pooling stages reduce 28x28 to a 7x7x32 code
conv1_1 = Conv2D(32, (3, 3), activation='relu', padding='same')(x)
pool1 = MaxPooling2D((2, 2), padding='same')(conv1_1)
conv1_2 = Conv2D(32, (3, 3), activation='relu', padding='same')(pool1)
h = MaxPooling2D((2, 2), padding='same')(conv1_2)

# Decoder: two conv + upsampling stages restore the 28x28 resolution
conv2_1 = Conv2D(32, (3, 3), activation='relu', padding='same')(h)
up1 = UpSampling2D((2, 2))(conv2_1)
conv2_2 = Conv2D(32, (3, 3), activation='relu', padding='same')(up1)
up2 = UpSampling2D((2, 2))(conv2_2)
r = Conv2D(1, (3, 3), activation='sigmoid', padding='same')(up2)

# Keras 2 uses the keyword arguments `inputs` and `outputs`
autoencoder = Model(inputs=x, outputs=r)

autoencoder.compile(optimizer='adam', loss='mse')

Applications of Denoising Autoencoders (DAEs) span a variety of domains, including computer
vision, speech processing, and natural language processing.
Examples
 Image Denoising: DAEs are effective in removing noise from images, such as Gaussian
noise or salt-and-pepper noise.
 Fraud Detection: DAEs can contribute to identifying fraudulent transactions by learning
to reconstruct common transactions from their noisy counterparts.

 Data Imputation: By learning to reconstruct missing values from the available data, DAEs
can facilitate data imputation in datasets with incomplete information.
 Data Compression: DAEs can compress data by obtaining a concise representation of
the data in the encoding space.
 Anomaly Detection: Using DAEs, anomalies in a dataset can be detected by training a
model to reconstruct normal data and then flagging inputs that it reconstructs poorly as
potentially abnormal.
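The anomaly-detection idea from the list above comes down to thresholding the reconstruction error. In this sketch a fixed linear projection stands in for a trained autoencoder (an assumption made to keep the example self-contained), and the rule "three times the largest error seen on normal data" is a hypothetical threshold, not a recommended setting.

```python
import math

# Stand-in for a trained autoencoder: project onto the direction of the line y = 2x,
# which is where the "normal" data below lives.
U = (1.0 / math.sqrt(5.0), 2.0 / math.sqrt(5.0))

def reconstruct(p):
    """Encode a 2-D point to a 1-D code and decode it again."""
    h = p[0] * U[0] + p[1] * U[1]
    return (h * U[0], h * U[1])

def reconstruction_error(p):
    """Squared distance between a point and its reconstruction."""
    r = reconstruct(p)
    return (p[0] - r[0]) ** 2 + (p[1] - r[1]) ** 2

normal_points = [(1.0, 2.1), (2.0, 3.9), (-1.0, -2.0), (0.5, 1.1)]
threshold = 3.0 * max(reconstruction_error(p) for p in normal_points)

def is_anomaly(p):
    """Flag points the model reconstructs much worse than any normal point."""
    return reconstruction_error(p) > threshold

typical = (3.0, 6.05)    # close to the normal pattern
unusual = (2.0, -1.0)    # far from it
```

Points that follow the pattern of the training data are reconstructed well, while points off the pattern produce a large error and are flagged.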

4. Stochastic Encoders and Decoders

Generative Models

Loss function for Stochastic Decoder
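The formula that belonged under this heading did not survive in the text. As a hedged reminder of the usual formulation (an assumption about what the missing figure showed, not a reconstruction of it): when the decoder is treated as a conditional distribution over inputs given the code h = f(x), training minimizes the negative log-likelihood of the input. The symbols p_decoder, sigma and x-hat below are notation introduced for this note.

```latex
% General form: negative log-likelihood of x under the stochastic decoder
L = -\log p_{\text{decoder}}(x \mid h), \qquad h = f(x)

% Gaussian decoder with mean g(h) and fixed variance \sigma^2: squared error up to a constant
-\log p_{\text{decoder}}(x \mid h) = \frac{1}{2\sigma^2}\,\lVert x - g(h) \rVert^2 + \text{const}

% Bernoulli decoder with component means \hat{x}_i = g(h)_i: binary crossentropy
-\log p_{\text{decoder}}(x \mid h) = -\sum_i \bigl[\, x_i \log \hat{x}_i + (1 - x_i) \log (1 - \hat{x}_i) \,\bigr]
```

Under this view, the two losses used earlier in these notes (mean squared error and binary crossentropy) correspond to Gaussian and Bernoulli decoders respectively.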

=====

5. Learning with autoencoders; Deep Generative Models: Variational autoencoders

Variational Autoencoders (VAEs) are generative models explicitly designed to capture the
underlying probability distribution of a given dataset and generate novel samples. They utilize an
architecture that comprises an encoder-decoder structure. The encoder transforms input data into
a latent form, and the decoder aims to reconstruct the original data based on this latent
representation. The VAE is trained to minimize the dissimilarity between the original and
reconstructed data, enabling it to capture the underlying data distribution and generate new
samples that conform to the same distribution.

One notable advantage of VAEs is their ability to generate new data samples resembling the
training data. Because the VAE’s latent space is continuous, the decoder can generate new data
points that seamlessly interpolate among the training data points. VAEs find applications in
various domains like density estimation and text generation.
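Interpolation in a continuous latent space can be sketched without any network. The helper below walks in a straight line between two latent vectors; feeding each intermediate vector to a trained decoder would produce the gradual transition described above. The vectors `z_a` and `z_b` are hypothetical codes, and `steps` is assumed to be at least 2.

```python
def interpolate(z_a, z_b, steps):
    """Return `steps` latent vectors evenly spaced from z_a to z_b (both included)."""
    path = []
    for k in range(steps):
        t = k / (steps - 1)   # goes from 0.0 to 1.0
        path.append([(1.0 - t) * a + t * b for a, b in zip(z_a, z_b)])
    return path

z_a = [0.0, 1.0]     # hypothetical latent code of one training example
z_b = [2.0, -1.0]    # hypothetical latent code of another
path = interpolate(z_a, z_b, 5)
```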

The Architecture of Variational Autoencoder

A VAE typically has two major components: an encoder network and a decoder network.
The encoder network transforms the input data into a low-dimensional latent space; the result is
often called a “latent code”.
Various neural network topologies, such as fully connected or convolutional neural networks,
can be used to implement the encoder network. The architecture chosen is based on the
characteristics of the data. The encoder network produces the parameters, such as the mean
and variance of a Gaussian distribution, needed for sampling the latent code.

A VAE comprises an encoder network that maps input data to a latent code and a decoder
network that conducts the inverse operation by translating the latent code back to the
reconstruction data. By undergoing this training process, the VAE learns an optimized latent
representation that captures the fundamental characteristics of the data, enabling precise
reconstruction.
It achieves this by doing something that seems rather surprising at first: instead of having its
encoder output a single encoding vector of size n, it outputs two vectors of size n: a vector of
means, μ, and another vector of standard deviations, σ.

[Figure: Variational Autoencoder]
They form the parameters of a vector of random variables of length n, with the i-th element
of μ and σ being the mean and standard deviation of the i-th random variable, X_i, from which we
sample to obtain the sampled encoding that we pass onward to the decoder:

[Figure: Stochastically generating encoding vectors]
This stochastic generation means that, even for the same input, while the mean and standard
deviations remain the same, the actual encoding will vary somewhat on every single pass simply
due to sampling.
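The effect of sampling can be seen with plain Python. This sketch assumes the encoder has already produced a mean vector and a log-standard-deviation vector for one input (the numbers are made up) and draws encodings as mean + stddev * noise, with noise taken from a standard normal distribution.

```python
import math
import random

def sample_encoding(mean, log_stddev, rng):
    """Draw one encoding: z_i = mean_i + exp(log_stddev_i) * eps_i, eps_i ~ N(0, 1)."""
    return [m + math.exp(s) * rng.gauss(0.0, 1.0) for m, s in zip(mean, log_stddev)]

rng = random.Random(42)                       # fixed seed for repeatability
mean = [0.5, -1.0]                            # hypothetical encoder outputs
log_stddev = [math.log(0.1), math.log(0.2)]   # i.e. stddev 0.1 and 0.2

z1 = sample_encoding(mean, log_stddev, rng)   # two passes over the same input
z2 = sample_encoding(mean, log_stddev, rng)   # give two different encodings

# Averaged over many passes, the encodings concentrate around the mean vector.
samples = [sample_encoding(mean, log_stddev, rng) for _ in range(5000)]
avg = [sum(s[i] for s in samples) / len(samples) for i in range(len(mean))]
```

Individual encodings differ from pass to pass, but they stay within a neighbourhood of the mean whose size is set by the standard deviation.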

Code:
from keras import backend as K
from keras.layers import Dense, Lambda

# Build your encoder up to here. It can simply be a series of dense layers, a convolutional
# network or even an LSTM. Once made, flatten out the final layer of the encoder and call it
# `hidden` (assumed to be defined above this snippet).

# we use Keras to build the graph
latent_size = 5

mean = Dense(latent_size)(hidden)

# we usually don't directly compute the stddev σ but the log of the stddev, log(σ), instead;
# the network can then output any real number, which exp() later maps to a positive stddev
log_stddev = Dense(latent_size)(hidden)

def sampler(args):
    # Lambda passes its inputs as a single list, so unpack it here
    mean, log_stddev = args
    # we sample from the standard normal a matrix of batch_size * latent_size
    # (taking into account minibatches)
    std_norm = K.random_normal(shape=(K.shape(mean)[0], latent_size), mean=0, stddev=1)
    # sampling from Z~N(μ, σ^2) is the same as sampling from μ + σX, X~N(0,1)
    return mean + K.exp(log_stddev) * std_norm

latent_vector = Lambda(sampler)([mean, log_stddev])


=======

6. Learning with autoencoders; Deep Generative Models: Generative Adversarial
Networks
Generative Adversarial Networks (GANs) were introduced in 2014 by Ian J. Goodfellow and
co-authors. GANs perform unsupervised learning tasks in machine learning. A GAN consists of 2
models that automatically discover and learn the patterns in input data.
The two models are known as the Generator and the Discriminator.
They compete with each other to scrutinize, capture, and replicate the variations within a dataset.
GANs can be used to generate new examples that plausibly could have been drawn from the
original dataset.
Shown below is an example of a GAN. There is a database that has real 100 rupee notes. The
generator neural network generates fake 100 rupee notes. The discriminator network will help
identify the real and fake notes.

What is a Generator?
A Generator in GANs is a neural network that creates fake data on which the
discriminator is trained. It learns to generate plausible data. The generated examples/instances
become negative training examples for the discriminator. It takes a fixed-length random noise
vector as input and generates a sample.

The main aim of the Generator is to make the discriminator classify its output as real. The part of
the GAN that trains the Generator includes:
 noisy input vector
 generator network, which transforms the random input into a data instance
 discriminator network, which classifies the generated data
 generator loss, which penalizes the Generator for failing to fool the discriminator
The backpropagation method is used to adjust each weight in the right direction by calculating
the weight's impact on the output. It is also used to obtain gradients and these gradients can help
change the generator weights.


What is a Discriminator?
The Discriminator is a neural network that distinguishes real data from the fake data created by
the Generator. The discriminator's training data comes from two different sources:
 The real data instances, such as real pictures of birds, humans, currency notes, etc., are
used by the Discriminator as positive samples during training.
 The fake data instances created by the Generator are used as negative examples during
the training process.

The discriminator connects to two loss functions. During discriminator training,
the discriminator ignores the generator loss and just uses the discriminator loss.
In the process of training the discriminator, the discriminator classifies both real data and fake
data from the generator. The discriminator loss penalizes the discriminator for misclassifying a
real data instance as fake or a fake data instance as real.
The discriminator updates its weights through backpropagation from the discriminator loss
through the discriminator network.
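The two losses discussed in this section can be written down for a handful of discriminator outputs. This is a sketch under assumptions: the discriminator is taken to output a probability that its input is real, the generator loss shown is the commonly used non-saturating variant, and all probabilities below are made-up numbers rather than results.

```python
import math

def discriminator_loss(d_real, d_fake):
    """Binary crossentropy: real samples should score near 1, fake samples near 0."""
    real_term = -sum(math.log(p) for p in d_real) / len(d_real)
    fake_term = -sum(math.log(1.0 - p) for p in d_fake) / len(d_fake)
    return real_term + fake_term

def generator_loss(d_fake):
    """Non-saturating generator loss: low when the discriminator calls fakes real."""
    return -sum(math.log(p) for p in d_fake) / len(d_fake)

# Case 1: the discriminator is doing well.
sharp_real, sharp_fake = [0.9, 0.8], [0.1, 0.2]
# Case 2: the discriminator is being fooled.
fooled_real, fooled_fake = [0.4, 0.3], [0.7, 0.6]

d_loss_sharp = discriminator_loss(sharp_real, sharp_fake)
d_loss_fooled = discriminator_loss(fooled_real, fooled_fake)
g_loss_sharp = generator_loss(sharp_fake)
g_loss_fooled = generator_loss(fooled_fake)
```

The two losses pull in opposite directions: whatever lowers the discriminator loss on fake samples raises the generator loss, which is the adversarial game in numerical form.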

How Do GANs Work?

A GAN consists of two neural networks: a Generator G(z) and a Discriminator D(x).
Both of them play an adversarial game. The generator's aim is to fool the discriminator by
producing data that are similar to those in the training set. The discriminator tries not to be
fooled, by telling fake data apart from real data. The two networks are trained simultaneously
and can learn complex data like audio, video, or image files.
The Generator network takes a random noise sample and generates a fake sample of data. The
Generator is trained to increase the Discriminator network's probability of making mistakes.

Below is an example of a GAN trying to identify if the 100 rupee notes are real or fake. So, first,
a noise vector or the input vector is fed to the Generator network. The generator creates fake 100
rupee notes. The real images of 100 rupee notes stored in a database are passed to the
discriminator along with the fake notes. The Discriminator then classifies each note
as real or fake.
We train the model, calculate the loss function at the end of the discriminator network, and
backpropagate the loss into both discriminator and generator models.

Mathematical Equation

The mathematical equation for training a GAN can be represented as:

\min_G \max_D V(D, G) = \mathbb{E}_{x \sim P_{data}(x)}[\log D(x)] + \mathbb{E}_{z \sim p(z)}[\log(1 - D(G(z)))]

Here,
G = Generator
D = Discriminator
Pdata(x) = distribution of real data
p(z) = distribution of the generator's input noise
x = sample from Pdata(x)
z = sample from p(z)
D(x) = Discriminator network
G(z) = Generator network
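The value function built from the quantities listed above can be estimated from samples by replacing the two expectations with averages. The sketch assumes we already have the discriminator's outputs D(x) on real samples and D(G(z)) on generated samples; the numbers are illustrative only.

```python
import math

def value_function(d_real, d_fake):
    """Sample estimate of V(D, G): mean of log D(x) plus mean of log(1 - D(G(z)))."""
    real_part = sum(math.log(p) for p in d_real) / len(d_real)
    fake_part = sum(math.log(1.0 - p) for p in d_fake) / len(d_fake)
    return real_part + fake_part

# A discriminator that separates real from fake well gives a value close to 0.
v_sharp = value_function([0.9] * 4, [0.1] * 4)

# A discriminator that cannot tell them apart outputs 0.5 everywhere,
# which gives log(0.5) + log(0.5) = -log(4).
v_undecided = value_function([0.5] * 4, [0.5] * 4)
```

The discriminator tries to push this value up and the generator tries to push it down; the value -log 4 corresponds to the situation where the discriminator can do no better than guessing.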

Code:
Building the Generative Adversarial Network

import torch.nn as nn
import torch.optim as optim

# Generator, Discriminator, latent_dim, device, lr, beta1 and beta2 are assumed to be
# defined earlier (the class definitions are not part of this excerpt)

# Initialize generator and discriminator
generator = Generator(latent_dim).to(device)
discriminator = Discriminator().to(device)

# Loss function
adversarial_loss = nn.BCELoss()

# Optimizers
optimizer_G = optim.Adam(generator.parameters(), lr=lr, betas=(beta1, beta2))
optimizer_D = optim.Adam(discriminator.parameters(), lr=lr, betas=(beta1, beta2))

Check the results
Let’s plot the generated images at different epochs to see after how many epochs the
generator is able to extract some information.

Plot the generated image at zero epochs

from skimage.io import imread

a = imread('gan_images/0.png')
plt.imshow(a)

The generator has not extracted any information yet, and the discriminator can easily
identify the image as fake.
Plot the image generated after training for 1000 epochs
from skimage.io import imread
a = imread('gan_images/1000.png')
plt.imshow(a)

Now the generator is slowly becoming able to extract some information, which can be observed
in the image.
Plot the image generated after training for 10000 epochs

Now the generator can produce images that closely resemble those of the MNIST dataset, and
there is a high chance of the Discriminator being fooled.


