Module 7.1: Introduction to Autoencoders
An autoencoder is a special type of feed forward neural network which does the following:
it encodes its input xi into a hidden representation h, and
it decodes the input again from this hidden representation.
The model is trained to minimize a certain loss function which will ensure that x̂i is close to xi (we will see some such loss functions soon).

[Figure: encoder-decoder network xi → h → x̂i, with encoder weights W and decoder weights W∗]

h = g(W xi + b)
x̂i = f(W∗ h + c)
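A minimal sketch of this encode/decode computation in NumPy; the dimensions, the random initialisation, and the choice of sigmoid for both g and f are illustrative assumptions, not prescribed by the lecture.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# illustrative sizes: 6-dimensional input, 3-dimensional hidden code
n, d = 6, 3
rng = np.random.default_rng(0)
W  = rng.normal(scale=0.1, size=(d, n))   # encoder weights W
b  = np.zeros(d)                          # encoder bias b
Ws = rng.normal(scale=0.1, size=(n, d))   # decoder weights W*
c  = np.zeros(n)                          # decoder bias c

x = rng.random(n)                         # one input x_i
h    = sigmoid(W @ x + b)                 # h = g(W x_i + b)
xhat = sigmoid(Ws @ h + c)                # x̂_i = f(W* h + c)
```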
Let us consider the case where dim(h) < dim(xi).
If we are still able to reconstruct x̂i perfectly from h, then what does it say about h?
h is a loss-free encoding of xi: it captures all the important characteristics of xi.
Do you see an analogy with PCA?

[Figure: same network, now with a narrower hidden layer h]

h = g(W xi + b)
x̂i = f(W∗ h + c)

An autoencoder where dim(h) < dim(xi) is called an undercomplete autoencoder.
Let us consider the case when dim(h) ≥ dim(xi).
In such a case the autoencoder could learn a trivial encoding by simply copying xi into h and then copying h into x̂i.
Such an identity encoding is useless in practice as it does not really tell us anything about the important characteristics of the data.

[Figure: same network, now with a wider hidden layer h]

h = g(W xi + b)
x̂i = f(W∗ h + c)

An autoencoder where dim(h) ≥ dim(xi) is called an overcomplete autoencoder.
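To make the overcomplete case concrete, the sketch below (all dimensions made up, and g, f taken as linear purely for illustration) shows how such an autoencoder can reconstruct xi perfectly by simply copying it into h and back, which is exactly the useless identity encoding described above.

```python
import numpy as np

d_in  = 6   # dim(x_i)
d_over = 8  # dim(h) >= dim(x_i): overcomplete

# identity-like maps: W copies x_i into the first d_in components of h
# (padding the rest with zeros), and W* copies them straight back.
W  = np.vstack([np.eye(d_in), np.zeros((d_over - d_in, d_in))])  # (8, 6)
Ws = np.hstack([np.eye(d_in), np.zeros((d_in, d_over - d_in))])  # (6, 8)

x = np.random.default_rng(1).random(d_in)
h    = W @ x            # h = [x; 0], a trivial copy of the input
xhat = Ws @ h           # x̂_i = x_i exactly
assert np.allclose(xhat, x)   # perfect reconstruction, yet h tells us nothing new
```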
The Road Ahead
Choice of f and g
Choice of loss function
Suppose all our inputs are binary (each xij ∈ {0, 1}).
Which of the following functions would be most apt for the decoder?

x̂i = tanh(W∗ h + c)
x̂i = W∗ h + c
x̂i = logistic(W∗ h + c)

Logistic, as it naturally restricts all outputs to be between 0 and 1.
g is typically chosen as the sigmoid function.

[Figure: network with h = g(W xi + b), x̂i = f(W∗ h + c); input 0 1 1 0 1 (binary inputs)]
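A tiny illustration (the pre-activation values are made up) of why the logistic choice is apt: whatever W∗h + c turns out to be, each output lands strictly between 0 and 1 and can be read as the probability that xij = 1.

```python
import numpy as np

def logistic(z):
    return 1.0 / (1.0 + np.exp(-z))

pre_activation = np.array([-4.0, -0.5, 0.0, 2.0, 7.0])   # W* h + c, made-up values
x_hat = logistic(pre_activation)
print(x_hat)   # every entry lies strictly between 0 and 1
```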
Suppose all our inputs are real (each xij ∈ R).
Which of the following functions would be most apt for the decoder?

x̂i = tanh(W∗ h + c)
x̂i = W∗ h + c
x̂i = logistic(W∗ h + c)

What will logistic and tanh do? They will restrict the reconstructed x̂i to lie in [0, 1] or [-1, 1] whereas we want x̂i ∈ Rn, so the linear decoder x̂i = W∗ h + c is the apt choice.
Again, g is typically chosen as the sigmoid function.

[Figure: network with h = g(W xi + b), x̂i = f(W∗ h + c); input 0.25 0.5 1.25 3.5 4.5 (real valued inputs)]
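A brief sketch contrasting the decoders for real-valued inputs (the code vector and weights below are made up); only the linear decoder can reach targets outside [-1, 1] such as 3.5 or 4.5.

```python
import numpy as np

h  = np.array([0.2, 0.9, 0.4])                 # some hidden code (made up)
Ws = np.array([[ 3.0, -1.0, 2.0],
               [ 0.5,  4.0, 1.0]])             # decoder weights W* (made up)
c  = np.array([1.0, -2.0])

x_hat_linear = Ws @ h + c                      # unbounded: can match any real x_i
x_hat_tanh   = np.tanh(Ws @ h + c)             # squashed into [-1, 1]
print(x_hat_linear, x_hat_tanh)
```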
Consider the case when the inputs are real valued.
The objective of the autoencoder is to reconstruct x̂i to be as close to xi as possible.
This can be formalized using the following objective function:

min_{W,W∗,c,b} (1/m) Σ_{i=1}^{m} Σ_{j=1}^{n} (x̂ij − xij)²

i.e., min_{W,W∗,c,b} (1/m) Σ_{i=1}^{m} (x̂i − xi)ᵀ(x̂i − xi)

We can then train the autoencoder just like a regular feedforward network using backpropagation.
All we need is a formula for ∂L(θ)/∂W∗ and ∂L(θ)/∂W, which we will see now.

[Figure: network xi → h → x̂i with h = g(W xi + b), x̂i = f(W∗ h + c)]
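A hedged sketch of this objective evaluated on a small batch; the shapes, the random data, and the use of a sigmoid encoder with a linear decoder are assumptions consistent with the earlier slides rather than anything fixed by the lecture.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
m, n, d = 4, 6, 3                           # m examples, n-dim inputs, d-dim code (made up)
X  = rng.random((m, n))                     # rows are the inputs x_i
W  = rng.normal(scale=0.1, size=(d, n)); b = np.zeros(d)
Ws = rng.normal(scale=0.1, size=(n, d)); c = np.zeros(n)

H     = sigmoid(X @ W.T + b)                # h_i = g(W x_i + b), one row per example
X_hat = H @ Ws.T + c                        # linear decoder for real-valued inputs

loss = np.mean(np.sum((X_hat - X) ** 2, axis=1))   # (1/m) Σ_i Σ_j (x̂_ij - x_ij)²
```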
L(θ) = (x̂i − xi)ᵀ(x̂i − xi)

[Figure: two-layer network h0 = xi → a1 → h1 → a2 → h2 = x̂i, with encoder weights W and decoder weights W∗]

∂L(θ)/∂W∗ = ∂L(θ)/∂h2 · ∂h2/∂a2 · ∂a2/∂W∗

∂L(θ)/∂W = ∂L(θ)/∂h2 · ∂h2/∂a2 · ∂a2/∂h1 · ∂h1/∂a1 · ∂a1/∂W

We have already seen how to calculate all the other factors in these chains when we learnt backpropagation. The remaining term is:

∂L(θ)/∂h2 = ∂L(θ)/∂x̂i = ∇x̂i {(x̂i − xi)ᵀ(x̂i − xi)} = 2(x̂i − xi)

Note that the loss function is shown for only one training example.
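The same gradients written out by hand for a single example, as a sketch; it assumes a sigmoid encoder g and an identity decoder f (the linear choice for real-valued inputs), so ∂h2/∂a2 is the identity.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
n, d = 6, 3
x  = rng.random(n)
W  = rng.normal(scale=0.1, size=(d, n)); b = np.zeros(d)
Ws = rng.normal(scale=0.1, size=(n, d)); c = np.zeros(n)

# forward pass: g is sigmoid, f is identity (real-valued inputs)
a1 = W @ x + b;   h1 = sigmoid(a1)
a2 = Ws @ h1 + c; x_hat = a2              # h2 = x̂_i

# backward pass for L(θ) = (x̂_i - x_i)ᵀ(x̂_i - x_i)
dL_dxhat = 2.0 * (x_hat - x)              # ∂L/∂h2 = 2(x̂_i - x_i)
dL_da2   = dL_dxhat                       # f is identity, so ∂h2/∂a2 = I
dL_dWs   = np.outer(dL_da2, h1)           # ∂L/∂W*
dL_dh1   = Ws.T @ dL_da2
dL_da1   = dL_dh1 * h1 * (1.0 - h1)       # σ'(a1) = σ(a1)(1 - σ(a1))
dL_dW    = np.outer(dL_da1, x)            # ∂L/∂W
dL_dc, dL_db = dL_da2, dL_da1             # bias gradients
```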
Consider the case when the inputs are binary.
We use a sigmoid decoder which will produce outputs between 0 and 1, and these can be interpreted as probabilities.
For a single n-dimensional ith input we can use the following loss function:

min {− Σ_{j=1}^{n} (xij log x̂ij + (1 − xij) log(1 − x̂ij))}

What value of x̂ij will minimize this function?
If xij = 1?
If xij = 0?
Indeed the above function is minimized when x̂ij = xij!
Again, all we need is a formula for ∂L(θ)/∂W∗ and ∂L(θ)/∂W to use backpropagation.

[Figure: network with h = g(W xi + b), x̂i = f(W∗ h + c); input 0 1 1 0 1 (binary inputs)]
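A brief sketch of this loss evaluated for one example (the particular vectors are made up); pushing x̂i towards xi drives the loss towards 0.

```python
import numpy as np

x     = np.array([0., 1., 1., 0., 1.])            # binary input x_i
x_hat = np.array([0.1, 0.8, 0.7, 0.2, 0.9])       # sigmoid decoder outputs (made up)

# L(θ) = -Σ_j [ x_ij log x̂_ij + (1 - x_ij) log(1 - x̂_ij) ]
loss = -np.sum(x * np.log(x_hat) + (1 - x) * np.log(1 - x_hat))
print(loss)   # shrinks as x_hat approaches x
```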
L(θ) = − Σ_{j=1}^{n} (xij log x̂ij + (1 − xij) log(1 − x̂ij))

[Figure: two-layer network h0 = xi → a1 → h1 → a2 → h2 = x̂i, with encoder weights W and decoder weights W∗]

∂L(θ)/∂W∗ = ∂L(θ)/∂h2 · ∂h2/∂a2 · ∂a2/∂W∗

∂L(θ)/∂W = ∂L(θ)/∂h2 · ∂h2/∂a2 · ∂a2/∂h1 · ∂h1/∂a1 · ∂a1/∂W

We have already seen how to calculate the other factors in these chains when we learnt backpropagation. The first two terms on the RHS can be computed as:

∂L(θ)/∂x̂ij = − xij/x̂ij + (1 − xij)/(1 − x̂ij)

so that ∂L(θ)/∂h2 = [∂L(θ)/∂h21, ∂L(θ)/∂h22, …, ∂L(θ)/∂h2n]ᵀ, and

∂h2j/∂a2j = σ(a2j)(1 − σ(a2j))
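Putting these pieces together gives a hand-written backward pass; the sketch below assumes sigmoid activations for both g and f, matching the binary-input setting, with made-up data and initialisation.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
n, d = 5, 3
x  = (rng.random(n) > 0.5).astype(float)            # binary input x_i
W  = rng.normal(scale=0.1, size=(d, n)); b = np.zeros(d)
Ws = rng.normal(scale=0.1, size=(n, d)); c = np.zeros(n)

# forward pass: both g and f are sigmoids
a1 = W @ x + b;   h1 = sigmoid(a1)
a2 = Ws @ h1 + c; x_hat = sigmoid(a2)                # h2 = x̂_i

# backward pass for the cross-entropy loss
dL_dh2 = -x / x_hat + (1 - x) / (1 - x_hat)          # ∂L/∂x̂_ij
dL_da2 = dL_dh2 * x_hat * (1 - x_hat)                # ∂h2j/∂a2j = σ(a2j)(1 - σ(a2j))
dL_dWs = np.outer(dL_da2, h1)                        # ∂L/∂W*
dL_dh1 = Ws.T @ dL_da2
dL_da1 = dL_dh1 * h1 * (1 - h1)
dL_dW  = np.outer(dL_da1, x)                         # ∂L/∂W
```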