100% found this document useful (1 vote)

374 views34 pages

GANppt

The document discusses recent advances in generative adversarial networks (GANs) for computer vision applications. It provides an overview of GANs, including how they work by pitting a generator network against a discriminator network. It describes different types of GANs such as DCGANs, CGANs, CycleGANs, and SeqGANs. DCGANs apply convolutional networks to GANs for image generation. CGANs add conditional information to GANs. CycleGANs perform image-to-image translation without paired examples. SeqGANs use reinforcement learning to generate discrete sequential data like text.

Uploaded by

Sreejith PB

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

374 views34 pages

GANppt

Uploaded by

Sreejith PB

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Recent Advances of Generative Adversarial

Networks in Computer Vision

SREEJITH PB (PKD16IT053)
Guided By
Sibily Joseph and Joby NJ
Asst. Professors
Department of Computer Science and Engineering

GOVERNMENT ENGINNERING COLLEGE, SREEKRISHNAPURAM

September 2019

GEC SREEKRISHNAPURAM GAN 1 / 34

CONTENTS

• Introduction
• System Overview
• Types of GAN
• Applications
• Conclusion
• References

GEC SREEKRISHNAPURAM GAN 2 / 34

Introduction

GEC SREEKRISHNAPURAM GAN 3 / 34

Introduction

• Generative Adversarial Network (GAN), a generative approach

proposed by Goodfellow in 2014 has become one of the most
discussed topics in machine learning
• Generative Adversarial Network can
• Generate high quality images
• Generate high quality audios and videos
• Generate images from text
• Convert images from one domain to another(Image translation)
• etc.
• Different types of GANs are available now for various
application.

GEC SREEKRISHNAPURAM GAN 4 / 34

Introduction

GEC SREEKRISHNAPURAM GAN 5 / 34

System Overview
• Generative adversarial networks (GANs) are deep neural net
architectures comprised of two networks Generator(D) and
Discriminator, pitting one against the other (thus the
adversarial)
• Working of GAN

.
GEC SREEKRISHNAPURAM GAN 6 / 34
System Overview

• The Generator takes in random noise and returns an image.

• This generated image is fed into the Discriminator alongside a
stream of images taken from the actual data set.
• The Discriminator takes in both real and fake images and
returns probabilities, a number between 0 and 1, with 1
representing a prediction of authenticity and 0 representing
fake
• The entities/adversaries are in constant battle as
one(generator) tries to fool the other(discriminator),
while the other tries not to be fooled.

GEC SREEKRISHNAPURAM GAN 7 / 34

System Overview

Two Feedback Loops:

• The Discriminator is in a feedback loop with the ground truth
of the images (are they real or fake)
• The Generator is in a feedback loop with the Discriminator
(did the Discriminator label it real or fake, regardless of the
truth)

GEC SREEKRISHNAPURAM GAN 8 / 34

System Overview
Discriminator vs Generator

GEC SREEKRISHNAPURAM GAN 9 / 34

System Overview

GEC SREEKRISHNAPURAM GAN 10 / 34

System Overview
Loss Function In Discriminative Model
Loss function in Discriminative Model is a regular cross entropy
loss function associated with a binary classifier.

P can be represented as D(x); ie, Probability estimated by

Discriminator D that image X is real image.

GEC SREEKRISHNAPURAM GAN 11 / 34

System Overview

Applying Gradient descent algorithm for minimizing the loss

function
the equation becomes

Loss Function In Generative Model

GEC SREEKRISHNAPURAM GAN 12 / 34

System Overview

GEC SREEKRISHNAPURAM GAN 13 / 34

System Overview

Advantages Over VAE

• GAN belongs to the type of non-parametric production-based
modeling methods, which does not require prior approximate
distributions of training
• GAN works on the whole image and takes less time to
generate samples by directly using global information

GEC SREEKRISHNAPURAM GAN 14 / 34

System Overview

GAN Problems
• Non-convergence:The model parameters oscillate, destabilize
and never converge.
• Mode collapse:The generator collapses which produces limited
varieties of samples.
• Diminished gradient: the discriminator gets too successful
that the generator gradient vanishes and learns nothing.
• Unbalance between the generator and discriminator causing
overfitting
• Highly sensitive to the hyperparameter selections.

GEC SREEKRISHNAPURAM GAN 15 / 34

Types Of GAN

1.DCGAN(Deep Convolutional GAN)

• The generator and discriminator of simple GAN is a simple
fully connected network
generator=Sequential([
Dense(128,inputshape=(100,)),
LeakyReLu(alpha-0.01),
Dense(784),
Activation(’tanh’),
],name=’generator’)
• But in DCGAN Discriminator is a Convolutional Nueral
Network (CNN) and Generator is Transposed Convolutional
Network(Deconvolutional network)
• ie DCGAN will be more fit for the image/video data than a
Simple GAN

GEC SREEKRISHNAPURAM GAN 16 / 34

Types Of GAN(DCGAN cont..)

Similarities Of Neural Networks And CNN

• Both Nueral Network and CNN have learn able weights and
biases.
• In both networks nueron receives some input,perform a dot
product follows it up with a non linear function like
RELU(Rectified Linear Unit)
Main problems with fully connected layers
• Number of weights needed for the nueral network is large
• Networks with large number of parameters faces several
problems.
• slower training time
• chances of overfitting
• etc..

GEC SREEKRISHNAPURAM GAN 17 / 34

Types Of GAN(DCGAN cont..)
Convolutional Neural Network(CNN)
• In CNN the main image matrix is reduced to a matrix of lower
dimension in the first layer through an operation called
Convolution
eg:an image of 64x64xx3 can be reduced to 1x1x10 following
subsequent operation.

Figure: Architecture of Convolutional Neural Network

GEC SREEKRISHNAPURAM GAN 18 / 34

Types Of GAN(DCGAN cont..)
Convolutional Layer

GEC SREEKRISHNAPURAM GAN 19 / 34

Types Of GAN(DCGAN cont..)
Max pooling

Figure: Max Pooling

GEC SREEKRISHNAPURAM GAN 20 / 34

Types Of GAN(DCGAN cont..)

Figure: Discriminator
GEC SREEKRISHNAPURAM GAN 21 / 34
Types Of GAN(DCGAN cont..)

Figure: Generator
GEC SREEKRISHNAPURAM GAN 22 / 34
Types Of GAN
2.CGAN(Conditional GAN)
• when the data set is complex or large-scale, it is difficult for
GAN to control generated result.
• Conditional GANs (CGANs) are an extension of the GANs
model.
• In CGAN the Generator and Discriminator both receive some
additional conditioning input information(y). This could be
the class of the current image or some other property.

NOTE: CGANs have one disadvantage. CGANs are not strictly

unsupervised and we need some kind of labels for them to work
GEC SREEKRISHNAPURAM GAN 23 / 34
Types Of GAN
3.CYCLE GAN
• The CycleGAN is an extension of the GAN architecture that
involves the simultaneous training of two generator models
and two discriminator models.
• The CycleGAN is a technique that involves the automatic
training of image-to-image translation models without paired
examples.
• The models are trained in an unsupervised manner using a
collection of images from the source and target domain that
do not need to be related in any way.

GEC SREEKRISHNAPURAM GAN 24 / 34

Types Of GAN (CYCLEGAN cont...)

GEC SREEKRISHNAPURAM GAN 25 / 34

Types Of GAN

4.SEQGAN(Sequential GAN)
• In sequential data (text, speech, etc), there are some
limitations in applying the exact same concepts of GAN.
These limitations arise mainly due to the sequential and
discrete nature of the data.
• This is the image representation of a random matrix (M)

GEC SREEKRISHNAPURAM GAN 26 / 34

Types Of GAN(SEQGAN cont...)

• This is the image representation of M+0.08

• .But in case of a text ,Suppose that the word computer is

represented by the real-valued vector v = [0.11143, -0.97712,
0.445216 .., 0.7221240]. Now, v + 0.08 is another vector
which need not necessarily represent some word in the
vocabulary.
• eg:”penguin”+0.001==¿”ostrich”

GEC SREEKRISHNAPURAM GAN 27 / 34

Types Of GAN(SEQGAN cont...)

• To overcome,Goodfellow( father of GAN )recommended to

use Reinforcement learning to train GAN to generate discrete
tokens.
• SeqGan(Sequence Generative Adversarial Nets) Using
Reinforcement Learning to combat the non-differentiability
issue in text GANs.

GEC SREEKRISHNAPURAM GAN 28 / 34

Types Of GAN(SEQGAN cont...)

• The generator is treated as an RL agent.

• previous tokens are the states (stored in the hidden states)
and the action is the next token to generate.

• The discriminator is fed with both real and synthetic data to

local the difference.
• To evaluate some partial sequence, they use another
generator.

GEC SREEKRISHNAPURAM GAN 29 / 34

Types Of GAN(SEQGAN cont...)

• Finally completing the sentence .ie completing the action it

will get some rewards(in this case from the discriminator) how
good the sentence is?
• For picking the right action from the particular state using the
concept of policy.
• For optimizing the policy gradient methods are used.

GEC SREEKRISHNAPURAM GAN 30 / 34

Applications
Different types of GAN and its applications:

GEC SREEKRISHNAPURAM GAN 31 / 34

Conclusion

Conclusion:
GANs are one of the new state of the art neural networks which
can be used to do many things.There is a lot of active research in
the field to apply GANs for language tasks, to improve their
stability and ease of training, and so on. They are already being
applied in industry for a variety of applications ranging from
interactive image editing, 3D shape estimation, drug discovery,
semi-supervised learning to robotics etc.

GEC SREEKRISHNAPURAM GAN 32 / 34

References

GEC SREEKRISHNAPURAM GAN 33 / 34

THANK YOU

GEC SREEKRISHNAPURAM GAN 34 / 34

Generative Adversarial Networks Review 1-06-08-1.edit
No ratings yet
Generative Adversarial Networks Review 1-06-08-1.edit
24 pages
Understanding Generative Adversarial Networks
No ratings yet
Understanding Generative Adversarial Networks
15 pages
Generative Adversial Network
No ratings yet
Generative Adversial Network
21 pages
Lec16 - Autoencoders
No ratings yet
Lec16 - Autoencoders
18 pages
Dropout Vs Pruning
No ratings yet
Dropout Vs Pruning
2 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
5 pages
Generative AI: Creative Chaos Unleashed
No ratings yet
Generative AI: Creative Chaos Unleashed
1 page
Overview of Recurrent Neural Networks
100% (2)
Overview of Recurrent Neural Networks
53 pages
Advanced Deep Learning Questions - ChatGPT
No ratings yet
Advanced Deep Learning Questions - ChatGPT
13 pages
Backpropagation in Neural Networks
No ratings yet
Backpropagation in Neural Networks
7 pages
LSTM for Touchpoint Prediction
100% (1)
LSTM for Touchpoint Prediction
73 pages
Deep Learning (MODULE-3)
No ratings yet
Deep Learning (MODULE-3)
85 pages
Deep Learning Interview Guide
No ratings yet
Deep Learning Interview Guide
17 pages
Autoencoders - Presentation
No ratings yet
Autoencoders - Presentation
18 pages
NLP and Generative AI Syllabus - 2025
No ratings yet
NLP and Generative AI Syllabus - 2025
5 pages
Overview of Deep Learning Concepts
100% (2)
Overview of Deep Learning Concepts
49 pages
Deep Learning Laboratory
No ratings yet
Deep Learning Laboratory
69 pages
Rakesh Kumar - Data Scientist
No ratings yet
Rakesh Kumar - Data Scientist
3 pages
Notes On Backpropagation
No ratings yet
Notes On Backpropagation
14 pages
Object Detection
No ratings yet
Object Detection
57 pages
Artificial Neural Networks: Part 1/3
No ratings yet
Artificial Neural Networks: Part 1/3
25 pages
Unit 5
No ratings yet
Unit 5
23 pages
Deep Learning Unit-1 Finals
No ratings yet
Deep Learning Unit-1 Finals
23 pages
TensorFlow Deep Learning Guide
No ratings yet
TensorFlow Deep Learning Guide
35 pages
Deep Learning: Autoencoders Overview
100% (1)
Deep Learning: Autoencoders Overview
31 pages
GANs: A Deep Dive for Researchers
No ratings yet
GANs: A Deep Dive for Researchers
62 pages
02 - Lecture Note - TensorFlow Ops
No ratings yet
02 - Lecture Note - TensorFlow Ops
21 pages
Face Recognition with Neural Networks
100% (3)
Face Recognition with Neural Networks
33 pages
GenAI Unit1 3
No ratings yet
GenAI Unit1 3
31 pages
Face Detection and Smile Detection
No ratings yet
Face Detection and Smile Detection
8 pages
Word2Vec: Skip-Gram vs CBOW Explained
100% (1)
Word2Vec: Skip-Gram vs CBOW Explained
37 pages
AI-Enhanced QA: EmbeddingAlign RAG
No ratings yet
AI-Enhanced QA: EmbeddingAlign RAG
7 pages
Gradient Descent for Deep Learning
No ratings yet
Gradient Descent for Deep Learning
21 pages
Neural Networks
No ratings yet
Neural Networks
29 pages
Omkar Sabnis B4-764 Experiment No. 7 Aim: Implementation of MC-Culloch Pitt Model For AND Gate Using Python. Theory
No ratings yet
Omkar Sabnis B4-764 Experiment No. 7 Aim: Implementation of MC-Culloch Pitt Model For AND Gate Using Python. Theory
10 pages
Implementing MLPs with Keras
No ratings yet
Implementing MLPs with Keras
61 pages
Deep Learning Step by Step
No ratings yet
Deep Learning Step by Step
171 pages
KNN (K Nearest Neighbor)
No ratings yet
KNN (K Nearest Neighbor)
21 pages
AML - Theory - Syllabus - Chandigarh University
No ratings yet
AML - Theory - Syllabus - Chandigarh University
4 pages
Unit 5
No ratings yet
Unit 5
36 pages
Deploy A Machine Learning Model Using Flask - Towards Data Science
No ratings yet
Deploy A Machine Learning Model Using Flask - Towards Data Science
12 pages
Automated Violence Detection System
No ratings yet
Automated Violence Detection System
17 pages
Btech CSE
100% (1)
Btech CSE
17 pages
Machine Learning
No ratings yet
Machine Learning
17 pages
Transformers For Natural Language Processing and Computer Vision
No ratings yet
Transformers For Natural Language Processing and Computer Vision
150 pages
Autoencoders - Buffalo University
100% (1)
Autoencoders - Buffalo University
36 pages
Deep Learning
100% (2)
Deep Learning
21 pages
Autoregressive Generative Models Guide
No ratings yet
Autoregressive Generative Models Guide
57 pages
RNN and LSTM for Sentiment Analysis
No ratings yet
RNN and LSTM for Sentiment Analysis
14 pages
Neural Networks & Deep Learning Basics
100% (1)
Neural Networks & Deep Learning Basics
24 pages
Advanced Deep Learning Syllabus
No ratings yet
Advanced Deep Learning Syllabus
2 pages
Ai Agents
No ratings yet
Ai Agents
31 pages
Machine Learning Classification Guide
No ratings yet
Machine Learning Classification Guide
7 pages
Autoencoders & Keras Overview
No ratings yet
Autoencoders & Keras Overview
42 pages
RBF Neural Network
No ratings yet
RBF Neural Network
34 pages
Career Plans For Next 2 Years
No ratings yet
Career Plans For Next 2 Years
11 pages
Predicting Heart Disease at Early Stages Using Machine Learning: A Survey
No ratings yet
Predicting Heart Disease at Early Stages Using Machine Learning: A Survey
4 pages
Deep Learning Insights on GANs
No ratings yet
Deep Learning Insights on GANs
75 pages
Module 6.2 GAN
No ratings yet
Module 6.2 GAN
29 pages
Aai 2
No ratings yet
Aai 2
83 pages
W3schools: CSS Reference
No ratings yet
W3schools: CSS Reference
21 pages
PK Sinah - Computer Fundamentals
No ratings yet
PK Sinah - Computer Fundamentals
536 pages
Search On Codescracker Search: F T G L y
No ratings yet
Search On Codescracker Search: F T G L y
6 pages
Learn To Submit HTML Data To Mysql Database Using PHP: Programming-Tutorials/)
No ratings yet
Learn To Submit HTML Data To Mysql Database Using PHP: Programming-Tutorials/)
35 pages
Automated Doctor Appointment System
No ratings yet
Automated Doctor Appointment System
21 pages
HTML Basics: Search On Codescracker Search
No ratings yet
HTML Basics: Search On Codescracker Search
6 pages
Diamond Pattern in C Programming
No ratings yet
Diamond Pattern in C Programming
3 pages
Clarification Finalsem PDF
No ratings yet
Clarification Finalsem PDF
1 page
Hyperparameters Without Learning Rate
No ratings yet
Hyperparameters Without Learning Rate
16 pages
Lit Survey
No ratings yet
Lit Survey
2 pages
Efficient Maximum Likelihood Decoding of Linear Block Codes Using A Trellis
No ratings yet
Efficient Maximum Likelihood Decoding of Linear Block Codes Using A Trellis
5 pages
Polynomials Division
No ratings yet
Polynomials Division
3 pages
ACT2
No ratings yet
ACT2
5 pages
AI and Machine Learning Exam Paper
No ratings yet
AI and Machine Learning Exam Paper
2 pages
Factorial C Program
No ratings yet
Factorial C Program
7 pages
G8 Second Term B
No ratings yet
G8 Second Term B
5 pages
Signals & Systems Study Guide
No ratings yet
Signals & Systems Study Guide
10 pages
Geo AI
No ratings yet
Geo AI
50 pages
Backpropagation Neural Network
No ratings yet
Backpropagation Neural Network
34 pages
Ece-Am-2021-Ec 8553
No ratings yet
Ece-Am-2021-Ec 8553
3 pages
Informed Search
No ratings yet
Informed Search
9 pages
Polynomial Division Worksheet
No ratings yet
Polynomial Division Worksheet
6 pages
Remeshing Deformed Mesh Techniques
No ratings yet
Remeshing Deformed Mesh Techniques
3 pages
HPC Codes-2
No ratings yet
HPC Codes-2
15 pages
Malware Detection Using Frequency Domain-Based Image Visualization and Deep Learning
No ratings yet
Malware Detection Using Frequency Domain-Based Image Visualization and Deep Learning
10 pages
Ijser: Hilbert Transform and Its Applications: A Survey
No ratings yet
Ijser: Hilbert Transform and Its Applications: A Survey
6 pages
Train Py
No ratings yet
Train Py
4 pages
Nonlinear Analysis Basics in ANSYS
No ratings yet
Nonlinear Analysis Basics in ANSYS
11 pages
Finite Difference Method: I I I I I I
No ratings yet
Finite Difference Method: I I I I I I
3 pages
Data Structures & Algorithms Solutions
No ratings yet
Data Structures & Algorithms Solutions
6 pages
Automatic Number Plate Recognition System
No ratings yet
Automatic Number Plate Recognition System
6 pages
Algorithm Correctness and Time Complexity
No ratings yet
Algorithm Correctness and Time Complexity
36 pages
DL Unit 1a
No ratings yet
DL Unit 1a
31 pages
Polynomial Problem Worksheets
No ratings yet
Polynomial Problem Worksheets
13 pages
CS25C03 Lesson Plan
No ratings yet
CS25C03 Lesson Plan
8 pages
Ex No 5 Write A Program To Implement The Naïve Bayesian Classifier
No ratings yet
Ex No 5 Write A Program To Implement The Naïve Bayesian Classifier
3 pages
Lecture 7 - Perceptrons and Multi-Layer Feedforward Neural Networks Using Matlab Part 3
No ratings yet
Lecture 7 - Perceptrons and Multi-Layer Feedforward Neural Networks Using Matlab Part 3
6 pages
Solving Linear Differential Equations
No ratings yet
Solving Linear Differential Equations
4 pages

GANppt

Uploaded by

GANppt

Uploaded by

Recent Advances of Generative Adversarial

Networks in Computer Vision

GOVERNMENT ENGINNERING COLLEGE, SREEKRISHNAPURAM

GEC SREEKRISHNAPURAM GAN 1 / 34

GEC SREEKRISHNAPURAM GAN 2 / 34

GEC SREEKRISHNAPURAM GAN 3 / 34

• Generative Adversarial Network (GAN), a generative approach

GEC SREEKRISHNAPURAM GAN 4 / 34

GEC SREEKRISHNAPURAM GAN 5 / 34

• The Generator takes in random noise and returns an image.

GEC SREEKRISHNAPURAM GAN 7 / 34

Two Feedback Loops:

GEC SREEKRISHNAPURAM GAN 8 / 34

GEC SREEKRISHNAPURAM GAN 9 / 34

GEC SREEKRISHNAPURAM GAN 10 / 34

P can be represented as D(x); ie, Probability estimated by

GEC SREEKRISHNAPURAM GAN 11 / 34

Applying Gradient descent algorithm for minimizing the loss

Loss Function In Generative Model

GEC SREEKRISHNAPURAM GAN 12 / 34

GEC SREEKRISHNAPURAM GAN 13 / 34

Advantages Over VAE

GEC SREEKRISHNAPURAM GAN 14 / 34

GEC SREEKRISHNAPURAM GAN 15 / 34

1.DCGAN(Deep Convolutional GAN)

GEC SREEKRISHNAPURAM GAN 16 / 34

Similarities Of Neural Networks And CNN

GEC SREEKRISHNAPURAM GAN 17 / 34

Figure: Architecture of Convolutional Neural Network

GEC SREEKRISHNAPURAM GAN 18 / 34

GEC SREEKRISHNAPURAM GAN 19 / 34

Figure: Max Pooling

GEC SREEKRISHNAPURAM GAN 20 / 34

NOTE: CGANs have one disadvantage. CGANs are not strictly

GEC SREEKRISHNAPURAM GAN 24 / 34

GEC SREEKRISHNAPURAM GAN 25 / 34

GEC SREEKRISHNAPURAM GAN 26 / 34

• This is the image representation of M+0.08

• .But in case of a text ,Suppose that the word computer is

GEC SREEKRISHNAPURAM GAN 27 / 34

• To overcome,Goodfellow( father of GAN )recommended to

GEC SREEKRISHNAPURAM GAN 28 / 34

• The generator is treated as an RL agent.

• The discriminator is fed with both real and synthetic data to

GEC SREEKRISHNAPURAM GAN 29 / 34

• Finally completing the sentence .ie completing the action it

GEC SREEKRISHNAPURAM GAN 30 / 34

GEC SREEKRISHNAPURAM GAN 31 / 34

GEC SREEKRISHNAPURAM GAN 32 / 34

GEC SREEKRISHNAPURAM GAN 33 / 34

GEC SREEKRISHNAPURAM GAN 34 / 34

You might also like