1. Deep Feedforward Neural Networks
• Definition: Deep Feedforward Neural Networks are artificial neural networks in which
connections between the nodes do not form a cycle, so information flows in one direction
from the input to the output. They are the simplest form of deep neural network.
• Architecture: Consists of an input layer, several hidden layers, and an output layer.
• Activation Functions: Commonly used activation functions include Sigmoid, Tanh, and
ReLU.
• Forward Propagation: Computes the output of each neuron layer by layer, from the input
layer through to the output layer (see the sketch after this list).
• Use Cases: Image and speech recognition, language translation, and other applications
requiring pattern recognition.
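A minimal forward-propagation sketch in NumPy; the layer sizes, ReLU hidden activation, and random weights are illustrative assumptions rather than values from these notes.

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def forward(x, params):
    """Propagate input x through each (W, b) layer pair in turn."""
    a = x
    for W, b in params[:-1]:
        a = relu(W @ a + b)           # hidden layers use ReLU
    W_out, b_out = params[-1]
    return W_out @ a + b_out          # linear output layer

# Toy network: 4 inputs -> 8 hidden units -> 3 outputs, random weights.
rng = np.random.default_rng(0)
params = [(rng.normal(size=(8, 4)), np.zeros(8)),
          (rng.normal(size=(3, 8)), np.zeros(3))]
print(forward(rng.normal(size=4), params))
```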
2. Gradient Descent (GD)
• Definition: An optimization algorithm used to minimize the cost function by iteratively
adjusting the model parameters in the opposite direction of the gradient.
• Types of Gradient Descent:
o Batch Gradient Descent: Uses the entire dataset for each update.
o Stochastic Gradient Descent: Uses one training example at each update.
o Mini-batch Gradient Descent: Uses a small random subset of the dataset at each
update.
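A sketch of mini-batch gradient descent on an assumed squared-error (linear-regression) objective, used here only for illustration; setting batch_size to the dataset size recovers batch GD, and batch_size=1 recovers stochastic GD.

```python
import numpy as np

def minibatch_gd(X, y, lr=0.1, batch_size=32, epochs=50, seed=0):
    """Mini-batch GD for linear regression with squared-error loss."""
    rng = np.random.default_rng(seed)
    theta = np.zeros(X.shape[1])
    for _ in range(epochs):
        idx = rng.permutation(len(X))                 # reshuffle each epoch
        for start in range(0, len(X), batch_size):
            batch = idx[start:start + batch_size]
            Xb, yb = X[batch], y[batch]
            grad = 2 * Xb.T @ (Xb @ theta - yb) / len(batch)  # gradient of the MSE
            theta -= lr * grad                        # step opposite to the gradient
    return theta
```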
3. Momentum-Based GD
• Definition: Enhances gradient descent by adding a momentum term to accelerate
convergence and prevent oscillations.
• Formula: v(t) = γ·v(t−1) + η·∇J(θ), with the parameter update θ = θ − v(t) (sketched below)
o v(t): velocity (momentum term)
o γ: momentum hyperparameter
o η: learning rate
o ∇J(θ): gradient of the cost function
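A one-step sketch of the momentum update above; grad_J stands for any (hypothetical) function returning the gradient of the cost at the given parameters.

```python
def momentum_step(theta, v, grad_J, lr=0.01, gamma=0.9):
    """One momentum-based GD step on NumPy arrays (or scalars)."""
    v = gamma * v + lr * grad_J(theta)   # v(t) = gamma*v(t-1) + eta*grad J(theta)
    theta = theta - v                    # parameters move along the velocity
    return theta, v
```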
4. Nesterov Accelerated GD
• Definition: An improved version of momentum-based GD that looks ahead to the
estimated future position.
• Formula: v(t) = γ·v(t−1) + η·∇J(θ − γ·v(t−1)), with the same parameter update θ = θ − v(t) (sketched below)
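The same sketch adapted to Nesterov's look-ahead; the only change is that the gradient is evaluated at theta − gamma·v rather than at theta (grad_J is again a hypothetical gradient callback).

```python
def nesterov_step(theta, v, grad_J, lr=0.01, gamma=0.9):
    """One Nesterov accelerated GD step."""
    lookahead = theta - gamma * v          # estimated future position
    v = gamma * v + lr * grad_J(lookahead)
    theta = theta - v
    return theta, v
```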
5. Stochastic Gradient Descent (SGD)
• Definition: An iterative method for optimizing an objective function using one training
example at a time.
• Advantages: Much cheaper per update than batch GD, so it makes faster progress on large datasets.
• Disadvantages: Updates are noisy, and the learning rate must be tuned carefully (often with a decay schedule).
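A sketch of plain SGD with per-epoch shuffling; grad_example is a hypothetical function returning the gradient on a single training example.

```python
import numpy as np

def sgd(X, y, grad_example, lr=0.01, epochs=10, seed=0):
    """SGD: one training example per (noisy) update, reshuffled every epoch."""
    rng = np.random.default_rng(seed)
    theta = np.zeros(X.shape[1])
    for _ in range(epochs):
        for i in rng.permutation(len(X)):
            theta -= lr * grad_example(theta, X[i], y[i])
    return theta
```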
6. AdaGrad
• Definition: An adaptive gradient algorithm that adjusts the learning rate for each
parameter based on historical gradient information.
• Formula: θ_{t+1} = θ_t − η/√(G_t + ϵ) · ∇J(θ_t)
o G_t: sum of the squares of the past gradients
o ϵ: small constant to avoid division by zero
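A one-step AdaGrad sketch matching the formula above; the learning rate and ϵ are typical illustrative values.

```python
import numpy as np

def adagrad_step(theta, G, grad, lr=0.01, eps=1e-8):
    """One AdaGrad step; G accumulates the squared gradients per parameter."""
    G = G + grad ** 2
    theta = theta - lr / np.sqrt(G + eps) * grad   # per-parameter learning rate
    return theta, G
```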
7. Adam
• Definition: Combines the advantages of AdaGrad and RMSProp, using adaptive learning
rates and momentum.
• Parameters: β1 (decay rate for the first moment), β2 (decay rate for the second moment), ϵ (small constant).
• Formula: m_t = β1·m_{t−1} + (1 − β1)·∇J(θ_t) and v_t = β2·v_{t−1} + (1 − β2)·(∇J(θ_t))²
• Update: θ_{t+1} = θ_t − η·m̂_t/(√v̂_t + ϵ), where m̂_t and v̂_t are the bias-corrected moment estimates (see the sketch below).
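A one-step Adam sketch with the usual bias correction; the default hyperparameters shown are common illustrative choices.

```python
import numpy as np

def adam_step(theta, m, v, grad, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam step; t counts update steps starting from 1."""
    m = beta1 * m + (1 - beta1) * grad          # first-moment (mean) estimate
    v = beta2 * v + (1 - beta2) * grad ** 2     # second-moment estimate
    m_hat = m / (1 - beta1 ** t)                # bias-corrected moments
    v_hat = v / (1 - beta2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v
```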
8. RMSProp
• Definition: An optimization algorithm that adjusts the learning rate by dividing the
gradient by a running average of its recent magnitude.
• Formula: θ_{t+1} = θ_t − η/√(E[g²]_t + ϵ) · ∇J(θ_t), where E[g²]_t = ρ·E[g²]_{t−1} + (1 − ρ)·(∇J(θ_t))² is the running average of the squared gradients.
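A one-step RMSProp sketch matching the formula above; rho is the decay rate of the running average (0.9 is a common illustrative value).

```python
import numpy as np

def rmsprop_step(theta, avg_sq, grad, lr=0.001, rho=0.9, eps=1e-8):
    """One RMSProp step; avg_sq is the running average E[g^2]."""
    avg_sq = rho * avg_sq + (1 - rho) * grad ** 2
    theta = theta - lr / np.sqrt(avg_sq + eps) * grad
    return theta, avg_sq
```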
9. Auto-encoder
• Definition: An unsupervised learning model used to encode input data into a
compressed representation and then decode it back to reconstruct the input.
• Architecture: Consists of an encoder (compresses the input) and a decoder (reconstructs
the input).
• Applications: Dimensionality reduction, feature learning, and anomaly detection.
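A minimal auto-encoder sketch in PyTorch; the 784-dimensional input, 32-dimensional code, and layer widths are illustrative assumptions.

```python
import torch
from torch import nn

class AutoEncoder(nn.Module):
    """Encoder compresses the input to a small code; decoder reconstructs it."""
    def __init__(self, n_in=784, n_code=32):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(n_in, 128), nn.ReLU(),
                                     nn.Linear(128, n_code))
        self.decoder = nn.Sequential(nn.Linear(n_code, 128), nn.ReLU(),
                                     nn.Linear(128, n_in))

    def forward(self, x):
        return self.decoder(self.encoder(x))

model = AutoEncoder()
x = torch.rand(16, 784)                      # a fake batch of flattened images
loss = nn.functional.mse_loss(model(x), x)   # reconstruction loss
loss.backward()
```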
10. Regularization in Auto-encoders
• Purpose: Prevent overfitting and improve generalization by adding constraints to the
model.
• Techniques:
o L1 Regularization: Adds the sum of the absolute values of the weights to the loss function.
o L2 Regularization: Adds the sum of the squared weights to the loss function (weight decay).
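A sketch of adding L1 and L2 weight penalties to the reconstruction loss of an auto-encoder such as the one in item 9; the penalty coefficients are illustrative.

```python
from torch import nn

def regularized_loss(model, x, l1_coef=1e-5, l2_coef=1e-4):
    """Reconstruction loss plus L1/L2 penalties on all weights."""
    recon = nn.functional.mse_loss(model(x), x)
    l1 = sum(p.abs().sum() for p in model.parameters())    # sum of |w|
    l2 = sum((p ** 2).sum() for p in model.parameters())   # sum of w^2
    return recon + l1_coef * l1 + l2_coef * l2
```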
11. Denoising Auto-encoders
• Definition: Train on a noisy version of the input data and aim to reconstruct the clean
input.
• Objective: Improve the model's robustness and ability to capture relevant structures in
the data.
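A sketch of the denoising objective for the same kind of model: corrupt the input (here with Gaussian noise, an illustrative choice) but score the reconstruction against the clean input.

```python
import torch
from torch import nn

def denoising_loss(model, x, noise_std=0.1):
    """Reconstruct the clean x from a noisy copy of it."""
    noisy = x + noise_std * torch.randn_like(x)   # Gaussian corruption
    return nn.functional.mse_loss(model(noisy), x)
```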
12. Sparse Auto-encoders
• Definition: Apply sparsity constraints on the hidden layer activations to encourage
learning a compact and efficient representation.
• Technique: Use an additional sparsity penalty term in the loss function.
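A sketch of one common sparsity penalty, an L1 term on the hidden activations, for an encoder/decoder pair like the one in item 9; the coefficient is illustrative.

```python
from torch import nn

def sparse_loss(encoder, decoder, x, sparsity_coef=1e-3):
    """Reconstruction loss plus an L1 penalty on the code activations."""
    code = encoder(x)
    recon = nn.functional.mse_loss(decoder(code), x)
    return recon + sparsity_coef * code.abs().mean()
```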
13. Contractive Auto-encoders
• Definition: Penalize the gradient of the encoder's activations with respect to the input to
make the learned representation robust to small variations.
• Formula: Add a term to the loss function proportional to the squared Frobenius norm of the
Jacobian of the hidden representation with respect to the input.
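A sketch of the contractive penalty, using autograd to form the Jacobian of the code with respect to a single input example (batching and efficiency are ignored for brevity; the coefficient is illustrative).

```python
import torch
from torch import nn

def contractive_loss(encoder, decoder, x_single, lam=1e-3):
    """Reconstruction loss plus the squared Frobenius norm of d(code)/d(input)."""
    jac = torch.autograd.functional.jacobian(encoder, x_single, create_graph=True)
    recon = nn.functional.mse_loss(decoder(encoder(x_single)), x_single)
    return recon + lam * (jac ** 2).sum()
```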
14. Variational Auto-encoder
• Definition: A generative model that learns a probabilistic distribution over the latent
space, allowing for the generation of new data samples.
• Objective: Maximize the Evidence Lower Bound (ELBO) to ensure the latent variables
follow a desired distribution.
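A minimal VAE sketch in PyTorch showing the reparameterization trick and the negative ELBO; the layer sizes and the squared-error reconstruction term (a Gaussian likelihood assumption) are illustrative.

```python
import torch
from torch import nn

class VAE(nn.Module):
    """Encoder outputs the mean and log-variance of q(z|x); decoder maps z back to x."""
    def __init__(self, n_in=784, n_z=16):
        super().__init__()
        self.enc = nn.Linear(n_in, 128)
        self.mu = nn.Linear(128, n_z)
        self.logvar = nn.Linear(128, n_z)
        self.dec = nn.Sequential(nn.Linear(n_z, 128), nn.ReLU(), nn.Linear(128, n_in))

    def forward(self, x):
        h = torch.relu(self.enc(x))
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)  # reparameterization trick
        return self.dec(z), mu, logvar

def negative_elbo(recon, x, mu, logvar):
    """Negative ELBO = reconstruction error + KL(q(z|x) || N(0, I))."""
    rec = nn.functional.mse_loss(recon, x, reduction="sum")
    kl = -0.5 * torch.sum(1 + logvar - mu ** 2 - logvar.exp())
    return rec + kl

vae = VAE()
x = torch.rand(16, 784)
recon, mu, logvar = vae(x)
loss = negative_elbo(recon, x, mu, logvar)
```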
15. Auto-encoders' Relationship with PCA and SVD
• PCA: Auto-encoders can be seen as a non-linear generalization of PCA, which performs linear
dimensionality reduction.
• SVD: A linear auto-encoder with a k-unit bottleneck and squared-error loss learns the same
subspace as the top-k right singular vectors of the centered data matrix, which is exactly the
subspace PCA obtains via the SVD.
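A short NumPy sketch of PCA via SVD on centered data; this is the projection a linear (no-activation) auto-encoder with a k-unit bottleneck and squared-error loss recovers, up to an invertible linear transformation of the code. The random data are purely illustrative.

```python
import numpy as np

X = np.random.default_rng(0).normal(size=(200, 10))   # fake dataset
Xc = X - X.mean(axis=0)                                # center the data
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)

k = 3
codes = Xc @ Vt[:k].T                     # "encode": project onto top-k components
recon = codes @ Vt[:k] + X.mean(axis=0)   # "decode": reconstruct from the projection
```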
16. Dataset Augmentation
• Definition: Techniques to artificially increase the size and diversity of a dataset by
applying transformations like rotation, scaling, flipping, and adding noise.
• Purpose: Improve the model's generalization by providing more varied training
examples.
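A simple augmentation sketch in NumPy covering flips, 90-degree rotations, and additive noise; the specific transformations and noise level are illustrative (real pipelines typically also scale, crop, and adjust color).

```python
import numpy as np

def augment(image, rng):
    """Return a randomly transformed copy of a 2-D grayscale image array."""
    if rng.random() < 0.5:
        image = np.fliplr(image)                             # horizontal flip
    image = np.rot90(image, k=rng.integers(0, 4))            # random 90-degree rotation
    return image + rng.normal(scale=0.01, size=image.shape)  # small Gaussian noise

rng = np.random.default_rng(0)
augmented = [augment(np.ones((28, 28)), rng) for _ in range(5)]  # 5 variants of one image
```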