MODULE 4
SYLLABUS
Convolutional Neural Networks – Convolution operation, Motivation,
Pooling, Convolution and Pooling as an infinitely strong prior,
Variants of convolution functions, Structured outputs, Data types,
Efficient convolution algorithms. Practical use cases for CNNs, Case
study - Building the CNN model AlexNet for simple pattern analysis
tasks on benchmark datasets
Introduction
❑Convolutional networks (LeCun, 1989), also known as convolutional
neural networks or CNNs, are a specialized kind of neural network for
processing data that has a known, grid-like topology.
❑The name “convolutional neural network” indicates that the
network employs a mathematical operation called convolution.
Convolution is a specialized kind of linear operation. Convolutional
networks are simply neural networks that use convolution in place
of general matrix multiplication in at least one of their layers.
The Convolution Operation
Convolution on a 2-D input
An example of 2-D convolution without kernel flipping
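A minimal NumPy sketch of this operation (the sliding dot product used in CNNs, i.e., cross-correlation without kernel flipping); the input and kernel sizes below are illustrative assumptions:

    import numpy as np

    def conv2d(I, K):
        # CNN-style "convolution": slide the kernel over the input and sum
        # the elementwise products over each window (no kernel flipping).
        H, W = I.shape
        F = K.shape[0]                      # assume a square F x F kernel
        out = np.zeros((H - F + 1, W - F + 1))
        for i in range(H - F + 1):
            for j in range(W - F + 1):
                out[i, j] = np.sum(I[i:i+F, j:j+F] * K)
        return out

    I = np.arange(16.0).reshape(4, 4)         # illustrative 4x4 input
    K = np.array([[1.0, 0.0], [0.0, -1.0]])   # illustrative 2x2 kernel
    print(conv2d(I, K).shape)                 # (3, 3): output is (H-F+1) x (W-F+1)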
Relation between input size, output size, and filter size
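For reference (stride 1, stated along one spatial dimension): with input size I and filter size F, the output size is I − F + 1 with no padding, and I − F + 2P + 1 with P zeros of padding on each side; choosing P = (F − 1)/2 for odd F therefore keeps the output the same size as the input.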
To make the output size the same as that of the input: Padding
Padding
❑The convolution operation reduces the size of the output.
❑This type of reduction in size is not desirable in general, because it tends to lose
some information along the borders of the image.
❑This problem can be resolved by using padding. In padding, one adds (F − 1)/2
“pixels” all around the borders of the feature map in order to maintain the spatial
footprint.
❑The value of each of these padded feature values is set to 0, irrespective of
whether the input or the hidden layers are being padded.
❑As a result, the spatial height and width of the input volume will both increase
by (F − 1), which is exactly what they reduce by (in the output volume) after the
convolution is performed.
❑The padded portions do not contribute to the final dot product because their
values are set to 0.
❑This type of padding is referred to as half-padding because
(almost) half the filter is sticking out from all sides of the
spatial input in the case where the filter is placed in its
extreme spatial position along the edges.
❑Another useful form of padding is full-padding. In full-
padding, we allow (almost) the full filter to stick out from
various sides of the input.
❑In other words, a portion of the filter of size F − 1 is
allowed to stick out from any side of the input with an
overlap of only one spatial feature.
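A minimal NumPy sketch of half-padding; the feature-map and filter sizes are illustrative assumptions:

    import numpy as np

    F = 3                                      # assumed odd filter size
    x = np.random.rand(32, 32)                 # illustrative 32x32 feature map
    pad = (F - 1) // 2                         # half-padding: (F-1)/2 zeros per border
    x_padded = np.pad(x, pad, mode='constant', constant_values=0)
    print(x_padded.shape)                      # (34, 34): each spatial dimension grows by F-1,
                                               # exactly what a 3x3 convolution would remove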
Strides
❑The traditional approach performs convolution at every position in the feature
map, but this is not necessary.
❑Using the concept of strides, one can reduce the granularity of the convolution
and only perform it at certain spatial positions in the layer.
❑This can lead to a reduction in the spatial footprint of the image or layer, while
still maintaining important features.
❑When a stride of S is used in a layer, the convolution is performed at the
locations 1, S + 1, 2S + 1, and so on along both spatial dimensions of the layer.
❑As a result, the use of strides will reduce each spatial dimension of the layer by
a factor of approximately S.
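A small sketch of the resulting output size; the combined formula with padding P per side is an assumption stated for one spatial dimension:

    def out_size(I, F, S, P=0):
        # spatial output size for input size I, filter size F, stride S, padding P per side
        return (I - F + 2 * P) // S + 1

    print(out_size(32, 5, S=1))   # 28
    print(out_size(32, 5, S=2))   # 14 -- roughly 28/2, i.e., reduced by a factor of about S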
The Basic Structure of a Convolutional Network
❑Each layer in the convolutional network is a 3-dimensional grid structure,
which has a height, width, and depth.
❑The word “depth” refers to the number of channels in each layer, such as the
number of primary color channels (e.g., blue, green, and red) in the input image
or the number of feature maps in the hidden layers.
❑The convolutional neural network functions much like a traditional feed-
forward neural network, except that the operations in its layers are spatially
organized with sparse connections between layers.
❑The three types of layers that are commonly present in a convolutional neural
network are convolution, pooling, and ReLU.
Pooling
❑The pooling operation, however, is quite different from convolution. It
works on small grid regions of size P × P in each layer, and
produces another layer with the same depth.
❑For each square region of size P × P in each of the activation maps,
the maximum of these values is returned. This approach is referred
to as max-pooling. If a stride of 1 is used, then this will produce a
new layer of size (H − P + 1) × (W − P + 1) × D.
❑However, it is more common to use a stride S > 1 in pooling.
❑In such cases, the length of the new layer will be (H − P)/S + 1 and
the breadth will be (W − P)/S + 1. Therefore, pooling drastically
reduces the spatial dimensions of each activation map.
❑Unlike with convolution operations, pooling is done at the level of each
activation map.
❑Whereas a convolution operation simultaneously uses all feature maps in
combination with a filter to produce a single feature value, pooling
independently operates on each feature map to produce another feature map.
❑Therefore, the operation of pooling does not change the number of feature
maps. In other words, the depth of the layer created using pooling is the same
as that of the layer on which the pooling operation was performed.
❑Other types of pooling (like average-pooling) are possible but rarely used. In the earliest convolutional
network, referred to as LeNet-5, a variant of average pooling was used and was referred to as subsampling.
In general, max-pooling remains more popular than average pooling.
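A minimal NumPy sketch of max-pooling on a single activation map (applied independently per map, so the depth is unchanged); the map size is an illustrative assumption:

    import numpy as np

    def max_pool(A, P, S):
        # Max-pooling: take the maximum over each P x P region, moving with stride S,
        # on one activation map A of size H x W.
        H, W = A.shape
        out_h, out_w = (H - P) // S + 1, (W - P) // S + 1
        out = np.zeros((out_h, out_w))
        for i in range(out_h):
            for j in range(out_w):
                out[i, j] = A[i*S:i*S+P, j*S:j*S+P].max()
        return out

    A = np.random.rand(28, 28)                 # illustrative activation map
    print(max_pool(A, P=2, S=2).shape)         # (14, 14): ((H-P)/S + 1) x ((W-P)/S + 1)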
The ReLU Layer
❑The convolution operation is interleaved with the pooling and ReLU operations.
❑For each of the H ×W ×D values in a layer, the ReLU activation function is
applied to it to create H×W×D thresholded values.
❑These values are then passed on to the next layer. Therefore, applying the
ReLU does not change the dimensions of a layer because it is a simple one-to-
one mapping of activation values.
❑The use of the ReLU has tremendous advantages over earlier activation
functions such as the sigmoid and tanh, both in terms of speed and accuracy.
❑Increased speed is also connected to accuracy because it allows one to use
deeper models and train them for a longer time. In recent years, the use of the
ReLU activation function has largely replaced the other activation functions in
convolutional neural networks.
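A one-line NumPy sketch of the ReLU applied to a layer; the layer shape is an illustrative assumption:

    import numpy as np

    layer = np.random.randn(28, 28, 6)        # illustrative H x W x D layer of activations
    relu_out = np.maximum(layer, 0.0)         # elementwise thresholding at zero
    print(relu_out.shape)                     # (28, 28, 6): dimensions are unchanged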
LeNet-5
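The original slide shows the LeNet-5 architecture as a figure. A minimal PyTorch-style sketch of that classic layer sequence is given below; the use of tanh activations and average pooling (subsampling) is assumed here, since exact choices vary across descriptions:

    import torch.nn as nn

    # Layer sequence of the classic LeNet-5 for 32x32 grayscale inputs.
    lenet5 = nn.Sequential(
        nn.Conv2d(1, 6, kernel_size=5),     # C1: 6 feature maps of 28x28
        nn.Tanh(),
        nn.AvgPool2d(2),                    # S2: subsampling to 14x14
        nn.Conv2d(6, 16, kernel_size=5),    # C3: 16 feature maps of 10x10
        nn.Tanh(),
        nn.AvgPool2d(2),                    # S4: subsampling to 5x5
        nn.Conv2d(16, 120, kernel_size=5),  # C5: 120 feature maps of 1x1
        nn.Tanh(),
        nn.Flatten(),
        nn.Linear(120, 84),                 # F6: fully connected layer with 84 units
        nn.Tanh(),
        nn.Linear(84, 10),                  # output layer: 10 digit classes
    )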
Motivation
❑Convolution leverages three important ideas that can help improve a machine
learning system:
1. sparse interactions,
2. parameter sharing, and
3. equivariant representations.
SPARSE CONNECTIVITY
❑In a traditional neural network, every output unit interacts with every input unit
through matrix multiplication, so a layer with m inputs and n outputs requires
m × n parameters and O(m × n) runtime.
❑Convolutional networks instead have sparse interactions: the kernel is made
smaller than the input, so each output depends on only k inputs. This requires
only k × n parameters and O(k × n) runtime.
PARAMETER SHARING
❑Parameter sharing refers to using the same parameter for more than one
function in a model.
❑In a traditional neural net, each element of the weight matrix is used exactly
once when computing the output of a layer. It is multiplied by one element of
the input and then never revisited.
❑In a convolutional neural net, each member of the kernel is used at every
position of the input.
❑The parameter sharing used by the convolution operation means that rather
than learning a separate set of parameters for every location, we learn only one
set.
❑This does not affect the runtime of forward propagation—it is still O(k × n)—
but it does further reduce the storage requirements of the model to k
parameters. Recall that k is usually several orders of magnitude less than m.
❑Since m and n are usually roughly the same size, k is practically insignificant
compared to m × n. Convolution is thus dramatically more efficient than dense
matrix multiplication in terms of the memory requirements and statistical
efficiency.
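❑As an illustrative calculation (with assumed sizes): for a 1-D layer with m = n = 1,000
units, a dense connection would need m × n = 1,000,000 weights, whereas a shared
kernel of size k = 9 needs only 9 stored weights while still using about
k × n = 9,000 multiplications in the forward pass.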
EQUIVARIANT REPRESENTATIONS
❑To say a function is equivariant means that if the input changes, the
output changes in the same way. Specifically, a function f(x) is
equivariant to a function g if f(g(x)) = g(f(x)).
❑In the case of convolution, if we let g be any function that
translates the input, i.e., shifts it, then the convolution function is
equivariant to g.
❑Convolution is not naturally equivariant to some other
transformations, such as changes in the scale or rotation of an
image. Other mechanisms are necessary for handling these kinds of
transformations.
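A small numerical check of translation equivariance, using SciPy's cross-correlation (which matches the CNN convention of no kernel flipping); the input and kernel sizes are illustrative assumptions:

    import numpy as np
    from scipy.signal import correlate2d

    I = np.random.rand(6, 6)                       # illustrative input
    K = np.random.rand(3, 3)                       # illustrative kernel
    shift = lambda A: np.roll(A, 1, axis=0)        # g: translate down by one row

    a = correlate2d(shift(I), K, mode='valid')     # f(g(x)): shift, then convolve
    b = shift(correlate2d(I, K, mode='valid'))     # g(f(x)): convolve, then shift
    print(np.allclose(a[1:], b[1:]))               # True: identical except at the wrapped border row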
TYPES OF CONVOLUTION
Data types
❑The data used with a convolutional network usually consists of several
channels, each channel being the observation of a different quantity at some
point in space or time.
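A brief sketch of typical array shapes for such multi-channel data; the specific sizes and the channels-first layout are illustrative assumptions:

    import numpy as np

    audio = np.zeros((16000,))             # 1-D data, single channel: audio waveform samples
    image = np.zeros((3, 224, 224))        # 2-D data, 3 channels (RGB), channels-first layout
    video = np.zeros((3, 60, 224, 224))    # 3-D data (frames x height x width), 3 channels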
ALEXNET
Architecture
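As a starting point for the case study, a minimal PyTorch-style sketch of the AlexNet convolutional architecture (single-stream version; the channel counts follow the original paper, and a 1000-class ImageNet output layer is assumed):

    import torch.nn as nn

    # Sketch of AlexNet for 224x224 RGB inputs.
    alexnet = nn.Sequential(
        nn.Conv2d(3, 96, kernel_size=11, stride=4, padding=2), nn.ReLU(),
        nn.MaxPool2d(kernel_size=3, stride=2),
        nn.Conv2d(96, 256, kernel_size=5, padding=2), nn.ReLU(),
        nn.MaxPool2d(kernel_size=3, stride=2),
        nn.Conv2d(256, 384, kernel_size=3, padding=1), nn.ReLU(),
        nn.Conv2d(384, 384, kernel_size=3, padding=1), nn.ReLU(),
        nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(),
        nn.MaxPool2d(kernel_size=3, stride=2),
        nn.Flatten(),
        nn.Linear(256 * 6 * 6, 4096), nn.ReLU(), nn.Dropout(0.5),
        nn.Linear(4096, 4096), nn.ReLU(), nn.Dropout(0.5),
        nn.Linear(4096, 1000),            # 1000 ImageNet classes
    )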