This Session
• Neural Network and Image
– Dimensionality
– Local relationship
• Convolutional Neural Network (CNN)
– Convolution Layer
– Non-linearity Layer
– Pooling Layer
– Fully Connected Layer
– Classification Layer
• ImageNet Challenge
– Progress
– Human Level Performance
Neural Networks
Source: [Link]
Multi-layer Neural Network & Image
How to apply a NN to an image?
Multi-layer Neural Network & Image
Stretch pixels
in single
column vector
Multi-layer Neural Network & Image
Stretch pixels
in single
column vector
Problems ?
Multi-layer Neural Network & Image
Stretch pixels
in single
column vector
Problems:
High dimensionality
Local relationship
Multi-layer Neural Network & Image
Stretch pixels
in single
column vector
Problems: Solution ?
High dimensionality
Local relationship
Multi-layer Neural Network & Image
Stretch the pixels into a single column vector
Problems:
– High dimensionality
– Loss of local (spatial) relationships
Solution: Convolutional Neural Network
Convolutional Neural Networks
• Also known as
CNN,
ConvNet,
DCN
• CNN = a multi-layer neural network with
1. Local connectivity
2. Weight sharing
CNN: Local Connectivity
[Diagram: input layer (7 units) and hidden layer (3 units), global vs. local connectivity]
• # input units (neurons): 7
• # hidden units: 3
• Number of parameters
– Global connectivity: 3 x 7 = 21
– Local connectivity: 3 x 3 = 9
CNN: Weight Sharing
[Diagram: without sharing, each connection has its own weight (w1 … w9); with sharing, all hidden units reuse w1, w2, w3]
• # input units (neurons): 7
• # hidden units: 3
• Number of parameters
– Without weight sharing: 3 x 3 = 9
– With weight sharing: 3 x 1 = 3
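The parameter counts above can be checked with a tiny sketch. The numbers (7 inputs, 3 hidden units, filter size 3) come from the slide; the stride-2 tiling of the three receptive fields is an assumption made so that 7 inputs yield exactly 3 hidden units.

```python
import numpy as np

# Toy setting from the slide: 7 input units, 3 hidden units, filter size 3.
# Assumed: the three 3-wide receptive fields tile the input at stride 2.
x = np.arange(7, dtype=float)      # 7 input units: 0, 1, ..., 6
w = np.array([0.5, -1.0, 2.0])     # the 3 shared weights (w1, w2, w3)
stride = 2

# With weight sharing, every hidden unit reuses the same 3 weights,
# so the layer has only 3 parameters instead of 3 x 3 = 9.
hidden = np.array([w @ x[i * stride : i * stride + 3] for i in range(3)])
print(hidden.shape)   # (3,)
```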
Convolutional Neural Networks
Source: cs231n, Stanford University
Layers used to build ConvNets
Input Layer (Input image)
Convolutional Layer
Non-linearity Layer (such as Sigmoid, Tanh, ReLU, PReLU,
ELU, Swish, etc.)
Pooling Layer (such as Max Pooling, Average Pooling, etc.)
Fully-Connected Layer
Classification Layer (Softmax, etc.)
Convolutional Layer
32×32×3 image -> preserve spatial structure
[Figure: image volume, width 32, height 32, depth 3]
Convolutional Layer
Handling multiple input channels: filters always extend the full depth of the input volume
[Figure: 32×32×3 image and a 5×5×3 filter]
Convolve the filter with the image, i.e. "slide over the image spatially, computing dot products"
Convolutional Layer
[Figure: 32×32×3 image and a 5×5×3 filter (weight mask)]
A single value: the result of taking a dot product between the filter and a small 5×5×3 chunk of the image (i.e. a 5*5*3 = 75-dimensional dot product, plus a bias): wᵀx + b
Convolutional Layer
[Figure: convolve (slide) the 5×5×3 filter over all spatial locations of the 32×32×3 image, producing a 28×28×1 activation map]
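The sliding dot product described above can be sketched directly in NumPy, using the slide's shapes (32×32×3 image, one 5×5×3 filter, stride 1, no padding). This is a naive sketch for clarity, not an efficient implementation.

```python
import numpy as np

# One 5x5x3 filter slid over a 32x32x3 image with stride 1 and no padding
# produces a (32 - 5) + 1 = 28 x 28 activation map.
rng = np.random.default_rng(0)
image = rng.standard_normal((32, 32, 3))
filt = rng.standard_normal((5, 5, 3))   # the filter spans the full depth
bias = 0.1

out = np.empty((28, 28))
for y in range(28):
    for x in range(28):
        patch = image[y:y + 5, x:x + 5, :]           # a 5x5x3 chunk
        out[y, x] = np.sum(patch * filt) + bias      # w^T x + b
print(out.shape)   # (28, 28)
```

With 96 such filters, the 96 resulting 28×28 maps stack into a 28×28×96 output volume.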
Convolutional Layer
Handling multiple output maps
[Figure: 32×32×3 image convolved with 96 separate 5×5×3 filters, each producing its own 28×28×1 activation map]
Total: 96 filters
Depth of output volume: 96
Image Source: cs231n, Stanford University
Convolutional Layer
Preview: a ConvNet is a sequence of convolution layers, interspersed with activation functions
[Figure: 32×32×3 image -> CONV (e.g. 96 5×5×3 filters) -> 28×28×96 activation maps]
Multilayer Convolution
Preview: a ConvNet is a sequence of convolution layers, interspersed with activation functions
[Figure: 32×32×3 -> CONV (96 5×5×3 filters) -> 28×28×96 -> CONV (128 5×5×96 filters) -> 24×24×128 -> CONV -> …]
Any Convolution Layer
• Local connectivity
• Weight sharing
• Handling multiple input channels
• Handling multiple output maps
[Diagram: a convolution layer annotated with weight sharing, local connectivity, # input channels, and # output (activation) maps. Image credit: A. Karpathy]
A closer look at spatial dimensions
7×7 input (spatially); assume a 3×3 filter applied with stride 1
=> 5×5 output
A closer look at spatial dimensions
7×7 input (spatially); assume a 3×3 filter applied with stride 2
=> 3×3 output
A closer look at spatial dimensions
7×7 input (spatially); assume a 3×3 filter applied with stride 3
Doesn't fit! A 3×3 filter cannot be applied to a 7×7 input with stride 3.
A closer look at spatial dimensions
Output size: (N - F) / stride + 1
e.g. N = 7, F = 3:
stride 1 => (7 - 3)/1 + 1 = 5
stride 2 => (7 - 3)/2 + 1 = 3
stride 3 => (7 - 3)/3 + 1 = 2.33 (not an integer, so the filter doesn't fit)
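The output-size rule can be written as a small helper (a sketch; the name `conv_output_size` is ours):

```python
# Spatial output size of a convolution: (N - F) / stride + 1.
# The filter only "fits" when (N - F) is divisible by the stride.
def conv_output_size(n, f, stride):
    if (n - f) % stride != 0:
        raise ValueError("filter does not fit: (N - F) is not divisible by stride")
    return (n - f) // stride + 1

print(conv_output_size(7, 3, 1))   # 5
print(conv_output_size(7, 3, 2))   # 3
# conv_output_size(7, 3, 3) raises ValueError: stride 3 does not fit a 7x7 input
```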
A closer look at spatial dimensions
E.g. a 32×32 input convolved repeatedly with 5×5 filters shrinks the volumes spatially (32 -> 28 -> 24 -> …). Shrinking too fast is not good; it doesn't work well.
Source: cs231n, Stanford University
In practice: common to zero pad
[Figure: 7×7 input surrounded by a 1-pixel border of zeros]
e.g. input 7×7 (spatially), 3×3 filter applied with stride 1, padded with a 1-pixel border
=> 7×7 output
In general, it is common to see CONV layers with stride 1, filters of size F×F, and zero-padding of (F-1)/2, which preserves the spatial size.
e.g.
F = 3 => zero pad with 1
F = 5 => zero pad with 2
F = 7 => zero pad with 3
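A quick sketch of the padding rule, with an assumed 7×7 input and a 3×3 averaging filter, so pad = (3 - 1)/2 = 1:

```python
import numpy as np

# With stride 1 and zero-padding of (F - 1) / 2, convolution preserves
# the spatial size of the input.
x = np.arange(49, dtype=float).reshape(7, 7)   # 7x7 input
f = np.ones((3, 3)) / 9.0                      # 3x3 averaging filter
pad = (3 - 1) // 2                             # = 1

xp = np.pad(x, pad)                            # zero-pad a 1-pixel border -> 9x9
out = np.empty((7, 7))
for i in range(7):
    for j in range(7):
        out[i, j] = np.sum(xp[i:i + 3, j:j + 3] * f)
print(out.shape)   # (7, 7), the same spatial size as the input
```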
Example
Input volume: 32×32×3
10 5×5 filters with stride 1, pad 2
Output volume size: (32 + 2*2 - 5)/1 + 1 = 32 spatially, so 32×32×10
Example
Input volume: 32×32×3
10 5×5 filters with stride 1, pad 2
Number of parameters in this layer?
Each filter has 5*5*3 + 1 = 76 params (+1 for the bias)
=> 76 * 10 = 760
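The same count as a small helper (a sketch; the name `conv_params` is ours):

```python
# Parameters of a conv layer: each filter has F*F*in_depth weights plus 1 bias,
# and the layer holds num_filters such filters.
def conv_params(f, in_depth, num_filters):
    return (f * f * in_depth + 1) * num_filters

print(conv_params(5, 3, 10))   # 760, matching the example above
```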
Source: cs231n, Stanford University
Convolution as feature extraction
Source: cs231n, Stanford University
Non-linearity Layer
Source: cs231n, Stanford University
Pooling Layer
- makes the representations smaller and more manageable
- operates over each activation map independently:
Source: cs231n, Stanford University
Max Pooling
Source: cs231n, Stanford University
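Max pooling can be sketched in a few lines of NumPy, here with 2×2 windows at stride 2 on an assumed 4×4 activation map:

```python
import numpy as np

# 2x2 max pooling with stride 2: keep the largest value in each window.
# Pooling operates over each activation map independently.
a = np.array([[1., 1., 2., 4.],
              [5., 6., 7., 8.],
              [3., 2., 1., 0.],
              [1., 2., 3., 4.]])

# Split the map into 2x2 blocks, then take the max inside each block.
pooled = a.reshape(2, 2, 2, 2).max(axis=(1, 3))
print(pooled)   # [[6. 8.]
                #  [3. 4.]]
```

The 4×4 map shrinks to 2×2, halving each spatial dimension while keeping the strongest activations.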
Pooling Layer
Source: cs231n, Stanford University
Fully Connected Layer
• Connect every neuron in one layer to every neuron in
another layer
• Same as the traditional multi-layer perceptron neural
network
Image Source: [Link]
• No. of neurons in the last FC layer = No. of classes
Loss/Classification Layer
• SVM classifier (SVM loss / hinge loss / max-margin loss)
• Softmax classifier (softmax loss / cross-entropy loss)
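The softmax classifier loss can be sketched on a single example (the three class scores are assumed toy values):

```python
import numpy as np

scores = np.array([3.2, 5.1, -1.7])   # unnormalized class scores
y = 0                                  # index of the correct class

shifted = scores - scores.max()        # shift scores for numerical stability
probs = np.exp(shifted) / np.exp(shifted).sum()
loss = -np.log(probs[y])               # cross-entropy (softmax) loss
print(round(float(loss), 2))   # 2.04
```

The hinge (SVM) loss would instead sum max(0, s_j - s_y + 1) over the incorrect classes j.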
A typical CNN structure
Image Source: [Link]
ImageNet Challenge
• ~14 million labeled images, 20k classes
• Images gathered from the Internet
• Human labels via Amazon Mechanical Turk
• Challenge: 1.2 million training images, 1000 classes
[Link]/challenges/LSVRC/
Progress on ImageNet Challenge
[Chart: ImageNet top-5 image classification error falling year over year: 16.4 -> 11.7 -> 7.3 -> 6.7 -> 3.57 -> 3.06 -> 2.251]
Best non-ConvNet in 2012: 26.2%
Things to remember
• Neural network and Image
– Neuroscience, Perceptron, Problems due to High
Dimensionality and Local Relationship
• Convolutional neural network (CNN)
– Convolution Layer,
– Nonlinearity Layer,
– Pooling Layer,
– Fully Connected Layer,
– Loss/Classification Layer
• Progress on the ImageNet challenge
– Latest: SENet, winner in 2017
Acknowledgements
• Thanks to the following researchers for making their teaching/research material available online
– Forsyth
– Steve Seitz
– Noah Snavely
– J.B. Huang
– Derek Hoiem
– D. Lowe
– A. Bobick
– S. Lazebnik
– K. Grauman
– R. Zaleski
– Antonio Torralba
– Rob Fergus
– Leibe
– And many more…