Convolutional Neural Networks
INTELLIGENT SYSTEMS FOR PATTERN RECOGNITION (ISPR)
DAVIDE BACCIU – DIPARTIMENTO DI INFORMATICA - UNIVERSITA’ DI PISA
[email protected]
Lecture Outline
○ Introduction and historical perspective
○ Dissecting the components of a CNN
● Convolution, stride, pooling
○ CNN architectures for machine vision
● Putting components back together
● From LeNet to ResNet
○ Advanced topics
● Interpreting convolutions
● Advanced models and applications
(Split in two lectures)
DAVIDE BACCIU - ISPR COURSE 2
CNN Lecture – Part I
Introduction
Convolutional Neural Networks
Destroying Machine Vision research since 2012
DAVIDE BACCIU - ISPR COURSE 5
Neocognitron
○ Hubel & Wiesel ('59) model of visual processing in the brain
● Simple cells responding to localized features
● Complex cells pooling responses of simple cells for invariance
○ Fukushima ('80) built the first hierarchical image processing architecture exploiting this model
● Trained by unsupervised learning
DAVIDE BACCIU - ISPR COURSE 6
CNN for Sequences
○ Apply a bank of 16 convolution kernels
to sequences (windows of 15 elements)
○ Trained by backpropagation with
parameter sharing
○ Guess who introduced it?
…yeah, HIM!
Time delay neural network
(Waibel & Hinton, 1987)
DAVIDE BACCIU - ISPR COURSE 7
CNN for Images
The first convolutional neural network for images dates back to 1989 (LeCun)
DAVIDE BACCIU - ISPR COURSE 8
Dense Vector Multiplication
Processing images: the dense way
○ A 32x32x3 image is reshaped into a vector 𝒙 of size 3072
○ Weight matrix 𝑾 of size 100x3072: an input-sized weight vector for each hidden neuron
○ The product 𝑾𝒙 is a 100-dimensional vector: each element contains the activation of one neuron
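A minimal numpy sketch of this dense baseline (array names are illustrative; the sizes follow the slide):

import numpy as np

image = np.random.rand(32, 32, 3)   # toy 32x32 RGB image
x = image.reshape(-1)               # flatten into a 3072-dim vector
W = np.random.rand(100, 3072)       # one input-sized weight vector per hidden neuron
activations = W @ x                 # 100 values, one activation per neuron
print(activations.shape)            # (100,)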
DAVIDE BACCIU - ISPR COURSE 10
About invariances
MLPs are positional: we (most likely) need translation invariance!
• If we unfold the two images into two vectors, the features identifying the cat will be in different positions
• But it still remains a picture of a cat, which we would like to classify as such irrespective of its position in the image
DAVIDE BACCIU - ISPR COURSE 11
An inductive bias to keep in mind
Nearby pixels are more correlated than far-away ones
The input representation should not destroy pixel relationships (like vectorization does)
DAVIDE BACCIU - ISPR COURSE 12
Convolution (Refresher)
5x5 filter: sum of 25 multiplications + bias
32x32 matrix input preserving spatial structure
DAVIDE BACCIU - ISPR COURSE 13
Adaptive Convolution
Two 3x3 image patches, centered at positions (2,2) and (9,7)
1 0 1      1 0 1
2 3 4      0 2 0
1 0 1      1 0 1
are convolved with a filter (kernel) with (adaptive) weights 𝑤𝑖
𝑤1 𝑤2 𝑤3
𝑤4 𝑤5 𝑤6
𝑤7 𝑤8 𝑤9
giving 𝑐1 = 𝒘𝑇 𝒙2,2 and 𝑐2 = 𝒘𝑇 𝒙9,7
𝑐1 = 𝑤1 + 𝑤3 + 2𝑤4 + 3𝑤5 + 4𝑤6 + 𝑤7 + 𝑤9
𝑐2 = 𝑤1 + 𝑤3 + 2𝑤5 + 𝑤7 + 𝑤9
DAVIDE BACCIU - ISPR COURSE 14
Convolutional Features
Convolution features: a 32x32 image gives a 28x28 map (with a 5x5 filter)
Slide the filter on the image, computing elementwise products and summing up
DAVIDE BACCIU - ISPR COURSE 15
Multi-Channel Convolution
The 5x5x3 convolution filter has a number of slices equal to the number of image channels
32x32x3 input image
DAVIDE BACCIU - ISPR COURSE 16
Multi-Channel Convolution
28x28 convolution map
All channels are typically convolved together
o They are summed up in the convolution
o The convolution map stays two-dimensional
DAVIDE BACCIU - ISPR COURSE 17
Stride
○ Basic convolution slides the filter
on the image one pixel at a time
● Stride = 1
DAVIDE BACCIU - ISPR COURSE 18
Stride
○ Basic convolution slides the filter
on the image one pixel at a time
● Stride = 1
○ Can define a different stride
● Hyperparameter
stride = 2
Works in both directions!
DAVIDE BACCIU - ISPR COURSE 26
Stride
○ Basic convolution slides the filter
on the image one pixel at a time
● Stride = 1
○ Can define a different stride
● Hyperparameter
○ Stride reduces the number of multiplications
● Subsamples the image
stride = 3
DAVIDE BACCIU - ISPR COURSE 27
Activation Map Size
What is the size of the image after application of a filter with a given
size and stride?
W=7
Take a 3x3 filter with stride 1
K=3, S=1
H=7
Output image is: 5x5
DAVIDE BACCIU - ISPR COURSE 30
Activation Map Size
What is the size of the image after application of a filter with a given
size and stride?
W=7
Take a 3x3 filter with stride 2
K=3, S=2
H=7
Output image is: 3x3
DAVIDE BACCIU - ISPR COURSE 31
Activation Map Size
What is the size of the image after application of a filter with a given
size and stride?
W=7, H=7
General rule
𝑊′ = (𝑊 − 𝐾)/𝑆 + 1
𝐻′ = (𝐻 − 𝐾)/𝑆 + 1
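A minimal Python check of this rule on the examples above (the function name is illustrative):

def output_size(w, k, s):
    # (W - K) / S + 1, meaningful only when S divides (W - K) evenly
    return (w - k) // s + 1

print(output_size(7, 3, 1))    # 5 -> 5x5 output
print(output_size(7, 3, 2))    # 3 -> 3x3 output
print((7 - 3) % 3 == 0)        # False: stride 3 does not fit (see the next slide)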
DAVIDE BACCIU - ISPR COURSE 32
Activation Map Size
What is the size of the image after application of a filter with a given
size and stride?
W=7
Take a 3x3 filter with stride 3
K=3, S=3
H=7
Output image is: not really an image!
DAVIDE BACCIU - ISPR COURSE 33
Zero Padding
Add columns and rows of zeros to the border of the image
W=7, H=7 image padded with a border of zeros
DAVIDE BACCIU - ISPR COURSE 34
Zero Padding
Add columns and rows of zeros to the border of the image
W=7, H=7, P=1 (border of zeros), K=3, S=1
Output image is?
𝑊′ = (𝑊 − 𝐾 + 2𝑃)/𝑆 + 1
Output image is: 7x7
DAVIDE BACCIU - ISPR COURSE 35
Zero Padding
Add columns and rows of zeros to the border of the image
W=7, H=7, P=1
Zero padding serves to retain the original size of the image
𝑃 = (𝐾 − 1)/2
Pad as necessary to perform convolutions with a given stride S
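A quick numeric check of the padded rule, and of P = (K − 1)/2 preserving the size for odd kernels at stride 1 (a small sketch; the function name is illustrative):

def padded_output_size(w, k, s, p):
    # (W - K + 2P) / S + 1
    return (w - k + 2 * p) // s + 1

print(padded_output_size(7, 3, 1, 1))       # 7: the original size is retained
for k in (3, 5, 7):
    p = (k - 1) // 2                        # "same"-style padding
    print(padded_output_size(32, k, 1, p))  # 32 for every odd K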
DAVIDE BACCIU - ISPR COURSE 36
Feature Map Transformation
𝒘𝑇 𝒙𝑖,𝑗 + 𝑏  →  max(0, 𝒘𝑇 𝒙𝑖,𝑗 + 𝑏)
32x32x3 input → 32x32 feature map → 32x32 transformed feature map
○ Convolution is a linear operator
○ Apply an element-wise nonlinearity to obtain a transformed feature map
DAVIDE BACCIU - ISPR COURSE 37
Pooling
○ Operates on the feature map to make the representation
● Smaller (subsampling)
● Robust to (some) transformations
Feature map (W=4, H=4)
1 1 2 4
5 6 7 8
3 2 1 0
1 2 3 4
Max pooling with 2x2 filters, stride = 2
Pooled map (W’=2, H’=2)
6 8
3 4
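A minimal numpy check of this example (2x2 max pooling with stride 2, via a reshape trick):

import numpy as np

fmap = np.array([[1, 1, 2, 4],
                 [5, 6, 7, 8],
                 [3, 2, 1, 0],
                 [1, 2, 3, 4]])
# split into non-overlapping 2x2 blocks and take the max of each block
pooled = fmap.reshape(2, 2, 2, 2).max(axis=(1, 3))
print(pooled)   # [[6 8]
                #  [3 4]]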
DAVIDE BACCIU - ISPR COURSE 38
Pooling Facts
○ Max pooling is the most frequently used, but other forms are possible
● Average pooling
● L2-norm pooling
● Random pooling
○ It is uncommon to use zero padding with pooling
𝑊′ = (𝑊 − 𝐾)/𝑆 + 1
DAVIDE BACCIU - ISPR COURSE 39
The Convolutional Architecture
Convolutional layer (Input → Convolutional Filters (strided adaptive convolution) → Nonlinearity (ReLU) → Pooling (max) → to next layer)
○ An architecture made by a hierarchical composition of the basic elements
○ Convolution layer is an abstraction for the composition of the 3 basic operations
○ Network parameters are in the convolutional component
DAVIDE BACCIU - ISPR COURSE 40
A Bigger Picture
Input → CL 1 → CL 2 → CL 3 → CL 4 (sparse connectivity) → FCL 1 → FCL 2 → Output (dense connectivity)
CL -> Convolutional Layer: contains several convolutional filters with different size and stride
FCL -> Fully Connected Layer
DAVIDE BACCIU - ISPR COURSE 41
Convolutional Filter Banks
𝐷𝐾 convolutional filters of size 𝐾 × 𝐾 applied to an 𝐻 × 𝑊 × 𝐷𝐼 input
Feature map + nonlinearity: 𝐻′ × 𝑊′ × 𝐷𝐾
Pooling: 𝐻′′ × 𝑊′′ × 𝐷𝐾
Number of model parameters due to this convolution element: 𝐾 × 𝐾 × 𝐷𝐼 × 𝐷𝐾 (add 𝐷𝐾 bias terms)
Pooling is often (not always) applied independently on the 𝐷𝐾 convolutions
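A quick check of this parameter count against the first convolution of the Keras example on the next slide (K=5, D_K=32 filters, and assuming an RGB input so D_I=3):

K, D_I, D_K = 5, 3, 32
params = K * K * D_I * D_K + D_K    # weights plus one bias per filter
print(params)                        # 2432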
DAVIDE BACCIU - ISPR COURSE 42
Specifying CNN in Code (Keras)
Number of convolution filters 𝐷𝑘 Define input size (only first hidden layer)
from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Activation, Flatten, Dense

model = Sequential()
model.add(Conv2D(32, kernel_size=(5, 5), strides=(1, 1),
                 activation='relu',
                 input_shape=input_shape))
model.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2)))
model.add(Conv2D(64, (5, 5)))
model.add(Activation('relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(Flatten())
model.add(Dense(1000, activation='relu'))
model.add(Dense(num_classes, activation='softmax'))
Keras does all the calculations for you to determine the final input size to the dense layer
DAVIDE BACCIU - ISPR COURSE 43
A (Final?) Note on Convolution
○ We know that discrete convolution between an image 𝐼 and a filter/kernel 𝐾 is
(𝐼 ∗ 𝐾)(𝑖, 𝑗) = Σ𝑚 Σ𝑛 𝐼(𝑖 − 𝑚, 𝑗 − 𝑛) 𝐾(𝑚, 𝑛)
and it is commutative.
○ In practice, the convolution implementation in DL libraries does not flip the kernel
(𝐼 ∗ 𝐾)(𝑖, 𝑗) = Σ𝑚 Σ𝑛 𝐼(𝑖 + 𝑚, 𝑗 + 𝑛) 𝐾(𝑚, 𝑛)
which is cross-correlation, and it is not commutative.
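A small numpy/scipy sketch of the difference (valid mode, 2D):

import numpy as np
from scipy.signal import convolve2d, correlate2d

I = np.random.rand(5, 5)
K = np.random.rand(3, 3)

conv = convolve2d(I, K, mode='valid')               # true convolution (kernel flipped)
corr = correlate2d(I, K, mode='valid')              # what DL layers actually compute
flip = correlate2d(I, K[::-1, ::-1], mode='valid')  # correlation with a flipped kernel

print(np.allclose(conv, flip))   # True: convolution = correlation with the flipped kernel
print(np.allclose(conv, corr))   # False in general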
DAVIDE BACCIU - ISPR COURSE 44
CNN as a Sparse Neural Network
Let us take a 1-D input (sequence) to ease graphics
Convolution
(each hidden unit is connected to a window of 3 inputs, and the edge weights a, b, c are reused at every position)
Input
Convolution amounts to sparse connectivity (reduce parameters)
with parameter sharing (enforces invariance)
DAVIDE BACCIU - ISPR COURSE 45
Dense Network
The dense counterpart would look like this
DAVIDE BACCIU - ISPR COURSE 46
Strided Convolution
Make connectivity sparser
DAVIDE BACCIU - ISPR COURSE 47
Max-Pooling and Spatial Invariance
A feature is detected even if it is spatially translated
Pooling
Feature map
Pooling
Feature map
DAVIDE BACCIU - ISPR COURSE 48
Cross Channel Pooling and Spatial Invariance
(pooling is computed across different feature maps, e.g. map 1 and map 3, obtained from the same input)
DAVIDE BACCIU - ISPR COURSE 49
Hierarchical Feature Organization
The deeper the layer, the larger the receptive field of a unit
DAVIDE BACCIU - ISPR COURSE 50
Zero-Padding Effect
Assuming
no pooling
DAVIDE BACCIU - ISPR COURSE 51
CNN Lecture – Part II
CNN Training
Variants of the standard backpropagation that account for the fact that
connections share weights (convolution parameters)
The gradient ∆𝑤𝑖 is obtained by summing the contributions from all connections sharing the weight 𝑤𝑖 (e.g. the units 𝑎1, 𝑎2, 𝑎3 each reuse the weights 𝑤1, 𝑤2, 𝑤3)
Backpropagating gradients from convolutional layer N to N-1 is not as simple
as transposing the weight matrix (need deconvolution with zero padding)
DAVIDE BACCIU - ISPR COURSE 53
Backpropagating on Convolution
Convolution with K=3, S=1
Input is a 4x4 image
Output is a 2x2 image
Backpropagation step requires
going back from the 2x2 to the
4x4 representation
Can write convolution as dense multiplication with shared weights
Backpropagation is performed by multiplying the 4x1 output representation by the transpose of this matrix
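A minimal numpy sketch of this view, following the 4x4 input / 3x3 kernel example above (variable names are illustrative):

import numpy as np
from scipy.signal import correlate2d

X = np.arange(16, dtype=float).reshape(4, 4)   # 4x4 input image
K = np.random.rand(3, 3)                       # 3x3 kernel, stride 1 -> 2x2 output

# Build the 4x16 dense matrix C with shared weights: row r corresponds to
# output position (i, j) and holds the kernel weights at the right columns
C = np.zeros((4, 16))
for i in range(2):
    for j in range(2):
        for m in range(3):
            for n in range(3):
                C[i * 2 + j, (i + m) * 4 + (j + n)] = K[m, n]

y = C @ X.reshape(-1)                          # forward pass as a dense multiplication
print(np.allclose(y.reshape(2, 2), correlate2d(X, K, mode='valid')))  # True

grad_y = np.ones(4)                            # some upstream gradient (4x1)
grad_X = (C.T @ grad_y).reshape(4, 4)          # backprop: multiply by the transpose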
DAVIDE BACCIU - ISPR COURSE 54
Deconvolution (Transposed Convolution)
We can obtain the transposed convolution using the same logic as the forward convolution
K=3, S=1, P=0
If you had no padding in the forward convolution, you need to zero-pad (heavily) when performing the transposed convolution
DAVIDE BACCIU - ISPR COURSE 55
Deconvolution (Transposed Convolution)
If you have striding, you need to fill in the convolution map with zeroes to
obtain a correctly sized deconvolution
K=3, S=2, P=1
https://github.com/vdumoulin/conv_arithmetic
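In Keras this operation is available as a layer; a minimal sketch (the sizes are illustrative, not tied to a specific architecture):

from keras.models import Sequential
from keras.layers import Conv2DTranspose

model = Sequential()
# upsample a 14x14x64 map to 28x28x32: stride 2 fills in zeros internally
model.add(Conv2DTranspose(32, kernel_size=(3, 3), strides=(2, 2),
                          padding='same', input_shape=(14, 14, 64)))
print(model.output_shape)   # (None, 28, 28, 32)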
DAVIDE BACCIU - ISPR COURSE 56
LeNet-5 (1998)
○ Grayscale images
○ Filters are 5x5 with stride 1 (sigmoid nonlinearity)
○ Pooling is 2x2 with stride 2
○ No zero padding
DAVIDE BACCIU - ISPR COURSE 57
AlexNet (2012) - Architecture
ImageNet Top-5 : 15.4%
○ RGB images 227x227x3
○ 5 convolutional layers + 3 fully connected layers
○ Split into two parts (top/bottom) each on 1 GPU
DAVIDE BACCIU - ISPR COURSE 58
Data Augmentation
Key intuition: if I have an image with a given label, I can transform it (by flipping, rotation, etc.) and the resulting image will still have the same label
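A minimal Keras sketch of label-preserving augmentation (parameter values are illustrative):

from keras.preprocessing.image import ImageDataGenerator

augmenter = ImageDataGenerator(rotation_range=20,       # random rotations up to 20 degrees
                               width_shift_range=0.1,   # random horizontal shifts
                               height_shift_range=0.1,  # random vertical shifts
                               horizontal_flip=True)    # random mirroring
# augmenter.flow(x_train, y_train, batch_size=32) then yields batches of
# randomly transformed images paired with the original labels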
DAVIDE BACCIU - ISPR COURSE 59
AlexNet - Innovations
○ Use heavy data augmentation (rotations, random crops, etc.)
○ Introduced the use of ReLU
○ Dense layers regularized by dropout
DAVIDE BACCIU - ISPR COURSE 60
ReLU Nonlinearity
Sigmoid issues: saturation, non zero-centered outputs. ReLU issue: dead units!
○ ReLU helps counteract vanishing gradients
● The sigmoid first derivative vanishes as we increase or decrease z
● The ReLU first derivative is 1 when the unit is active and 0 elsewhere
● The ReLU second derivative is 0 (no second-order effects)
○ Easy to compute (zero thresholding)
○ Favors sparsity
DAVIDE BACCIU - ISPR COURSE 61
AlexNet - Parameters
○ 62.3 million parameters (6% in convolutions)
○ 5-6 days to train on two GTX 580 GPUs (95% of the time in convolutions)
DAVIDE BACCIU - ISPR COURSE 62
VGGNet – VGG16 (2014)
ImageNet Top-5 : 7.3%
○ Standardized convolutional layer
● 3x3 convolutions with stride 1
● 2x2 max pooling with stride 2 (not after every convolution)
○ Various configurations analysed, but the best has
● 13 Convolutional + 3 Fully Connected layers (16 weight layers in total)
● About 140 million parameters (85% in FC)
DAVIDE BACCIU - ISPR COURSE 63
GoogLeNet (2015)
ImageNet Top-5 : 6.7%
Inception Module
• Kernels of different size to capture details at varied scale
• Aggregated before sending to the next layer
• Average pooling
• No fully connected layers
Why 1x1 convolutions? (see next slide)
DAVIDE BACCIU - ISPR COURSE 64
1x1 Convolutions are Helpful
Take 5 kernels of size 1x1x64: a 56x56x64 feature map becomes 56x56x5
By placing 1x1 convolutions before larger kernels in the Inception module, the
number of input channels is reduced, saving computations and parameters
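A minimal Keras sketch of this channel-reduction effect, with the sizes from the slide:

from keras.models import Sequential
from keras.layers import Conv2D

model = Sequential()
model.add(Conv2D(5, kernel_size=(1, 1), input_shape=(56, 56, 64)))
print(model.output_shape)      # (None, 56, 56, 5)
print(model.count_params())    # 325 = 1*1*64*5 weights + 5 biases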
DAVIDE BACCIU - ISPR COURSE 65
Back on GoogLeNet
Auxiliary outputs to inject gradients at deeper layers
○ Only 5 million parameters
○ 12X fewer parameters than AlexNet
○ Followed by v2, v3 and v4 of the Inception module
● More filter factorization
● Introduced heavy use of Batch Normalization
DAVIDE BACCIU - ISPR COURSE 66
Batch Normalization
○ Very deep neural networks are subject to internal covariate shift
● The distribution of inputs to a layer N might vary (shift) with different minibatches (due to adjustments of layer N-1)
● Layer N can get confused by this
● Solution is to normalize for mean and variance in each minibatch (a bit more articulated than this, actually)
Normalization over the 𝑁𝑏 elements of the minibatch:
𝜇𝑏 = (1/𝑁𝑏) Σ𝑖=1..𝑁𝑏 𝑥𝑖
𝜎𝑏² = (1/𝑁𝑏) Σ𝑖=1..𝑁𝑏 (𝑥𝑖 − 𝜇𝑏)²
𝑥̂𝑖 = (𝑥𝑖 − 𝜇𝑏) / √(𝜎𝑏² + 𝜖)
Scale and shift: 𝑦 = 𝛾𝑥̂𝑖 + 𝛽
Trainable linear transform potentially allowing to cancel unwanted zero-centering effects (e.g. sigmoid)
Need to backpropagate through this!
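A minimal numpy sketch of the forward pass (per-feature statistics over the minibatch; names are illustrative):

import numpy as np

def batch_norm_forward(x, gamma, beta, eps=1e-5):
    # x: (N_b, features) minibatch
    mu = x.mean(axis=0)                    # per-feature minibatch mean
    var = x.var(axis=0)                    # per-feature minibatch variance
    x_hat = (x - mu) / np.sqrt(var + eps)  # normalization
    return gamma * x_hat + beta            # trainable scale and shift

x = np.random.randn(64, 100) * 3 + 5
y = batch_norm_forward(x, gamma=np.ones(100), beta=np.zeros(100))
print(y.mean(), y.std())                   # close to 0 and 1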
DAVIDE BACCIU - ISPR COURSE 67
ResNet (2015)
The beginning of the Ultra-Deep Network Era (152 layers). ImageNet Top-5 : 3.57%
Why wasn’t this working
before?
Gradient vanishes when backpropagating too deep!
DAVIDE BACCIU - ISPR COURSE 68
ResNet Trick
Residual block: 𝑋 → 3x3 convolution → ReLU → 3x3 convolution → 𝐹(𝑋), output is 𝐹(𝑋) + 𝑋 followed by a ReLU
The input to the block 𝑋 bypasses the convolutions and is then combined with its residual 𝐹(𝑋) resulting from the convolutions
When backpropagating, the gradient flows in full through these bypass connections
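A minimal Keras functional-API sketch of such a block (identity shortcut; the channel count is illustrative):

from keras.models import Model
from keras.layers import Input, Conv2D, Activation, Add

x_in = Input(shape=(56, 56, 64))
f = Conv2D(64, (3, 3), padding='same')(x_in)
f = Activation('relu')(f)
f = Conv2D(64, (3, 3), padding='same')(f)   # F(X): the residual branch
out = Activation('relu')(Add()([f, x_in]))  # F(X) + X, then ReLU
block = Model(x_in, out)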
DAVIDE BACCIU - ISPR COURSE 69
ResNet & Batch Norm
When connecting several Residual Blocks in series, one needs to be careful about amplification/compounding of variance due to the residual connectivity
• Batch norm can alleviate this effect
DAVIDE BACCIU - ISPR COURSE 70
MobileNets
Making CNNs efficient to run on mobile devices by depthwise separable convolutions
Basically, run channel-independent convolutions followed by 1x1 convolutions for cross-channel mixing
arxiv.org/pdf/1704.04861.pdf
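A minimal Keras sketch of one depthwise separable block (keras.layers.SeparableConv2D fuses the two steps; sizes are illustrative):

from keras.models import Sequential
from keras.layers import DepthwiseConv2D, Conv2D

model = Sequential()
# 3x3 convolution applied independently to each input channel
model.add(DepthwiseConv2D((3, 3), padding='same', input_shape=(112, 112, 32)))
# 1x1 convolution mixing information across channels
model.add(Conv2D(64, (1, 1)))
print(model.output_shape)   # (None, 112, 112, 64)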
DAVIDE BACCIU - ISPR COURSE 71
CNN Architecture Evolution
DAVIDE BACCIU - ISPR COURSE 72
Transfer learning
Use (part of) a model trained (pretrained) by someone on a large dataset as a "feature extractor" on problems with fewer data, fine-tuning only the predictor part
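A minimal Keras sketch of this idea, using a pretrained VGG16 backbone as the frozen feature extractor (the backbone choice and layer sizes are illustrative):

from keras.applications import VGG16
from keras.models import Sequential
from keras.layers import Flatten, Dense

backbone = VGG16(weights='imagenet', include_top=False, input_shape=(224, 224, 3))
backbone.trainable = False                  # freeze the pretrained convolutional part

model = Sequential()
model.add(backbone)
model.add(Flatten())
model.add(Dense(256, activation='relu'))    # new predictor part, trained on the small dataset
model.add(Dense(10, activation='softmax'))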
Understanding CNN Embedding
tSNE projection of AlexNet last
hidden dense layer
https://cs.stanford.edu/people/karpathy/cnnembed/
DAVIDE BACCIU - ISPR COURSE 74
Interpreting Intermediate Levels
○ What about the information captured in convolutional layers?
○ Visualize kernel weights (filters)
● Naïve approach
● Works only for early convolutional layers
○ Map the activation of the convolutional kernel back in pixel space
● Requires reversing the convolution
● Deconvolution
Zeiler&Fergus, Visualizing and Understanding Convolutional Networks, ICML 2013
DAVIDE BACCIU - ISPR COURSE 75
Deconvolutional Network (DeConvNet)
○ Attach a DeConvNet to a target layer
○ Plug an input and forward propagate activations until layer
○ Zero all activations except those of the target neuron
○ Backpropagate on the DeConvNet and see what parts of the reconstructed
image are affected
DAVIDE BACCIU - ISPR COURSE 76
Inspect Deconvolution Layers
Deconv 14x14 Pooling Deconv 28x28 ….
DAVIDE BACCIU - ISPR COURSE 77
Filters & Patches – Layer 1
Reconstructed filters in pixel space
Corresponding top-9 image patches
Zeiler&Fergus, Visualizing and Understanding Convolutional Networks, ICML 2013
DAVIDE BACCIU - ISPR COURSE 78
Filters & Patches – Layer 2
Zeiler&Fergus, Visualizing and Understanding Convolutional Networks, ICML 2013
DAVIDE BACCIU - ISPR COURSE 79
Filters & Patches – Layer 3
Zeiler&Fergus, Visualizing and Understanding Convolutional Networks, ICML 2013
DAVIDE BACCIU - ISPR COURSE 80
Filters & Patches – Layer 4
Zeiler&Fergus, Visualizing and Understanding Convolutional Networks, ICML 2013
DAVIDE BACCIU - ISPR COURSE 81
Filters & Patches – Layer 5
Zeiler&Fergus, Visualizing and Understanding
Convolutional Networks, ICML 2013
DAVIDE BACCIU - ISPR COURSE 82
Occlusions
o Measure what happens to feature maps and object classification if we
occlude part of the image
o Slide a grey mask on the image and project back the response of the best
filters using deconvolution
DAVIDE BACCIU - ISPR COURSE 83
Occlusions
Zeiler&Fergus, Visualizing and Understanding Convolutional Networks, ICML 2013
DAVIDE BACCIU - ISPR COURSE 84
Dense CNN
Transition layers: batch normalization + 1×1 convolution + 2×2 average pooling
Dense block layers: batch normalization + ReLU + 3x3 convolution
o Gradient flows well in bypass connections
o Each layer in the dense block has access to
all information from previous layers
Huang et al, Densely Connected Convolutional Networks, CVPR 2017
DAVIDE BACCIU - ISPR COURSE 85
Causal Convolutions
Preventing a convolution from allowing to see into the future…
time
The problem is that the context size grows slowly with depth
DAVIDE BACCIU - ISPR COURSE 86
Causal & Dilated Convolutions
(𝐼 ∗ 𝐾)(𝑖, 𝑗) = Σ𝑚 Σ𝑛 𝐼(𝑖 − 𝑙𝑚, 𝑗 − 𝑙𝑛) 𝐾(𝑚, 𝑛), with dilation factor 𝑙
Similar to striding, but the map size is preserved
Oord et al, WaveNet: A Generative Model for Raw Audio, ICLR 2016
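A minimal Keras sketch of a stack of causal, dilated 1D convolutions in this style (filter counts and dilation rates are illustrative):

from keras.models import Sequential
from keras.layers import Conv1D

model = Sequential()
# padding='causal' masks the future; doubling the dilation grows the context exponentially
model.add(Conv1D(32, 2, dilation_rate=1, padding='causal', input_shape=(1000, 1)))
model.add(Conv1D(32, 2, dilation_rate=2, padding='causal'))
model.add(Conv1D(32, 2, dilation_rate=4, padding='causal'))
print(model.output_shape)   # (None, 1000, 32): the sequence length is preserved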
DAVIDE BACCIU - ISPR COURSE 87
Semantic Segmentation
A traditional CNN cannot be used directly for this task due to the downsampling introduced by the striding and pooling operations
DAVIDE BACCIU - ISPR COURSE 89
Fully Convolutional Networks (FCN)
Convolutional part to extract interesting features at various scales
Fuse information from feature maps of different scale
Learn an upsampling function of the fused map to generate the semantic segmentation map
Shelhamer et at, Fully Convolutional Networks for Semantic Segmentation, PAMI 2016
DAVIDE BACCIU - ISPR COURSE 90
Deconvolution Architecture
Maxpooling indices transferred to decoder to improve the segmentation
resolution.
Badrinarayanan et al, SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation, PAMI 2017
DAVIDE BACCIU - ISPR COURSE 91
SegNet Segmentation
Demo here: http://mi.eng.cam.ac.uk/projects/segnet/
DAVIDE BACCIU - ISPR COURSE 92
U-Nets (Big on Biomedical Images)
Few convolutional layers at different resolutions, with pooling layers in between
Low-level information transfer by concatenation of early feature maps
High-level visual features upsampled by upconvolution (deconvolution)
Pixel mask in output (a bit smaller than the original image)
Use Dilated Convolutions
Always perform 3x3 convolutions with no pooling at each level (Level 1, Level 2, Level 3)
Context increases without
o Pooling (which changes the map size)
o Increasing computational complexity
Yu et al, Multi-Scale Context Aggregation by Dilated Convolutions, ICLR 2016
DAVIDE BACCIU - ISPR COURSE 94
Segmentation by Dilated CNN
(Panels: Dilated CNN output vs. ground truth, GT)
Yu et al, Multi-Scale Context Aggregation by Dilated Convolutions, ICLR 2016
DAVIDE BACCIU - ISPR COURSE 95
Object Detection
Object Detection: Faster R-CNN
Any CNN of your choice that can produce a feature map
Generate bounding box proposals
• x,y position
• size
• confidence
Crop, fuse and polish bounding box proposals
Source: S. Yeung, BIODS 220
Software
○ CNNs are supported by any deep learning framework (Keras-TF, Pytorch, MS Cognitive TK, Intel OpenVino, …)
○ Caffe was one of the initiators and was basically built around CNNs
● Introduced protobuffer network specification
● ModelZoo of pretrained models (LeNet, AlexNet, …)
● Support for GPU
● Project converged into PyTorch now
DAVIDE BACCIU - ISPR COURSE 98
Caffe Protobuffer
name: "LeNet"
layer {
name: "data"
type: "Input"
…
input_param { shape: { dim: 64 dim: 1 dim: 28 dim: 28 } }
}
layer {
name: "conv1"
type: "Convolution"
bottom: "data"
…
convolution_param {
num_output: 20
kernel_size: 5
stride: 1
weight_filler {
type: "xavier"
}
DAVIDE BACCIU - ISPR COURSE 99
Other Software
○ Matlab distributes its Neural Network Toolbox which allows
importing pretrained models from Keras-TF
○ Want to have a CNN in your browser?
● Try ConvNetJS (https://cs.stanford.edu/people/karpathy/convnetjs/)
DAVIDE BACCIU - ISPR COURSE 100
GUIs
Major hardware producers have GUIs and toolkits wrapping Caffe and Keras-TF to play with CNNs
• Intel OpenVino
• NVIDIA Digits
• Barista
• Plus others…
DAVIDE BACCIU - ISPR COURSE 101
Take Home Messages
o Key things
• Convolutions in place of dense multiplications allow sparse connectivity and weight sharing
• Pooling enforces invariance and allows changing resolution, but shrinks data size
• Full connectivity compresses information from all convolutions, but accounts for 90% of model complexity
o Lessons learned
• ReLU is efficient and counteracts vanishing gradients
• 1x1 convolutions are useful
• Need batch normalization
• Bypass connections allow going deeper
o Dilated (à trous) convolutions
o You can use CNN outside of machine vision
DAVIDE BACCIU - ISPR COURSE 102
Next Lecture
Gated Recurrent Networks
○ Learning with sequential data PART I
○ Gradient issues
○ Gated RNN
● Long Short-Term Memories (LSTM)
● Gated Recurrent Units (GRU)
○ Advanced topics PART II
● Understanding and exploiting memory encoding
● Applications
DAVIDE BACCIU - ISPR COURSE 103