0% found this document useful (0 votes)

137 views9 pages

CNN Basics for Image Classification

Convolutional neural networks (CNNs) are a type of neural network used for analyzing visual imagery. CNNs process images through a series of convolutional and pooling layers that extract features, followed by fully connected layers that classify images. Key aspects of CNNs include convolutional layers that apply filters to extract features, pooling layers that reduce spatial size, rectified linear unit activations, and multiple convolutional blocks followed by fully connected layers for classification.

Uploaded by

senthil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

137 views9 pages

CNN Basics for Image Classification

Uploaded by

senthil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

11/27/2020 Understanding of Convolutional Neural Network (CNN) — Deep Learning | by Prabhu | Medium

Get started Open in app

Prabhu
1.3K Followers · About Follow

Understanding of Convolutional Neural

Network (CNN) — Deep Learning
Prabhu Mar 4, 2018 · 5 min read

In neural networks, Convolutional neural network (ConvNets or CNNs) is one of the

main categories to do images recognition, images classifications. Objects detections,
recognition faces etc., are some of the areas where CNNs are widely used.

CNN image classifications takes an input image, process it and classify it under certain
categories (Eg., Dog, Cat, Tiger, Lion). Computers sees an input image as array of pixels
and it depends on the image resolution. Based on the image resolution, it will see h x w x
d( h = Height, w = Width, d = Dimension ). Eg., An image of 6 x 6 x 3 array of matrix of
RGB (3 refers to RGB values) and an image of 4 x 4 x 1 array of matrix of grayscale
image.

[Link] 1/9
11/27/2020 Understanding of Convolutional Neural Network (CNN) — Deep Learning | by Prabhu | Medium

Figure 1 : Array of RGB Matrix

Technically, deep learning CNN models to train and test, each input image will pass it
through a series of convolution layers with filters (Kernals), Pooling, fully connected
layers (FC) and apply Softmax function to classify an object with probabilistic values
between 0 and 1. The below figure is a complete flow of CNN to process an input image
and classifies the objects based on values.

[Link] 2/9
11/27/2020 Understanding of Convolutional Neural Network (CNN) — Deep Learning | by Prabhu | Medium

Figure 2 : Neural network with many convolutional layers

Convolution Layer

Convolution is the first layer to extract features from an input image. Convolution
preserves the relationship between pixels by learning image features using small squares
of input data. It is a mathematical operation that takes two inputs such as image matrix
and a filter or kernel.

Figure 3: Image matrix multiplies kernel or filter matrix

Consider a 5 x 5 whose image pixel values are 0, 1 and filter matrix 3 x 3 as shown in
below

Figure 4: Image matrix multiplies kernel or filter matrix

Then the convolution of 5 x 5 image matrix multiplies with 3 x 3 filter matrix which is
called “Feature Map” as output shown in below
[Link] 3/9
11/27/2020 Understanding of Convolutional Neural Network (CNN) — Deep Learning | by Prabhu | Medium

Figure 5: 3 3 Output matrix

Convolution of an image with different filters can perform operations such as edge
detection, blur and sharpen by applying filters. The below example shows various
convolution image after applying different types of filters (Kernels).

Figure 7 : Some common filters

[Link] 4/9
11/27/2020 Understanding of Convolutional Neural Network (CNN) — Deep Learning | by Prabhu | Medium

Strides

Stride is the number of pixels shifts over the input matrix. When the stride is 1 then we
move the filters to 1 pixel at a time. When the stride is 2 then we move the filters to 2
pixels at a time and so on. The below figure shows convolution would work with a stride
of 2.

Figure 6 : Stride of 2 pixels

Padding

Sometimes filter does not fit perfectly fit the input image. We have two options:

Pad the picture with zeros (zero-padding) so that it fits

Drop the part of the image where the filter did not fit. This is called valid padding
which keeps only valid part of the image.

Non Linearity (ReLU)

ReLU stands for Rectified Linear Unit for a non-linear operation. The output is ƒ(x) =
max(0,x).

Why ReLU is important : ReLU’s purpose is to introduce non-linearity in our ConvNet.

Since, the real world data would want our ConvNet to learn would be non-negative
linear values.
[Link] 5/9
11/27/2020 Understanding of Convolutional Neural Network (CNN) — Deep Learning | by Prabhu | Medium

Figure 7 : ReLU operation

There are other non linear functions such as tanh or sigmoid that can also be used
instead of ReLU. Most of the data scientists use ReLU since performance wise ReLU is
better than the other two.

Pooling Layer

Pooling layers section would reduce the number of parameters when the images are too
large. Spatial pooling also called subsampling or downsampling which reduces the
dimensionality of each map but retains important information. Spatial pooling can be of
different types:

Max Pooling

Average Pooling

Sum Pooling

Max pooling takes the largest element from the rectified feature map. Taking the largest
element could also take the average pooling. Sum of all elements in the feature map call
as sum pooling.

[Link] 6/9
11/27/2020 Understanding of Convolutional Neural Network (CNN) — Deep Learning | by Prabhu | Medium

Figure 8 : Max Pooling

Fully Connected Layer

The layer we call as FC layer, we flattened our matrix into vector and feed it into a fully
connected layer like a neural network.

Figure 9 : After pooling layer, flattened as FC layer

In the above diagram, the feature map matrix will be converted as vector (x1, x2, x3,
…). With the fully connected layers, we combined these features together to create a
model. Finally, we have an activation function such as softmax or sigmoid to classify the
outputs as cat, dog, car, truck etc.,

[Link] 7/9
11/27/2020 Understanding of Convolutional Neural Network (CNN) — Deep Learning | by Prabhu | Medium

Figure 10 : Complete CNN architecture

Summary

Provide input image into convolution layer

Choose parameters, apply filters with strides, padding if requires. Perform

convolution on the image and apply ReLU activation to the matrix.

Perform pooling to reduce dimensionality size

Add as many convolutional layers until satisfied

Flatten the output and feed into a fully connected layer (FC Layer)

Output the class using an activation function (Logistic Regression with cost
functions) and classifies images.

In the next post, I would like to talk about some popular CNN architectures such as
AlexNet, VGGNet, GoogLeNet, and ResNet.

References :

[Link]

[Link]
Understanding-Convolutional-Neural-Networks/

[Link]

Machine Learning Cnn Convolution Neural Net Image Recognition Neural Networks

[Link] 8/9
11/27/2020 Understanding of Convolutional Neural Network (CNN) — Deep Learning | by Prabhu | Medium

About Help Legal

Get the Medium app

[Link] 9/9

Understanding CNNs in Deep Learning
No ratings yet
Understanding CNNs in Deep Learning
8 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning
9 pages
CNN
No ratings yet
CNN
10 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning - by Prabhu Raghav - Medium
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning - by Prabhu Raghav - Medium
10 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning
7 pages
CNN Convolution Explained
No ratings yet
CNN Convolution Explained
12 pages
CNNs: Deep Learning for Image Tasks
No ratings yet
CNNs: Deep Learning for Image Tasks
27 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
47 pages
Deep Learning & CNN Fundamentals
No ratings yet
Deep Learning & CNN Fundamentals
56 pages
Scan 30 Sep 23 18 20 44
No ratings yet
Scan 30 Sep 23 18 20 44
30 pages
Understanding Convolutional Neural Networks (CNNS) in Depth - by Koushik - Medium
No ratings yet
Understanding Convolutional Neural Networks (CNNS) in Depth - by Koushik - Medium
30 pages
CNNs for AI and Machine Learning
No ratings yet
CNNs for AI and Machine Learning
16 pages
Introduction To Convolution Neural Network - GeeksforGeeks
No ratings yet
Introduction To Convolution Neural Network - GeeksforGeeks
24 pages
E-Note 33951 Content Document 20250328020322PM
No ratings yet
E-Note 33951 Content Document 20250328020322PM
29 pages
Unit Iii Deep Learning
No ratings yet
Unit Iii Deep Learning
31 pages
CNN Basics and Architecture Guide
No ratings yet
CNN Basics and Architecture Guide
16 pages
An Introduction To Convolutional Neural Networks - A Comprehensive Guide To CNNs in Deep Learning - DataCamp
No ratings yet
An Introduction To Convolutional Neural Networks - A Comprehensive Guide To CNNs in Deep Learning - DataCamp
14 pages
Unit 5 CNN
No ratings yet
Unit 5 CNN
151 pages
DEEP LEARNING Unit-2 NOTES For Post Graduation
No ratings yet
DEEP LEARNING Unit-2 NOTES For Post Graduation
11 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
11 pages
CNN and Applications
No ratings yet
CNN and Applications
22 pages
DL Unit 3 2019PAT
No ratings yet
DL Unit 3 2019PAT
66 pages
CNN Notes Unit 3 Notes
No ratings yet
CNN Notes Unit 3 Notes
17 pages
Image Processing Deep Dive
No ratings yet
Image Processing Deep Dive
4 pages
Module 3 Notes
No ratings yet
Module 3 Notes
22 pages
DL Unit-3
No ratings yet
DL Unit-3
70 pages
Understanding CNN Architecture and Applications
No ratings yet
Understanding CNN Architecture and Applications
69 pages
CPCS432 Lecture 5 Deep Learning and Artificial Neural Networks Techniques in Computer Vision
No ratings yet
CPCS432 Lecture 5 Deep Learning and Artificial Neural Networks Techniques in Computer Vision
57 pages
DL Unit4
No ratings yet
DL Unit4
31 pages
Lecture 6
No ratings yet
Lecture 6
17 pages
Module5 ML
No ratings yet
Module5 ML
112 pages
Day8 (CNN)
No ratings yet
Day8 (CNN)
35 pages
CNNs for ECE Students
No ratings yet
CNNs for ECE Students
60 pages
DL Unit-Ii
No ratings yet
DL Unit-Ii
34 pages
Lec5 CNN RNN Attention
No ratings yet
Lec5 CNN RNN Attention
71 pages
CNN Intro
No ratings yet
CNN Intro
21 pages
Unit - 4 DL
No ratings yet
Unit - 4 DL
19 pages
Chap 2 DL
No ratings yet
Chap 2 DL
88 pages
Intro To CNN
No ratings yet
Intro To CNN
93 pages
CNNs for Image Recognition
No ratings yet
CNNs for Image Recognition
16 pages
Introduction To Convolutional Neural Networks (CNNS)
No ratings yet
Introduction To Convolutional Neural Networks (CNNS)
28 pages
Week8 WEB
No ratings yet
Week8 WEB
54 pages
Convolutional Neural Networks: CMSC 733 Fall 2015 Angjoo Kanazawa
No ratings yet
Convolutional Neural Networks: CMSC 733 Fall 2015 Angjoo Kanazawa
55 pages
03 - CNN
No ratings yet
03 - CNN
10 pages
07 Ais302 CNN
No ratings yet
07 Ais302 CNN
56 pages
CNN
No ratings yet
CNN
35 pages
Understanding Convolutional Neural Networks
No ratings yet
Understanding Convolutional Neural Networks
23 pages
CNN 01
No ratings yet
CNN 01
79 pages
WINSEM2024-25 BMEE407L TH VL2024250503563 2025-03-28 Reference-Material-I
No ratings yet
WINSEM2024-25 BMEE407L TH VL2024250503563 2025-03-28 Reference-Material-I
36 pages
Lecture 2 CNN
No ratings yet
Lecture 2 CNN
105 pages
CNN Basics with TensorFlow Explained
No ratings yet
CNN Basics with TensorFlow Explained
17 pages
Some Important Question
No ratings yet
Some Important Question
59 pages
Deep Learning: Seungsang Oh
No ratings yet
Deep Learning: Seungsang Oh
39 pages
Convolutional Neural Networks (CNNS) Introduction, Convolution Operation, Pooling Layers, Padding, Hyper Parameter Tuning
No ratings yet
Convolutional Neural Networks (CNNS) Introduction, Convolution Operation, Pooling Layers, Padding, Hyper Parameter Tuning
51 pages
DL Unit2
No ratings yet
DL Unit2
25 pages
DLCV Ch2 Neural Network
No ratings yet
DLCV Ch2 Neural Network
68 pages
QB 4
No ratings yet
QB 4
5 pages
DE Notes 1
No ratings yet
DE Notes 1
20 pages
Lecture 2 Fundamental Steps in Digital Image Processing
No ratings yet
Lecture 2 Fundamental Steps in Digital Image Processing
4 pages
Multilevel Gate Networks Guide
No ratings yet
Multilevel Gate Networks Guide
19 pages
Fahim 2020
No ratings yet
Fahim 2020
5 pages
Computer Aided Engineering Drawing (Caed) : Mixing of Questions From Different Modules)
No ratings yet
Computer Aided Engineering Drawing (Caed) : Mixing of Questions From Different Modules)
1 page
Part A: Multiple Choice: Answer With The Best Choice. Make Sure That You Clearly Circle The
No ratings yet
Part A: Multiple Choice: Answer With The Best Choice. Make Sure That You Clearly Circle The
8 pages
NITK Software Engineering Lab Exam Schedule
No ratings yet
NITK Software Engineering Lab Exam Schedule
5 pages
Nerve Impulse Elicitation and Inhibition
No ratings yet
Nerve Impulse Elicitation and Inhibition
3 pages
Introduction To Reading Skills
No ratings yet
Introduction To Reading Skills
52 pages
Grade 4 Arts Lesson: Colors of Nature
No ratings yet
Grade 4 Arts Lesson: Colors of Nature
8 pages
Mechanics+ +Waves+on+a+Stretched+String
No ratings yet
Mechanics+ +Waves+on+a+Stretched+String
4 pages
Mentor Diary.1
No ratings yet
Mentor Diary.1
16 pages
ENGL 1010 Course Reflection Insights
No ratings yet
ENGL 1010 Course Reflection Insights
2 pages
Grade 8 English Lesson Plan
No ratings yet
Grade 8 English Lesson Plan
6 pages
If
50% (2)
If
4 pages
Domestic Violence Lesson Plans
No ratings yet
Domestic Violence Lesson Plans
21 pages
MBA-HealthCare-AC 2
No ratings yet
MBA-HealthCare-AC 2
3 pages
Beyond The Code
No ratings yet
Beyond The Code
5 pages
Transformational Leadership Reflection
No ratings yet
Transformational Leadership Reflection
2 pages
Eb Ceg in The Line of Money
No ratings yet
Eb Ceg in The Line of Money
33 pages
Appian Developer Resume Guide
No ratings yet
Appian Developer Resume Guide
9 pages
Types of Information Systems
No ratings yet
Types of Information Systems
13 pages
Math 2 Q1
100% (3)
Math 2 Q1
7 pages
Masculinities 2nd Edition Raewyn W. Connell ebook complete literary edition
100% (1)
Masculinities 2nd Edition Raewyn W. Connell ebook complete literary edition
88 pages
Readiness Checklist For Feeding Implementation
No ratings yet
Readiness Checklist For Feeding Implementation
13 pages
Chemistry Project
No ratings yet
Chemistry Project
18 pages
Primary 2 English
100% (2)
Primary 2 English
4 pages
B2027 Bachelor of Business and Commerce and Bachelor of Digital Media and Communication
No ratings yet
B2027 Bachelor of Business and Commerce and Bachelor of Digital Media and Communication
1 page
Ieltsfever Academic Reading Practice Test 42 PDF
100% (2)
Ieltsfever Academic Reading Practice Test 42 PDF
17 pages
Giles & Coupland, 1991
No ratings yet
Giles & Coupland, 1991
5 pages
Handler - 2024 - Determinants of Llm-Assisted Decision-Making
No ratings yet
Handler - 2024 - Determinants of Llm-Assisted Decision-Making
45 pages
HW7
100% (3)
HW7
6 pages
INTRODUCTION
No ratings yet
INTRODUCTION
3 pages
Curriculum Implementation
88% (8)
Curriculum Implementation
11 pages

CNN Basics for Image Classification

Uploaded by

CNN Basics for Image Classification

Uploaded by

11/27/2020 Understanding of Convolutional Neural Network (CNN) — Deep Learning | by Prabhu | Medium

Get started Open in app

Understanding of Convolutional Neural

In neural networks, Convolutional neural network (ConvNets or CNNs) is one of the

Figure 1 : Array of RGB Matrix

Figure 2 : Neural network with many convolutional layers

Figure 3: Image matrix multiplies kernel or filter matrix

Figure 4: Image matrix multiplies kernel or filter matrix

Figure 5: 3 3 Output matrix

Figure 7 : Some common filters

Figure 6 : Stride of 2 pixels

Pad the picture with zeros (zero-padding) so that it fits

Non Linearity (ReLU)

Why ReLU is important : ReLU’s purpose is to introduce non-linearity in our ConvNet.

Figure 7 : ReLU operation

Figure 8 : Max Pooling

Fully Connected Layer

Figure 9 : After pooling layer, flattened as FC layer

Figure 10 : Complete CNN architecture

Provide input image into convolution layer

Choose parameters, apply filters with strides, padding if requires. Perform

Perform pooling to reduce dimensionality size

Add as many convolutional layers until satisfied

About Help Legal

Get the Medium app

You might also like