Linear Classification:
The Perceptron
These slides were assembled by Eric Eaton, with grateful acknowledgement of the many others who made
their course materials freely available online. Feel free to reuse or adapt these slides for your own academic
purposes, provided that you include proper attribution. Please send comments and corrections to Eric.
Linear Classifiers
• A hyperplane partitions $\mathbb{R}^d$ into two half-spaces
– Defined by the normal vector $\theta \in \mathbb{R}^d$
• $\theta$ is orthogonal to any vector lying on the hyperplane
– Assumed to pass through the origin
• This is because we incorporate the bias term $\theta_0$ into $\theta$ by setting $x_0 = 1$
• Consider classification with +1, -1 labels ...
Based on slide by Piyush Rai
Linear Classifiers
• Linear classifiers: represent decision boundary by hyperplane
$$\theta = \begin{bmatrix} \theta_0 \\ \theta_1 \\ \vdots \\ \theta_d \end{bmatrix} \qquad x^\top = \begin{bmatrix} 1 & x_1 & \cdots & x_d \end{bmatrix}$$

$$h(x) = \operatorname{sign}(\theta^\top x) \quad \text{where} \quad \operatorname{sign}(z) = \begin{cases} +1 & \text{if } z \ge 0 \\ -1 & \text{if } z < 0 \end{cases}$$

– Note that: $\theta^\top x > 0 \implies y = +1$ and $\theta^\top x < 0 \implies y = -1$
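To make this concrete, here is a minimal NumPy sketch of such a classifier (illustrative code, not part of the original slides); the constant feature $x_0 = 1$ is prepended so that $\theta_0$ acts as the bias:

```python
import numpy as np

def predict(theta, x):
    """Classify x as +1 or -1 using the hyperplane with normal vector theta."""
    # x is assumed to carry the constant feature x_0 = 1, so theta[0] is the bias
    return 1 if theta @ x >= 0 else -1

# Hypothetical example: theta encodes the decision boundary x_1 + x_2 - 1 = 0
theta = np.array([-1.0, 1.0, 1.0])                # [theta_0, theta_1, theta_2]
print(predict(theta, np.array([1.0, 2.0, 2.0])))  # +1: theta^T x = 3 >= 0
print(predict(theta, np.array([1.0, 0.0, 0.0])))  # -1: theta^T x = -1 < 0
```

Folding the bias into $\theta$ this way is what lets the hyperplane pass through the origin in the augmented $(d+1)$-dimensional space.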
The Perceptron
$$h(x) = \operatorname{sign}(\theta^\top x) \quad \text{where} \quad \operatorname{sign}(z) = \begin{cases} +1 & \text{if } z \ge 0 \\ -1 & \text{if } z < 0 \end{cases}$$

• The perceptron uses the following update rule each time it receives a new training instance $(x^{(i)}, y^{(i)})$:

$$\theta_j \leftarrow \theta_j - \frac{\alpha}{2} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right) x_j^{(i)}$$

where the term $h_\theta\!\left(x^{(i)}\right) - y^{(i)}$ is either 2 or -2
– If the prediction matches the label, make no change
– Otherwise, adjust θ
The Perceptron
• The perceptron uses the following update rule each time it receives a new training instance $(x^{(i)}, y^{(i)})$:

$$\theta_j \leftarrow \theta_j - \frac{\alpha}{2} \left( h_\theta\!\left(x^{(i)}\right) - y^{(i)} \right) x_j^{(i)}$$

where the term $h_\theta\!\left(x^{(i)}\right) - y^{(i)}$ is either 2 or -2

• Re-write as $\theta_j \leftarrow \theta_j + \alpha y^{(i)} x_j^{(i)}$ (only upon misclassification)
– Can eliminate α in this case, since its only effect is to scale θ by a constant, which doesn't affect performance

Perceptron Rule: If $x^{(i)}$ is misclassified, do $\theta \leftarrow \theta + y^{(i)} x^{(i)}$
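A minimal sketch of this per-example rule in NumPy (illustrative, not from the slides; the bias is assumed folded into x as $x_0 = 1$):

```python
import numpy as np

def perceptron_update(theta, x, y):
    """One perceptron step for a single example (x, y), with y in {+1, -1}."""
    h = 1 if theta @ x >= 0 else -1    # current prediction, sign(theta^T x)
    if h != y:                         # misclassified: theta <- theta + y * x
        theta = theta + y * x
    return theta

theta = np.zeros(3)
theta = perceptron_update(theta, np.array([1.0, 2.0, -1.0]), -1)
print(theta)  # [-1. -2.  1.] -- the mistake moved theta by y * x
```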
Why the Perceptron Update Works
[Figure: a misclassified positive example x; the update rotates $\theta_{\text{old}}$ toward x, giving $\theta_{\text{new}}$]
Based on slide by Piyush Rai
Why the Perceptron Update Works
• Consider the misclassified example (y = +1)
– Perceptron wrongly thinks that $\theta_{\text{old}}^\top x < 0$
• Update: $\theta_{\text{new}} = \theta_{\text{old}} + yx = \theta_{\text{old}} + x$ (since $y = +1$)
• Note that

$$\theta_{\text{new}}^\top x = (\theta_{\text{old}} + x)^\top x = \underbrace{\theta_{\text{old}}^\top x}_{<\,0} + \underbrace{x^\top x}_{\|x\|_2^2 \,>\, 0}$$

• Therefore, $\theta_{\text{new}}^\top x$ is less negative than $\theta_{\text{old}}^\top x$
– So, we are making ourselves more correct on this example!
Based on slide by Piyush Rai
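A quick numeric check of this argument (made-up numbers, for illustration only):

```python
import numpy as np

x = np.array([1.0, 2.0, 0.5])            # a positive example (y = +1)
theta_old = np.array([-1.0, -0.5, 0.0])  # currently classifies x as negative

theta_new = theta_old + x                # the perceptron update for y = +1
print(theta_old @ x)  # -2.0  (wrong side of the hyperplane)
print(theta_new @ x)  #  3.25 (= -2.0 + ||x||^2 = -2.0 + 5.25)
```

Here a single update happens to fix the example outright; in general the score is only guaranteed to become less negative, by exactly $\|x\|_2^2$.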
The Perceptron Cost Function
• Prediction is correct if $y^{(i)} \theta^\top x^{(i)} > 0$
• Could have used 0/1 loss
$$J_{0/1}(\theta) = \frac{1}{n} \sum_{i=1}^{n} \ell\left(\operatorname{sign}\left(\theta^\top x^{(i)}\right),\, y^{(i)}\right)$$

where $\ell(\cdot)$ is 0 if the prediction is correct, 1 otherwise
– Doesn't produce a useful gradient
Based on slide by Alan Fern
The Perceptron Cost Function
• The perceptron uses the following cost function
$$J_p(\theta) = \frac{1}{n} \sum_{i=1}^{n} \max\left(0,\, -y^{(i)}\, \theta^\top x^{(i)}\right)$$

– $\max(0, -y^{(i)} \theta^\top x^{(i)})$ is 0 if the prediction is correct
– Otherwise, it is the confidence in the misprediction
– Nice gradient
Based on slide by Alan Fern
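For concreteness, an illustrative NumPy sketch (not from the slides) of $J_p$ and one valid subgradient; the per-example update rule from earlier is stochastic gradient descent on this cost:

```python
import numpy as np

def perceptron_loss(theta, X, y):
    """J_p(theta) = (1/n) * sum_i max(0, -y_i * theta^T x_i)."""
    margins = y * (X @ theta)              # y_i * theta^T x_i, one entry per example
    return np.mean(np.maximum(0.0, -margins))

def perceptron_subgrad(theta, X, y):
    """One valid subgradient of J_p: each misclassified example contributes -y_i x_i / n."""
    margins = y * (X @ theta)
    mistakes = margins <= 0                # treat the boundary as a mistake, like the update rule
    return -(y[mistakes, None] * X[mistakes]).sum(axis=0) / len(y)
```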
Online Perceptron Algorithm
1.) Let $\theta \leftarrow [0, 0, \ldots, 0]$
2.) Repeat:
3.)   Receive training example $(x^{(i)}, y^{(i)})$
4.)   if $y^{(i)}\, \theta^\top x^{(i)} \le 0$   // prediction is incorrect
5.)     $\theta \leftarrow \theta + y^{(i)} x^{(i)}$

Online learning – the learning mode where the model update is performed each time a single observation is received
Batch learning – the learning mode where the model update is performed after observing the entire training set
Based on slide by Alan Fern
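A runnable Python version of this loop (a sketch under the slides' conventions; the `stream` iterable is a hypothetical stand-in for arriving observations, and x is assumed to include the constant feature $x_0 = 1$):

```python
import numpy as np

def online_perceptron(stream, d):
    """Online perceptron: update theta each time one observation (x, y) arrives."""
    theta = np.zeros(d)                  # 1.) theta <- [0, 0, ..., 0]
    for x, y in stream:                  # 2.-3.) receive training example (x, y)
        if y * (theta @ x) <= 0:         # 4.) prediction is incorrect
            theta = theta + y * x        # 5.) theta <- theta + y * x
    return theta

# Toy usage: x_0 = 1 is prepended; labels are +1 when x_1 > x_2
data = [(np.array([1.0, 2.0, 1.0]), +1),
        (np.array([1.0, 0.0, 1.0]), -1),
        (np.array([1.0, 3.0, 0.0]), +1)]
theta = online_perceptron(data, d=3)
```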
Online Perceptron Algorithm
[Figure: perceptron decision boundary evolving over updates; red points are labeled +, blue points are labeled -]
See the perceptron in action: [Link]/watch?v=vGwemZhPlsA
Based on slide by Alan Fern
Batch Perceptron
1.) Given training data $\{(x^{(i)}, y^{(i)})\}_{i=1}^{n}$
2.) Let $\theta \leftarrow [0, 0, \ldots, 0]$
3.) Repeat:
4.)   Let $\Delta \leftarrow [0, 0, \ldots, 0]$
5.)   for $i = 1 \ldots n$, do
6.)     if $y^{(i)}\, \theta^\top x^{(i)} \le 0$   // prediction for ith instance is incorrect
7.)       $\Delta \leftarrow \Delta + y^{(i)} x^{(i)}$
8.)   $\Delta \leftarrow \Delta / n$   // compute average update
9.)   $\theta \leftarrow \theta + \alpha \Delta$
10.) Until $\|\Delta\|_2 < \epsilon$

• Simplest case: α = 1 and don't normalize (skip step 8); this yields the fixed increment perceptron
• Guaranteed to find a separating hyperplane if one exists
Based on slide by Alan Fern
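A NumPy sketch of this batch procedure (illustrative; `max_iters` is a safeguard I added, not part of the slide's pseudocode):

```python
import numpy as np

def batch_perceptron(X, y, alpha=1.0, eps=1e-6, max_iters=1000):
    """Batch perceptron: average the updates over all mistakes each pass.

    X is (n, d) with the constant feature x_0 = 1 in column 0; y is in {+1, -1}^n.
    """
    n, d = X.shape
    theta = np.zeros(d)                          # 2.) theta <- [0, ..., 0]
    for _ in range(max_iters):
        delta = np.zeros(d)                      # 4.) Delta <- [0, ..., 0]
        for i in range(n):                       # 5.) for i = 1 ... n
            if y[i] * (theta @ X[i]) <= 0:       # 6.) ith prediction is incorrect
                delta += y[i] * X[i]             # 7.) Delta <- Delta + y^(i) x^(i)
        delta /= n                               # 8.) compute average update
        theta += alpha * delta                   # 9.) theta <- theta + alpha * Delta
        if np.linalg.norm(delta) < eps:          # 10.) until ||Delta||_2 < eps
            break
    return theta
```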
Improving the Perceptron
• The Perceptron produces many θ's during training
• The standard Perceptron simply uses the final θ at test time
– This may sometimes not be a good idea!
– Some other θ may be correct on 1,000 consecutive examples, but one mistake ruins it!
• Idea: Use a combination of multiple perceptrons
– (i.e., neural networks!)
• Idea: Use the intermediate θ's
– Voted Perceptron: vote on predictions of the intermediate θ's
– Averaged Perceptron: average the intermediate θ's
Based on slide by Piyush Rai
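A minimal sketch of the Averaged Perceptron idea (illustrative; the function name and epoch count are assumptions, not from the slides):

```python
import numpy as np

def averaged_perceptron(X, y, epochs=10):
    """Averaged perceptron: predict with the mean of all intermediate theta vectors."""
    n, d = X.shape
    theta = np.zeros(d)
    theta_sum = np.zeros(d)
    for _ in range(epochs):
        for i in range(n):
            if y[i] * (theta @ X[i]) <= 0:       # misclassified: standard update
                theta = theta + y[i] * X[i]
            theta_sum += theta                   # accumulate every intermediate theta
    return theta_sum / (epochs * n)              # average instead of the final theta
```

The Voted Perceptron instead records each intermediate θ along with how many examples it survived, and takes a weighted vote of their predictions at test time; averaging is the cheaper approximation of that idea.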