Machine Learning
Part 2
Supervised Learning
Supervised learning is the machine learning task of inferring a function from supervised (labeled) training data. (Wikipedia)
The training data consist of a set of training examples.
In supervised learning, each example is a pair consisting of an input object (typically a vector) and a desired output value.
A supervised learning algorithm analyzes the training data and produces an inferred function, which is called a classifier (if the output is discrete) or a regression function (if the output is continuous).
Learning a Class from Examples
Class C of a family car
Prediction: Is car x a family car?
Knowledge extraction: What do people expect from a family car?
Output:
Positive (+) and negative (−) examples
Input representation:
x1: price, x2: engine power
Training set X:
$$\mathcal{X} = \{x^t, r^t\}_{t=1}^{N}$$
$$r = \begin{cases} 1 & \text{if } x \text{ is positive} \\ 0 & \text{if } x \text{ is negative} \end{cases}$$
[Figure: training examples plotted in the (x1: price, x2: engine power) plane]
Class C:
$$C: (p_1 \le \text{price} \le p_2) \text{ AND } (e_1 \le \text{engine power} \le e_2)$$
Hypothesis class H:
$$h(x) = \begin{cases} 1 & \text{if } h \text{ says } x \text{ is positive} \\ 0 & \text{if } h \text{ says } x \text{ is negative} \end{cases}$$
Error of h on X:
$$E(h \mid \mathcal{X}) = \frac{1}{N} \sum_{t=1}^{N} \mathbb{1}\left(h(x^t) \ne r^t\right)$$
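A minimal Python sketch of this setup; the feature names, training points, and rectangle bounds below are illustrative assumptions, not values from the slides:

```python
import numpy as np

def h(x, p1, p2, e1, e2):
    """Rectangle hypothesis: 1 if x = (price, engine power) lies inside the box."""
    price, power = x
    return int(p1 <= price <= p2 and e1 <= power <= e2)

def empirical_error(X, r, p1, p2, e1, e2):
    """E(h|X): fraction of training examples that h misclassifies."""
    return np.mean([h(x, p1, p2, e1, e2) != rt for x, rt in zip(X, r)])

# Assumed training set: (price, engine power) pairs with labels r.
X = [(16000, 150), (22000, 180), (9000, 90), (40000, 300)]
r = [1, 1, 0, 0]
print(empirical_error(X, r, p1=12000, p2=30000, e1=120, e2=220))  # -> 0.0
```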
S, G, and the Version Space
Most specific hypothesis, S
Most general hypothesis, G
Any h ∈ H between S and G is consistent, and together these hypotheses make up the version space (Mitchell, 1997).
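As a rough illustration, S for the rectangle hypothesis class can be taken as the tightest axis-aligned box around the positive examples; a sketch with assumed data:

```python
# Most specific hypothesis S for the rectangle class: the tightest
# axis-aligned box that contains every positive example.
positives = [(16000, 150), (22000, 180), (18000, 160)]  # assumed (price, power)
prices = [p for p, _ in positives]
powers = [e for _, e in positives]
S = (min(prices), max(prices), min(powers), max(powers))
print(S)  # (16000, 22000, 150, 180); any h between S and G is consistent
```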
Margin
Choose the h with the largest margin, i.e., the distance between the boundary of h and the instances closest to it.
Steps in Supervised Learning
Determine the type of training examples. In the
case of handwriting analysis, for example, this
might be a single handwritten character, an
entire handwritten word, or an entire line of
handwriting.
Gather a training set. The training set needs to
be representative of the real-world use of the
function. Thus, a set of input objects is gathered
and corresponding outputs are also gathered,
either from human experts or from
measurements.
Determine the input feature representation of the
learned function. The accuracy of the learned
function depends strongly on how the input
object is represented. Typically, the input object
is transformed into a feature vector, which
contains a number of features that are
descriptive of the object. The number of features
should not be too large, because of the curse of dimensionality, but should contain enough information to accurately predict the output.
Determine the structure of the learned function
and corresponding learning algorithm. For
example, the engineer may choose to
use support vector machines or decision trees.
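For instance, a hedged scikit-learn sketch of this choice; the feature encoding and toy data are assumed for illustration:

```python
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Toy feature vectors: [price, engine_power] per car, with 0/1 labels.
X = [[16000, 150], [22000, 180], [9000, 90], [40000, 300]]
y = [1, 1, 0, 0]

# The engineer chooses the structure of the learned function here:
model = DecisionTreeClassifier(max_depth=2)   # or: SVC(kernel="rbf")
model.fit(X, y)
print(model.predict([[18000, 160]]))  # e.g. [1]
```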
Complete the design. Run the learning algorithm on the gathered training set. Some supervised learning algorithms require the user to determine certain control parameters. These parameters may be adjusted by optimizing performance on a subset (called a validation set) of the training set, or via cross-validation.
Evaluate the accuracy of the learned function.
After parameter adjustment and learning, the
performance of the resulting function should be
measured on a test set that is separate from the
training set.
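A minimal sketch of this tune-then-test loop, assuming scikit-learn and a synthetic dataset (all parameter values here are illustrative):

```python
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, n_features=4, random_state=0)

# Hold out a test set, then carve a validation set out of the training data.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)
X_tr, X_val, y_tr, y_val = train_test_split(X_train, y_train, test_size=0.33, random_state=0)

# Adjust a control parameter (max_depth) by validation-set performance...
best_depth, best_acc = None, -1.0
for depth in (1, 2, 3, 5, 8):
    m = DecisionTreeClassifier(max_depth=depth, random_state=0).fit(X_tr, y_tr)
    acc = accuracy_score(y_val, m.predict(X_val))
    if acc > best_acc:
        best_depth, best_acc = depth, acc

# ...then measure accuracy once on the untouched test set.
final = DecisionTreeClassifier(max_depth=best_depth, random_state=0).fit(X_train, y_train)
print(accuracy_score(y_test, final.predict(X_test)))
```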
Facts in Supervised Learning
A wide range of supervised learning algorithms is
available, each with its strengths and
weaknesses.
There is no single learning algorithm that works
best on all supervised learning problems (No free
lunch theorem).
Noise and Model Complexity
Given comparable empirical error, use the simpler model because it:
is simpler to use (lower computational complexity),
is easier to train (lower space complexity),
is easier to explain (more interpretable), and
generalizes better (lower variance - Occam's razor: simpler explanations are more plausible, and any unnecessary complexity should be shaved off).
Multiple Classes, C_i, i = 1, ..., K
$$\mathcal{X} = \{x^t, r^t\}_{t=1}^{N}$$
$$r_i^t = \begin{cases} 1 & \text{if } x^t \in C_i \\ 0 & \text{if } x^t \in C_j,\ j \ne i \end{cases}$$
Train K hypotheses h_i(x), i = 1, ..., K:
$$h_i(x^t) = \begin{cases} 1 & \text{if } x^t \in C_i \\ 0 & \text{if } x^t \in C_j,\ j \ne i \end{cases}$$
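A hedged sketch of this one-hypothesis-per-class (one-vs-rest) scheme, using scikit-learn and synthetic data; the model choice and parameters are illustrative assumptions:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Synthetic K=3 class problem.
X, y = make_classification(n_samples=300, n_features=5, n_informative=3,
                           n_classes=3, random_state=0)
K = 3

# Train one hypothesis h_i per class: r_i = 1 for class i, 0 otherwise.
hypotheses = []
for i in range(K):
    r_i = (y == i).astype(int)
    hypotheses.append(LogisticRegression(max_iter=1000).fit(X, r_i))

# Predict by taking the class whose hypothesis is most confident.
scores = np.column_stack([h.predict_proba(X)[:, 1] for h in hypotheses])
y_pred = scores.argmax(axis=1)
print((y_pred == y).mean())  # training accuracy of the one-vs-rest ensemble
```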
Regression
$$\mathcal{X} = \{x^t, r^t\}_{t=1}^{N}, \quad r^t \in \mathbb{R}, \quad r^t = f(x^t) + \varepsilon$$
Linear model:
$$g(x) = w_1 x + w_0$$
Quadratic model:
$$g(x) = w_2 x^2 + w_1 x + w_0$$
Empirical error:
$$E(g \mid \mathcal{X}) = \frac{1}{N} \sum_{t=1}^{N} \left[ r^t - g(x^t) \right]^2$$
$$E(w_1, w_0 \mid \mathcal{X}) = \frac{1}{N} \sum_{t=1}^{N} \left[ r^t - (w_1 x^t + w_0) \right]^2$$
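A minimal numpy sketch fitting both models by least squares on synthetic noisy data; the underlying function and noise level are assumed for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0, 5, 40)
r = 2.0 * x + 1.0 + rng.normal(0, 0.5, size=x.size)  # r^t = f(x^t) + noise

# Fit g(x) = w1*x + w0 and g(x) = w2*x^2 + w1*x + w0 by least squares.
w_lin = np.polyfit(x, r, deg=1)
w_quad = np.polyfit(x, r, deg=2)

def empirical_error(w, x, r):
    """E(g|X): mean squared residual over the training set."""
    return np.mean((r - np.polyval(w, x)) ** 2)

print(empirical_error(w_lin, x, r), empirical_error(w_quad, x, r))
```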
Model Selection & Generalization
Learning is an ill-posed problem; data is
not sufficient to find a unique solution
Generalization: How well a model
performs on new data
Overfitting: H more complex than C or f
Underfitting: H less complex than C or f
Triple Trade-Off
There is a trade-off between three factors (Dietterich, 2003):
1. Complexity of H, c(H)
2. Training set size, N
3. Generalization error, E, on new data
As N increases, E decreases.
As c(H) increases, E first decreases and then increases.
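A hedged sketch of the second trade-off: sweeping model complexity (polynomial degree) and watching the error on held-out data first fall and then rise. The data and degrees are assumed for illustration:

```python
import numpy as np

rng = np.random.default_rng(1)
x = np.linspace(0, 3, 30)
r = np.sin(2 * x) + rng.normal(0, 0.2, size=x.size)
x_new = np.linspace(0, 3, 100)                    # "new data"
r_new = np.sin(2 * x_new) + rng.normal(0, 0.2, size=x_new.size)

for degree in (1, 3, 5, 9, 15):                   # growing c(H)
    w = np.polyfit(x, r, deg=degree)
    E = np.mean((r_new - np.polyval(w, x_new)) ** 2)
    print(degree, round(E, 4))                    # E typically dips, then climbs
```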
Cross-Validation
To estimate generalization error, we need data unseen during training. We split the data as:
Training set (50%)
Validation set (25%)
Test (publication) set (25%)
Resampling (e.g., K-fold cross-validation) when data are scarce
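A short sketch of the split plus K-fold resampling for the scarce-data case, assuming scikit-learn; the placeholder data and the 50/25/25 proportions follow the slide:

```python
import numpy as np
from sklearn.model_selection import KFold, train_test_split

X = np.arange(100).reshape(-1, 1)   # placeholder data
y = (X.ravel() > 50).astype(int)

# 50% train / 25% validation / 25% test.
X_train, X_rest, y_train, y_rest = train_test_split(X, y, test_size=0.5, random_state=0)
X_val, X_test, y_val, y_test = train_test_split(X_rest, y_rest, test_size=0.5, random_state=0)

# With few data, resample instead: each point serves in training and validation.
kf = KFold(n_splits=5, shuffle=True, random_state=0)
for train_idx, val_idx in kf.split(X):
    pass  # fit on X[train_idx], validate on X[val_idx]
```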
Dimensions of a Supervised Learner
1. Model:
$$g(x \mid \theta)$$
2. Loss function:
$$E(\theta \mid \mathcal{X}) = \sum_t L\left(r^t, g(x^t \mid \theta)\right)$$
3. Optimization procedure:
$$\theta^* = \arg\min_{\theta} E(\theta \mid \mathcal{X})$$
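A minimal sketch instantiating the three dimensions for a linear model with squared loss, optimized by gradient descent; the data, learning rate, and iteration count are assumptions for illustration:

```python
import numpy as np

rng = np.random.default_rng(2)
x = np.linspace(0, 5, 50)
r = 1.5 * x + 0.5 + rng.normal(0, 0.3, size=x.size)

# 1. Model: g(x | theta), with theta = (w1, w0)
def g(x, theta):
    return theta[0] * x + theta[1]

# 2. Loss function: E(theta | X) = sum_t L(r^t, g(x^t | theta)), squared loss here
def E(theta):
    return np.sum((r - g(x, theta)) ** 2)

# 3. Optimization procedure: theta* = argmin_theta E(theta | X), via gradient descent
theta = np.zeros(2)
lr = 1e-4
for _ in range(5000):
    residual = r - g(x, theta)
    grad = np.array([-2 * np.sum(residual * x), -2 * np.sum(residual)])
    theta -= lr * grad
print(theta)  # close to the assumed (1.5, 0.5)
```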