
EN3150 Pattern Recognition

Learning from data and related challenges

M. T. U. Sampath K. Perera,
Department of Electronic and Telecommunication Engineering,
University of Moratuwa.
([email protected]).
Semester 5 – Batch 21.
What is learning?
"A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E." [1]

[Diagram: the three ingredients of learning (experience E, tasks T, performance measure P) and how performance at tasks in T, as measured by P, improves with experience E.]

[1] Mitchell, Tom M. Machine Learning. McGraw-Hill, 1997.


What is learning? An example
➢ The same ingredients, tasks T, performance measure P, and experience E, can be identified in a concrete problem such as recognizing handwritten digits from the MNIST dataset.

[Figure: sample of the MNIST dataset of handwritten digits (https://en.wikipedia.org/wiki/MNIST_database)]
Learning from data
➢ There are different types of learning from data:
  ➢ Supervised
  ➢ Unsupervised
  ➢ Semi-supervised
  ➢ Self-supervised
  ➢ Reinforcement learning

[Diagram: a typical workflow of data preparation ➔ model training ➔ model evaluation.]
Learning from data: Supervised learning
➢ Supervised learning:
  o The algorithm learns from labeled training data to make predictions or decisions.
➢ Labeled training data?
  o The training data consists of input examples (also called features) along with their corresponding output labels (also called targets or ground truth).
➢ The goal of supervised learning is to learn a mapping function that can predict the correct output label for new, unseen input examples.

[Figure: an ML model mapping images of handwritten digits to the labels "Zero" and "Five".]
Learning from data: Supervised learning
➢ Labeled training data: handwritten digits from the MNIST dataset.
➢ MNIST contains 28x28-pixel images of handwritten digits (0 to 9) along with their corresponding labels.
➢ Flattened into a table, each row holds the 784 pixel values of one image plus its label (only the first and last pixel columns are shown):

       0    1  ...  781  782  783  label
  0  0.0  0.0  ...  0.0  0.0  0.0      5
  1  0.0  0.0  ...  0.0  0.0  0.0      0
  2  0.0  0.0  ...  0.0  0.0  0.0      4
  3  0.0  0.0  ...  0.0  0.0  0.0      1
  4  0.0  0.0  ...  0.0  0.0  0.0      9
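A table like the one above can be obtained with scikit-learn's fetch_openml; this is a minimal sketch (not on the original slide), assuming the dataset can be downloaded on first use:

```python
from sklearn.datasets import fetch_openml

# Download MNIST (70,000 images of 28x28 = 784 pixels) as a pandas DataFrame
X, y = fetch_openml("mnist_784", version=1, return_X_y=True, as_frame=True)

print(X.shape)   # (70000, 784): one row per image, one column per pixel
print(y.head())  # corresponding labels: '5', '0', '4', '1', '9', ...
```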
Learning from data: Unsupervised learning
➢ Unsupervised learning involves training an algorithm on unlabeled data, without explicit output labels.
➢ The algorithm's objective is to find patterns, structures, or relationships within the data.

Learning from data: Semi-Supervised Learning
➢ Semi-supervised learning: supervised + unsupervised learning.
➢ The training data contains a mixture of labeled and unlabeled examples.
Learning from data: Self-Supervised Learning
➢ The model generates its own supervisory signal (targets) from the data itself, rather than relying on human-provided labels.
➢ E.g., Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs) can be used to generate data.
➢ Contrastive learning: the model is trained to discriminate between positive pairs (similar samples) and negative pairs (dissimilar samples).

A. Creswell, T. White, V. Dumoulin, K. Arulkumaran, B. Sengupta and A. A. Bharath, "Generative Adversarial Networks: An Overview," IEEE Signal Processing Magazine, vol. 35, no. 1, pp. 53-65, Jan. 2018, doi: 10.1109/MSP.2017.2765202.
Learning from data: Reinforcement Learning
➢ An agent learns to make decisions by interacting with an environment.
➢ The agent receives feedback in the form of rewards or penalties based on its actions.
➢ The goal of the agent is to learn an optimal policy that maximizes the cumulative reward over time.

[Diagram: agent-environment loop; the agent takes actions and receives rewards or penalties in return.]
Supervised vs Unsupervised learning
Supervised:
➢ Uses labeled input and output data.
➢ Well-defined objective (as labels are given, you know what type of results to expect).
➢ Human intervention is required to label the data.
➢ If you have labeled data and a clear target variable to predict, use supervised learning for accurate predictions.
Unsupervised:
➢ Labels are not available.
➢ May discover hidden relationships.
➢ Can be used to learn meaningful representations or features from raw data.
➢ If you have large amounts of unlabeled data and want to find patterns or groupings in the data, opt for unsupervised learning.
Either way:
➢ If you have a mix of labeled and unlabeled data, or the cost of labeling data is high, consider using semi-supervised learning to leverage both types of data.
Learning from data and related challenges
➢ Data Quality and Quantity
  o Noisy, incomplete data can lead to inaccurate and unreliable predictions.
  o Learning often requires large amounts of data.
➢ Data Imbalance
  o E.g., imbalanced classes in classification (such as a 90% / 10% class split).
  o May lead to poor performance.
Learning from data and related challenges
➢ Overfitting: the model performs exceptionally well on the training data but fails to generalize to new, unseen data.
➢ Underfitting: the model is too simplistic to capture the underlying patterns in the data.
➢ Generalization: ensuring that machine learning models generalize well to new, unseen data.

[Figure: example fits illustrating underfitting and overfitting.]
Data preparation
➢ Data cleaning: handle missing or inconsistent data.
  o Approaches: removing the affected samples, filling with zeros/mean/median, or interpolation (see the sketch after this list).
➢ Data cleaning: outlier* detection and removal.
➢ Data preprocessing: feature scaling (e.g., normalization).
➢ Data preprocessing: dimensionality reduction, e.g., Principal Component Analysis (PCA).

[Figure: scatter plot distinguishing inliers from outliers.]

*An outlier in a dataset refers to a data point that deviates significantly from the majority of the other data points.
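A minimal pandas sketch (not from the slides) of the cleaning approaches listed above, using a small hypothetical DataFrame with missing values:

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({"x": [1.0, 2.0, np.nan, 4.0, 5.0],
                   "y": [10.0, np.nan, 30.0, np.nan, 50.0]})

dropped      = df.dropna()            # remove rows with missing values
zero_filled  = df.fillna(0.0)         # fill missing entries with zeros
mean_filled  = df.fillna(df.mean())   # fill with the column mean (median: df.median())
interpolated = df.interpolate()       # linear interpolation between neighbouring values

print(interpolated)
```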
Data preparation
➢ Data augmentation: artificially expand the size and diversity of a given dataset.
  o E.g., image rotation, flipping, scaling, cropping ➔ new images.
➢ Imbalanced data (a small sketch follows below):
  o Undersampling of the majority class.
  o Generating synthetic samples of the minority class.
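A minimal sketch (not part of the slides) of undersampling the majority class, using a synthetic 90% / 10% imbalanced dataset; the column names are hypothetical:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(42)

# Synthetic imbalanced dataset: roughly 90% class 0, 10% class 1
df = pd.DataFrame({
    "feature": rng.normal(size=1000),
    "label": (rng.random(1000) < 0.1).astype(int),
})

majority = df[df["label"] == 0]
minority = df[df["label"] == 1]

# Randomly undersample the majority class down to the minority-class size, then shuffle
majority_down = majority.sample(n=len(minority), random_state=42)
balanced = pd.concat([majority_down, minority]).sample(frac=1, random_state=42)
print(balanced["label"].value_counts())
```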
Data preprocessing example
➢ https://scikit-learn.org/stable/modules/preprocessing.html
1. Standardization: scale the features of a dataset to have zero mean and unit variance.
2. Scaling features to a range, e.g., between 0 and 1:
  ➢ Min-max scaler ➔ [0, 1]
  ➢ Max-abs scaler ➔ [-1, 1]

If there are outliers, will this still work? Suggestions?
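A minimal scikit-learn sketch of the three scalings mentioned above on a toy array (the values are made up; the last row plays the role of an outlier):

```python
import numpy as np
from sklearn.preprocessing import StandardScaler, MinMaxScaler, MaxAbsScaler

X = np.array([[1.0, -2.0],
              [2.0,  0.0],
              [3.0,  4.0],
              [100.0, 1.0]])  # last row: an outlier in the first feature

print(StandardScaler().fit_transform(X))  # zero mean, unit variance per feature
print(MinMaxScaler().fit_transform(X))    # each feature mapped to [0, 1]
print(MaxAbsScaler().fit_transform(X))    # each feature mapped to [-1, 1]
```

Note how the outlier compresses the non-outlier values of the first feature into a narrow band under min-max scaling.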
Data preprocessing example
California Housing Dataset

import pandas as pd
from sklearn.datasets import fetch_california_housing

Use pandas df.describe() to get the following summary statistics for two of the features:

           MedInc       AveOccup
count   20640         20640
mean        3.870671      3.070655
std         1.899822     10.386050
min         0.499900      0.692308
25%         2.563400      2.429741
50%         3.534800      2.818116
75%         4.743250      3.282261
max        15.000100   1243.333333

Independent variables:
1. MedInc: median income in block group (measured in tens of thousands of US dollars)
2. HouseAge: median house age in block group (a lower number is a newer building)
3. AveRooms: average number of rooms per household
4. AveBedrms: average number of bedrooms per household
5. Population: block group population
6. AveOccup: average number of household members
7. Latitude: block group latitude (a higher value is farther north)
8. Longitude: block group longitude (a higher value is farther west)

Dependent variable:
1. medianHouseValue: median house value for households within a block (measured in US dollars)
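A short sketch of how the summary statistics above can be reproduced, assuming scikit-learn can download the dataset on first use:

```python
from sklearn.datasets import fetch_california_housing

# Load the California Housing dataset as a pandas DataFrame
housing = fetch_california_housing(as_frame=True)
df = housing.frame

# Summary statistics for two features with very different scales
print(df[["MedInc", "AveOccup"]].describe())
```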
Data preprocessing example: Min-max Scaler
➢ Average Occupancy: inliers are squeezed into the narrow range [0, 0.005], because the feature's maximum is an extreme outlier.
➢ The two features end up on very different scales.
➢ Balanced feature scales cannot be guaranteed in the presence of outliers.

[Figure: MedInc vs. AveOccup after min-max scaling.]
Data preprocessing example: Standard Scaler
➢ Average Occupancy: most values fall in [-0.2, 0.2]; Median Income spans roughly [-2, 4].
➢ The two features still end up on different scales.
➢ Balanced feature scales cannot be guaranteed in the presence of outliers.

[Figure: MedInc vs. AveOccup after standard scaling.]
Data preprocessing example: Robust Scaler
➢ Most data points in both features fall in the range [-2, 3].
➢ The features are scaled similarly even in the presence of outliers.
➢ The outliers themselves are still present after scaling.

Reference: sklearn.preprocessing.RobustScaler — scikit-learn 1.3.0 documentation

[Figure: MedInc vs. AveOccup after robust scaling.]
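A rough sketch (not from the slides) comparing how the three scalers spread the bulk of the two features; summarizing each scaled feature by its 1st and 99th percentiles is just one way to look at it:

```python
import numpy as np
from sklearn.datasets import fetch_california_housing
from sklearn.preprocessing import StandardScaler, MinMaxScaler, RobustScaler

X = fetch_california_housing(as_frame=True).frame[["MedInc", "AveOccup"]].to_numpy()

for scaler in (MinMaxScaler(), StandardScaler(), RobustScaler()):
    Xs = scaler.fit_transform(X)
    # Range covered by the bulk of the data (1st to 99th percentile), per feature
    lo, hi = np.percentile(Xs, [1, 99], axis=0)
    print(type(scaler).__name__, np.round(lo, 3), np.round(hi, 3))
```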
Data preprocessing example: comparison of scalers
➢ Min-max Scaler: Average Occupancy inliers squeezed into the narrow range [0, 0.005].
➢ Standard Scaler: Average Occupancy in [-0.2, 0.2], Median Income in [-2, 4]; the feature scales still differ.
➢ Robust Scaler: most data points in both features fall in the range [-2, 3].
Data preprocessing example: Quantile transformation
➢ A non-linear transformation.
➢ Spreads out the most frequent values in each feature, aiming to follow a uniform or normal distribution.
➢ Reduces the impact of outliers, making it a robust preprocessing technique.
➢ The transformation is applied independently to each feature.
➢ It estimates the cumulative distribution function of a feature to map the original values to a uniform or normal distribution.
➢ The obtained values are then mapped to the desired output distribution using the associated quantile function.

Reference: sklearn.preprocessing.QuantileTransformer — scikit-learn 1.3.0 documentation
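A minimal sketch of sklearn.preprocessing.QuantileTransformer on a synthetic heavy-tailed feature (the data are made up for illustration):

```python
import numpy as np
from sklearn.preprocessing import QuantileTransformer

rng = np.random.default_rng(0)
# Heavy-tailed feature with a few extreme outliers appended
X = np.concatenate([rng.lognormal(size=1000), [50.0, 100.0]]).reshape(-1, 1)

qt = QuantileTransformer(output_distribution="normal", n_quantiles=500, random_state=0)
Xq = qt.fit_transform(X)

print(X.min(), X.max())    # original range is dominated by the outliers
print(Xq.min(), Xq.max())  # transformed values roughly follow a standard normal
```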
Homework
➢ Task: Comparing Data Normalization Methods (see course page in Moodle).
ML Training Process (Supervised Learning)
➢ Pipeline: data preparation ➔ data splitting ➔ model training (training set) ➔ model validation (validation set) ➔ model testing (testing set).
➢ Performance evaluation metrics:
  • Accuracy
  • Precision
  • Recall
  • F1-Score
  • Mean Absolute Error (MAE)
  • Mean Squared Error (MSE)
  • Root Mean Squared Error (RMSE)

Reference: sklearn.model_selection.train_test_split — scikit-learn 1.3.0 documentation
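A minimal sketch of the train/validation/test split using sklearn.model_selection.train_test_split; the 60/20/20 proportions are an illustrative choice, not prescribed by the slides:

```python
from sklearn.datasets import fetch_california_housing
from sklearn.model_selection import train_test_split

X, y = fetch_california_housing(return_X_y=True)

# First split off a held-out test set, then carve a validation set out of the remainder
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
X_train, X_val, y_train, y_val = train_test_split(X_train, y_train, test_size=0.25, random_state=42)

print(len(X_train), len(X_val), len(X_test))  # roughly 60% / 20% / 20%
```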
ML Training Process (Supervised Learning)
➢ Loss functions: used to measure how far the predictions made by a machine learning model are from the actual correct answers.
  ➢ Mean Squared Error (MSE)
  ➢ Mean Absolute Error (MAE)
  ➢ Binary Cross-Entropy (Log Loss)
  ➢ Categorical Cross-Entropy
  ➢ Sparse Categorical Cross-Entropy
  ➢ Hinge Loss
  ➢ Kullback-Leibler Divergence (KL Divergence)
  ➢ Huber Loss
  ➢ Triplet Loss
  ➢ Ranking Losses (e.g., Hinge Rank Loss, RankNet Loss)
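A small NumPy sketch (not from the slides) of two of the listed regression losses and binary cross-entropy, evaluated on made-up predictions:

```python
import numpy as np

y_true = np.array([3.0, -0.5, 2.0, 7.0])
y_pred = np.array([2.5,  0.0, 2.0, 8.0])

mse = np.mean((y_true - y_pred) ** 2)    # Mean Squared Error
mae = np.mean(np.abs(y_true - y_pred))   # Mean Absolute Error

# Binary cross-entropy (log loss) for probabilistic class predictions
p_true = np.array([1, 0, 1, 1])
p_pred = np.array([0.9, 0.2, 0.7, 0.99])
bce = -np.mean(p_true * np.log(p_pred) + (1 - p_true) * np.log(1 - p_pred))

print(mse, mae, bce)
```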
How to evaluate a model
➢ Accuracy, Precision, Recall, and F-Score

Confusion matrix (rows: true class, columns: predicted class):

                      Predicted positive (+)   Predicted negative (-)   Total
True positive (+)     True Pos. (TP)           False Neg. (FN)          P
True negative (-)     False Pos. (FP)          True Neg. (TN)           N
Total                 P*                       N*

FP is a type I error (false alarm); FN is a type II error (miss).

\[
\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}, \qquad
\text{Precision} = \frac{TP}{TP + FP}, \qquad
\text{Recall} = \frac{TP}{TP + FN}, \qquad
F_1 = \frac{2}{\frac{1}{\text{Precision}} + \frac{1}{\text{Recall}}}
\]

➢ Accuracy can be a misleading metric for imbalanced data sets.
➢ Higher F1-score values indicate a better balance between precision and recall.
How to evaluate a model
➢ Accuracy, Precision, Recall, and F-Score

\[
\text{Precision} = \frac{TP}{TP + FP}, \qquad
\text{Recall} = \frac{TP}{TP + FN}
\]

➢ A higher precision indicates a lower rate of false positives, which means the model is making fewer incorrect positive predictions.
➢ A higher recall indicates a lower rate of false negatives, meaning the model is correctly identifying more positive instances.

https://en.wikipedia.org/wiki/Precision_and_recall
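A minimal sketch computing these metrics with sklearn.metrics on made-up predictions (note that scikit-learn's confusion_matrix orders the classes [0, 1], so the negative class appears in the first row, unlike the table above):

```python
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, confusion_matrix)

y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]

print(confusion_matrix(y_true, y_pred))          # rows: true class, columns: predicted class
print("accuracy :", accuracy_score(y_true, y_pred))
print("precision:", precision_score(y_true, y_pred))
print("recall   :", recall_score(y_true, y_pred))
print("f1       :", f1_score(y_true, y_pred))
```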
Model selection
➢ Model selection is the process of choosing the best model from a set of candidate models for a specific task.
➢ E.g., for MNIST handwritten-digit classification, the candidate models could include Convolutional Neural Networks (CNNs), Decision Trees, and k-Nearest Neighbors (k-NN).
➢ Hyperparameters: parameters that are set before the training process begins, e.g., the learning rate.
➢ Hyperparameters are not learned from data.

[Figure: sample of the MNIST dataset of handwritten digits (https://en.wikipedia.org/wiki/MNIST_database)]
Model selection
➢ Model selection is the process of choosing the best model from a set of candidate models for a specific task.
➢ A typical workflow: train the candidate model(s) on the training set ➔ tune the hyperparameters (parameters not learned from data) ➔ re-train the model(s) ➔ evaluate on the testing set ➔ select the best model, e.g., using performance metrics such as accuracy or F1-score.
Model selection
Resampling technique: k-fold cross validation (k-CV)
• The dataset is divided into k subsets (folds) of approximately equal size.
• The model is trained and evaluated k times, each time using a different fold as the test set and the rest as the training set.
• For each iteration, the model is trained on (k-1) folds and evaluated on the remaining fold.

[Image from scikit-learn.org]
Model selection: Resampling technique: k-fold cross validation (k-CV)
➢ Reduced overfitting: mitigates overfitting by testing the model on unseen data subsets, ensuring better generalization to new data.
➢ Evaluates model performance for various hyperparameter settings.
➢ Helps to identify the best hyperparameters for optimal model performance.
➢ Allows fair and consistent evaluation of multiple models.
➢ Maximizes data utilization for both training and testing.
➢ Ensures all available data contributes to model evaluation.
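A minimal k-fold cross-validation sketch with scikit-learn; the digits dataset and the k-NN classifier are arbitrary choices for illustration:

```python
from sklearn.datasets import load_digits
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = load_digits(return_X_y=True)

# 5-fold CV: each fold is used exactly once as the held-out evaluation set
scores = cross_val_score(KNeighborsClassifier(n_neighbors=3), X, y, cv=5, scoring="accuracy")
print(scores, scores.mean())
```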
Model selection: hyperparameter tuning with Grid Search
➢ Grid Search involves defining a grid of hyperparameter values to explore.
➢ It systematically evaluates all possible combinations from the grid to identify the best-performing one.
➢ To avoid overfitting during Grid Search, cross-validation is commonly used.
➢ Grid Search can be computationally expensive when the hyperparameter space is large.

[Image from scikit-learn.org]
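A minimal GridSearchCV sketch; the k-NN hyperparameter grid below is a made-up example:

```python
from sklearn.datasets import load_digits
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsClassifier

X, y = load_digits(return_X_y=True)

# Grid of candidate hyperparameter values; every combination is evaluated with 5-fold CV
param_grid = {"n_neighbors": [1, 3, 5, 7], "weights": ["uniform", "distance"]}
search = GridSearchCV(KNeighborsClassifier(), param_grid, cv=5, scoring="accuracy")
search.fit(X, y)

print(search.best_params_, search.best_score_)
```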


Model Selection: Probabilistic
➢ Statistical modeling is used to choose the most appropriate model among a set of candidate models.
➢ Model comparison criteria ("information theory perspective"):
  ➢ Akaike Information Criterion (AIC)
  ➢ Bayesian Information Criterion (BIC)
  ➢ Minimum Description Length (MDL)
Model Selection: Probabilistic
➢ Akaike Information Criterion (AIC):

\[
\text{AIC} = \ln p(\mathcal{D} \mid \hat{\theta}_{\text{ML}}) - M
\]

where \(\ln p(\mathcal{D} \mid \hat{\theta}_{\text{ML}})\) is the best-fit log likelihood and \(M\) is the number of adjustable parameters of the model.
✓ Select the model with the largest value.
✓ Both model complexity and model performance are considered.
➢ Bayesian Information Criterion (BIC): a variation of AIC.
✓ Generally penalizes model complexity more than AIC ➔ more complex models are less likely to be selected.
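A rough numerical sketch of the AIC idea in the form used above (best-fit log likelihood minus the number of adjustable parameters), comparing polynomial models of different degree on synthetic data; the Gaussian-noise likelihood and the parameter count are simplifying assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0, 1, 40)
y = np.sin(2 * np.pi * x) + rng.normal(scale=0.2, size=x.size)  # true function + noise

for degree in (1, 3, 5, 9):
    coeffs = np.polyfit(x, y, degree)
    resid = y - np.polyval(coeffs, x)
    sigma2 = np.mean(resid ** 2)                                 # ML estimate of noise variance
    log_lik = -0.5 * x.size * (np.log(2 * np.pi * sigma2) + 1)   # best-fit Gaussian log likelihood
    M = degree + 1                                               # number of polynomial coefficients
    aic = log_lik - M                                            # AIC in the form used on the slide
    print(f"degree={degree}: log-likelihood={log_lik:.1f}, AIC={aic:.1f}")
```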
Bias-variance trade-off
➢ Given a dataset with samples denoted as \((x_i, y_{\text{true},i}),\ i = 1, \dots, n\).
➢ A learned model \(\hat{f}\) maps input features to predicted outputs, \(y_{\text{pred},i} = \hat{f}(x_i)\).

Mean Squared Error (MSE):
\[
\text{MSE} = \frac{1}{n} \sum_{i=1}^{n} \left( y_{\text{true},i} - y_{\text{pred},i} \right)^2
\]

➢ Assume the data are generated as \(y_i = y_{\text{true},i} = f(x_i) + e\). In expectation,
\[
\text{MSE} = \mathbb{E}\!\left[ \left( y_{\text{true}} - y_{\text{pred}} \right)^2 \right]
= \mathbb{E}\!\left[ \left( f(x) + e - \hat{f}(x) \right)^2 \right]
= \mathbb{E}\!\left[ \left( \big( f(x) - \hat{f}(x) \big) + e \right)^2 \right]
\]
\[
= \mathbb{E}\!\left[ \big( f(x) - \hat{f}(x) \big)^2 \right] + \mathbb{E}\!\left[ e^2 \right]
+ 2\,\mathbb{E}\!\left[ \big( f(x) - \hat{f}(x) \big)\, e \right].
\]
Assuming \(e\) and \(f(x) - \hat{f}(x)\) are independent, the cross term becomes \(2\,\mathbb{E}\!\left[ f(x) - \hat{f}(x) \right]\mathbb{E}[e]\), and with \(\sigma_e^2 = \mathbb{E}[e^2] - (\mathbb{E}[e])^2\) and \(\mathbb{E}[e] = 0\),
\[
\text{MSE} = \mathbb{E}\!\left[ \big( f(x) - \hat{f}(x) \big)^2 \right] + \sigma_e^2 .
\]

➢ Expanding the first term around \(\mathbb{E}[\hat{f}(x)]\):
\[
\mathbb{E}\!\left[ \big( f(x) - \hat{f}(x) \big)^2 \right]
= \mathbb{E}\!\left[ \left( \big( f(x) - \mathbb{E}[\hat{f}(x)] \big) + \big( \mathbb{E}[\hat{f}(x)] - \hat{f}(x) \big) \right)^2 \right]
\]
\[
= \mathbb{E}\!\left[ \big( f(x) - \mathbb{E}[\hat{f}(x)] \big)^2 \right]
+ \mathbb{E}\!\left[ \big( \mathbb{E}[\hat{f}(x)] - \hat{f}(x) \big)^2 \right]
+ 2\,\mathbb{E}\!\left[ \big( f(x) - \mathbb{E}[\hat{f}(x)] \big)\big( \mathbb{E}[\hat{f}(x)] - \hat{f}(x) \big) \right].
\]
The cross term vanishes because \(f(x) - \mathbb{E}[\hat{f}(x)]\) is deterministic:
\[
2\,\mathbb{E}\!\left[ \big( f(x) - \mathbb{E}[\hat{f}(x)] \big)\big( \mathbb{E}[\hat{f}(x)] - \hat{f}(x) \big) \right]
= 2 \big( f(x) - \mathbb{E}[\hat{f}(x)] \big)\,\mathbb{E}\!\left[ \mathbb{E}[\hat{f}(x)] - \hat{f}(x) \right] = 0 .
\]
Therefore
\[
\mathbb{E}\!\left[ \big( f(x) - \hat{f}(x) \big)^2 \right]
= \mathbb{E}\!\left[ \big( f(x) - \mathbb{E}[\hat{f}(x)] \big)^2 \right]
+ \mathbb{E}\!\left[ \big( \mathbb{E}[\hat{f}(x)] - \hat{f}(x) \big)^2 \right].
\]

➢ Putting the pieces together:
\[
\text{MSE}
= \mathbb{E}\!\left[ \big( f(x) - \mathbb{E}[\hat{f}(x)] \big)^2 \right]
+ \mathbb{E}\!\left[ \big( \mathbb{E}[\hat{f}(x)] - \hat{f}(x) \big)^2 \right]
+ \sigma_e^2
\]
\[
= \big( \mathbb{E}[\hat{f}(x)] - f(x) \big)^2
+ \mathbb{E}\!\left[ \big( \hat{f}(x) - \mathbb{E}[\hat{f}(x)] \big)^2 \right]
+ \sigma_e^2
\]
\[
= \text{bias}^2 + \text{variance} + \text{irreducible error},
\]
where the irreducible error \(\sigma_e^2\) cannot be reduced by any model.
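A small simulation sketch of the decomposition above: repeatedly draw training sets from \(y = f(x) + e\), fit models of different complexity, and estimate bias\(^2\) and variance of the prediction at a single test point (the choices of \(f\), the noise level, and the polynomial degrees are all illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

def f(x):                          # true underlying function
    return np.sin(2 * np.pi * x)

x_test = 0.3                       # single test point, for clarity
n_repeats, n_samples, sigma_e = 500, 20, 0.3

for degree in (1, 5):
    preds = np.empty(n_repeats)
    for r in range(n_repeats):
        x = rng.uniform(0, 1, n_samples)
        y = f(x) + rng.normal(scale=sigma_e, size=n_samples)   # y = f(x) + e
        coeffs = np.polyfit(x, y, degree)
        preds[r] = np.polyval(coeffs, x_test)                   # f_hat(x_test) for this dataset
    bias2 = (preds.mean() - f(x_test)) ** 2   # (E[f_hat(x)] - f(x))^2
    variance = preds.var()                    # E[(f_hat(x) - E[f_hat(x)])^2]
    print(f"degree={degree}: bias^2={bias2:.4f}, variance={variance:.4f}")
```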


Bias-variance trade-off
➢ High bias: inability to capture the true relationship between input data and output.

[Figure: marks vs. hours of study (per week) for training samples, comparing a high-bias fit with a low-bias fit that fits the training data well in this example.]
Bias-variance trade-off
➢ High variance: the fitted model changes substantially across different data sets, so it fails to fit new data sets consistently.
➢ Low variance means that the estimator does not change much when different training datasets are used.

[Figure: marks vs. hours of study (per week) for testing samples; the low-variance fit also fits the testing data well in this example, while the high-variance fit does not.]
Bias-variance trade-off
➢ A too simple model will have high bias but low variance ➔ underfitting.
➢ A highly complex model will have low bias but high variance ➔ overfitting.

[Figure: fits of increasing model complexity, from underfitting (low complexity) to overfitting (high complexity). Image from https://en.wikipedia.org/wiki/Bias%E2%80%93variance_tradeoff]

Underfitting: high bias but low variance. Overfitting: low bias but high variance.
Bias-variance trade-off
➢ Underfitting: variance low, bias high.
➢ Overfitting: variance high, bias low.

[Figure: error vs. model complexity (from low to high) for training samples and testing samples.]
Bias-variance trade-off
➢ The aim is a model with low bias and low variance.

[Figure: the four combinations of low/high bias and low/high variance.]
Thank You
Q&A
