C1 Supervised Machine Learning Week 3

This document covers the fundamentals of classification in supervised machine learning, focusing on logistic regression as a solution to the limitations of linear regression for binary classification tasks. It discusses the importance of cost functions, regularization techniques to prevent overfitting, and the implementation of gradient descent for optimizing logistic regression models. Additionally, it highlights the significance of finding a balance between model complexity and generalization to new data.


Supervised Machine Learning: Regression and Classification
Week 3 – CLASSIFICATION
Classification
 Classification problems involve predicting a limited number of
possible outcomes, such as determining if an email is spam (yes or
no) or if a financial transaction is fraudulent (true or false).
 Binary classification refers to problems with only two possible
outputs, often represented as 0 (negative class) and 1 (positive
class).
Limitations of Linear Regression for Classification
 Linear regression predicts a continuous range of values, which is not ideal for classification tasks where the output should be categorical.
 Adding new data points can shift the decision boundary inappropriately, leading to incorrect classifications.
Introduction to Logistic Regression
 Logistic regression is introduced as a more effective algorithm for
binary classification, ensuring outputs remain between 0 and 1.
 Despite its name, logistic regression is used for classification rather
than regression, addressing the limitations of linear regression in
these scenarios.
Optional lab: Classification (linear regression approach)
 The Sigmoid function, defined as ( g(z) = \frac{1}{1 + e^{-z}} ),
transforms the linear combination of features into a probability.
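
As a quick illustration, here is a minimal NumPy sketch of the sigmoid as defined above (not taken from the lab itself):

    import numpy as np

    def sigmoid(z):
        # g(z) = 1 / (1 + e^(-z)) maps any real number into the range (0, 1)
        return 1.0 / (1.0 + np.exp(-z))

    # Large negative inputs give values near 0, large positive inputs values near 1
    print(sigmoid(np.array([-10.0, 0.0, 10.0])))  # approx. [0.0000454, 0.5, 0.9999546]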
Optional Lab: Logistic Regression - Sigmoid or Logistic Function

Understanding Logistic Regression
 The logistic regression model computes outputs in two steps: first, calculating ( z = w \cdot x + b ), and then applying the Sigmoid function ( g(z) ) to obtain the probability that ( y = 1 ) given ( x ).
 A common threshold for making predictions is 0.5; if ( f(x) \geq 0.5 ),
then ( y ) is predicted as 1, otherwise as 0.
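
A minimal sketch of this two-step computation and the 0.5 threshold; the parameter values and the helper name predict are made up for illustration:

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def predict(x, w, b, threshold=0.5):
        # Step 1: linear combination z = w . x + b
        z = np.dot(w, x) + b
        # Step 2: sigmoid gives the probability that y = 1 given x
        prob = sigmoid(z)
        # Predict 1 if the probability is at least the threshold, otherwise 0
        return int(prob >= threshold), prob

    # Illustrative example with made-up parameter values
    w = np.array([1.5, -0.5])
    b = -1.0
    print(predict(np.array([2.0, 1.0]), w, b))  # -> (1, ~0.82)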

Complex Decision Boundaries

 By incorporating polynomial features, logistic regression can model more complex decision boundaries, such as circles or ellipses, allowing it to fit intricate data patterns.
 The decision boundary can become non-linear with higher-order polynomial terms, enabling the model to predict ( y = 1 ) or ( y = 0 ) based on more complex relationships between features.
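
A hedged sketch of this idea with scikit-learn; the toy circular dataset and the choice of degree-2 features are assumptions made for illustration, not the lab's data:

    import numpy as np
    from sklearn.preprocessing import PolynomialFeatures
    from sklearn.linear_model import LogisticRegression

    # Toy data: label is 1 when the point lies inside a circle of radius 1
    rng = np.random.default_rng(0)
    X = rng.uniform(-2, 2, size=(200, 2))
    y = (X[:, 0] ** 2 + X[:, 1] ** 2 < 1.0).astype(int)

    # Degree-2 features (x1, x2, x1^2, x1*x2, x2^2) let a linear boundary in
    # feature space become a circle or ellipse in the original x-space
    poly = PolynomialFeatures(degree=2, include_bias=False)
    X_poly = poly.fit_transform(X)

    model = LogisticRegression().fit(X_poly, y)
    print(model.score(X_poly, y))  # training accuracy; typically close to 1.0 here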

Optional Lab: Logistic Regression - Decision Boundary (plotting the decision boundary)
Cost Functions
 The cost function measures how well a set of parameters fits the training data, guiding the selection of better parameters.
 The squared error cost function is not suitable for logistic regression because, with the sigmoid model, it becomes non-convex, so gradient descent can get stuck in local minima during optimization.
New Loss Function
 A new loss function is introduced for logistic regression, defined
based on the true label and the predicted probability. The loss
function is designed to be convex, ensuring that gradient descent
can reliably converge to the global minimum.
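
Concretely, the loss described here is the standard logistic loss, which can be written piecewise as:

( L(f(x^{(i)}), y^{(i)}) = -\log(f(x^{(i)})) ) if ( y^{(i)} = 1 )
( L(f(x^{(i)}), y^{(i)}) = -\log(1 - f(x^{(i)})) ) if ( y^{(i)} = 0 )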
Analyzing Loss for Different Labels
 When the true label is 1, the loss incentivizes accurate predictions
close to 1, resulting in a low loss value. Conversely, when the true
label is 0, the loss increases significantly as the predicted probability
approaches 1, penalizing incorrect predictions.

Optional lab: Logistic loss function
Simplified Loss Function
 The loss function can be expressed as a single equation: ( L(f(x^{(i)}), y^{(i)}) = -y^{(i)} \log(f(x^{(i)})) - (1 - y^{(i)}) \log(1 - f(x^{(i)})) ), which reduces to the two cases above depending on whether ( y^{(i)} ) is 1 or 0.
Cost Function for Logistic Regression
 The cost function ( J ) is the average loss across the training set: ( J(w, b) = \frac{1}{m} \sum_{i=1}^{m} L(f(x^{(i)}), y^{(i)}) ).
 The derived cost function is commonly used in training logistic
regression and is based on the principle of maximum likelihood
estimation.
 This cost function is convex, which is beneficial for optimization,
ensuring that gradient descent can effectively find the best
parameters.
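
A minimal NumPy sketch of this averaging (function and variable names are illustrative, not the lab's):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def compute_cost(X, y, w, b):
        # Average logistic loss over all m training examples
        m = X.shape[0]
        f = sigmoid(X @ w + b)                            # predicted probabilities
        loss = -y * np.log(f) - (1 - y) * np.log(1 - f)   # per-example loss
        return loss.sum() / m

    # Example with tiny made-up data: w = 0, b = 0 predicts 0.5 everywhere,
    # so the cost is ln(2) ≈ 0.693
    X = np.array([[1.0, 2.0], [2.0, 0.5], [0.5, 1.0]])
    y = np.array([1, 1, 0])
    print(compute_cost(X, y, w=np.zeros(2), b=0.0))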
Optional Lab: Cost Function for Logistic Regression
Gradient Descent Algorithm


 The gradient descent algorithm updates each parameter by
calculating the derivative of the cost function with respect
to ( w ) and ( b ).
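
One way these simultaneous updates could look in NumPy; the learning rate alpha and the function name are assumptions made for illustration:

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def gradient_descent_step(X, y, w, b, alpha):
        # Gradients of the logistic cost:
        # dJ/dw_j = (1/m) * sum((f - y) * x_j),  dJ/db = (1/m) * sum(f - y)
        m = X.shape[0]
        err = sigmoid(X @ w + b) - y          # prediction error for each example
        dj_dw = (X.T @ err) / m               # partial derivatives w.r.t. each w_j
        dj_db = err.sum() / m                 # partial derivative w.r.t. b
        # Simultaneous update of all parameters
        return w - alpha * dj_dw, b - alpha * dj_db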
Difference Between Linear and Logistic Regression
 Although the equations for gradient descent in both algorithms appear similar, they differ in the definition of the function ( f(x) ); logistic regression uses the sigmoid function, while linear regression uses a linear function.
 Feature scaling can be applied to both algorithms to help gradient descent converge faster.
Optional lab: Gradient descent for logistic regression
Optional lab: Logistic regression with scikit-learn
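
A minimal sketch in the spirit of that scikit-learn lab, using a made-up toy dataset and only standard LogisticRegression calls:

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    # Toy 1-D dataset: small values labelled 0, larger values labelled 1
    X = np.array([[0.5], [1.0], [1.5], [3.0], [3.5], [4.0]])
    y = np.array([0, 0, 0, 1, 1, 1])

    model = LogisticRegression()
    model.fit(X, y)

    print(model.predict(X))   # predicted labels
    print(model.score(X, y))  # accuracy on the training set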

Understanding Overfitting and Underfitting
 Overfitting occurs when a model learns the training data too well, capturing noise and fluctuations, which leads to poor generalization on new data. This is often associated with high variance.
 Underfitting happens when a model is too simple to capture the underlying patterns in the data, resulting in poor performance on both training and new data. This is linked to high bias.
 The goal in machine learning is to find a model that is "just right," meaning it neither underfits nor overfits the data. This balance allows for good generalization to new examples.
 Techniques like regularization can help mitigate overfitting, ensuring that the model remains flexible enough to capture the data's patterns without becoming overly complex.
Addressing Overfitting
Collecting More Data
 One effective way to combat overfitting is to gather more training data, which helps the learning algorithm fit a less complex function.
 More data allows the model to generalize better, reducing the likelihood of high variance.
Feature Selection
 If more data isn't available, consider using fewer features by selecting only the most relevant ones for the prediction task.
 This process, known as feature selection, can help prevent the model from overfitting by reducing complexity.
Regularization Techniques
 Regularization is another method to address overfitting by shrinking
the values of the model's parameters without eliminating features
entirely.
 This technique allows the model to retain all features while
minimizing their impact, leading to better generalization.

Optional Lab: Overfitting
Understanding Regularization
 Regularization aims to keep the parameter values ( w_1 ) through ( w_n ) small, which helps in creating a simpler model that is less prone to overfitting.
 By adding a penalty term to the cost function, such as ( 1000 \cdot w_3^2 + 1000 \cdot w_4^2 ), we encourage smaller values for those particular parameters.

Modified Cost Function
 The modified cost function includes the original mean squared error cost plus a regularization term, which penalizes all parameters ( w_1 ) to ( w_{100} ) to keep them small.
 The regularization parameter, lambda (λ), controls the trade-off between fitting the training data well and keeping the parameters small.
 In the summations, ( i ) runs from 1 to ( m ) (over training examples) and ( j ) runs from 1 to ( n ) (over features); note while studying that the ( j ) on the slide is hard to read because of the green dots and can look like an ( i ).
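
For reference, the regularized cost for linear regression being described here is usually written as:

( J(w, b) = \frac{1}{2m} \sum_{i=1}^{m} (f(x^{(i)}) - y^{(i)})^2 + \frac{\lambda}{2m} \sum_{j=1}^{n} w_j^2 )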
Choosing the Right Lambda
 If λ is set to 0, the model may overfit, while a very large λ (e.g.,
10^10) can lead to underfitting by forcing all parameters close to 0.
 The goal is to find a balanced λ that minimizes both the mean
squared error and the regularization term, resulting in a model that
fits the data appropriately.
Gradient Descent Updates
 The goal is to find parameters w and b that minimize this regularized cost function.
 The gradient descent algorithm updates parameters w and b using specific formulas, with the update for w_j now including an additional term from the regularization.
 The update for b remains unchanged since it is not regularized, while the update for w_j incorporates the regularization term to shrink its value.
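
Written out, the regularized update rules referred to here take the standard form:

( w_j := w_j - \alpha \left[ \frac{1}{m} \sum_{i=1}^{m} (f(x^{(i)}) - y^{(i)}) x_j^{(i)} + \frac{\lambda}{m} w_j \right] )
( b := b - \alpha \cdot \frac{1}{m} \sum_{i=1}^{m} (f(x^{(i)}) - y^{(i)}) )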
Intuition Behind Regularization
 Regularization effectively shrinks the parameters w_j by multiplying them by a factor slightly less than 1 (namely ( 1 - \alpha \frac{\lambda}{m} )) during each iteration of gradient descent.
 This process helps reduce overfitting, especially when dealing with many features and a small training set, leading to improved performance in linear regression tasks.
Understanding Overfitting in Logistic Regression
 Logistic regression can overfit when using high-order polynomial
features, leading to complex decision boundaries that do not
generalize well to new data.
 Regularization helps mitigate overfitting by adding a penalty term to
the cost function, which discourages large parameter values.

Implementing Regularized Logistic Regression


 The cost function for logistic regression can be modified by adding a regularization term, ( \frac{\lambda}{2m} \sum_{j=1}^{n} w_j^2 ), to the average logistic loss.
 Gradient descent is used to minimize this cost function, with an
additional term included in the update rules for the
parameters ( w_j ).
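
A hedged NumPy sketch of the regularized cost and gradient step for logistic regression; names such as lambda_ and regularized_cost are illustrative, not the lab's:

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def regularized_cost(X, y, w, b, lambda_):
        m = X.shape[0]
        f = sigmoid(X @ w + b)
        loss = -y * np.log(f) - (1 - y) * np.log(1 - f)
        # Average logistic loss plus the penalty on w (b is not regularized)
        return loss.sum() / m + (lambda_ / (2 * m)) * np.sum(w ** 2)

    def regularized_gradient_step(X, y, w, b, alpha, lambda_):
        m = X.shape[0]
        err = sigmoid(X @ w + b) - y
        dj_dw = (X.T @ err) / m + (lambda_ / m) * w   # extra (lambda/m) * w_j term
        dj_db = err.sum() / m                         # unchanged: b is not regularized
        return w - alpha * dj_dw, b - alpha * dj_db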

Optional Lab: Regularized cost and gradient descent for both linear and logistic regression

LAB ASSIGNMENT:
Logistic Regression: sigmoid function, cost function, gradient descent, evaluating logistic regression, regularized logistic regression (cost function, gradient descent)
