LOGISTIC REGRESSION
Likelihood Vs Probability
Probability vs Statistics
• In probability theory we consider some underlying process which has
some randomness or uncertainty modeled by random variables, and
we figure out what happens.
• In statistics we observe something that has happened, and try to
figure out what underlying process would explain those observations.
• The likelihood function is a fundamental concept in statistical inference.
• It indicates how likely a particular population is to produce an
observed sample.
• Probability refers to the chance of an outcome under fixed parameters, while likelihood measures how plausible parameter values are given observed data.
Likelihood Vs Probability
• Probability is simply how likely something is to happen.
• The occurrence of a discrete value y_k is expressed by the probability P(y_k).
• The distribution over all possible values of a discrete random variable y is expressed as a probability distribution.
• We assume that there is some a priori probability (or simply prior) P(y_k) that the next feature vector belongs to class k.
• P(x | y_k) is called the class likelihood and is the conditional probability that a pattern belonging to class y_k has the associated observation value x.
• The class that maximizes P(x | y_k) is called the Maximum Likelihood (ML) class.
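For instance, a minimal sketch of picking the ML class, assuming hypothetical 1-D Gaussian class-conditional densities (the means and standard deviations below are made up for illustration):

from scipy.stats import norm

# Hypothetical class-conditional densities P(x | y_k), modeled as 1-D Gaussians
class_likelihoods = {
    0: norm(loc=160, scale=5).pdf,   # class 0
    1: norm(loc=175, scale=6).pdf,   # class 1
}

def ml_class(x):
    # ML class: the k that maximizes the class likelihood P(x | y_k)
    return max(class_likelihoods, key=lambda k: class_likelihoods[k](x))

print(ml_class(171))   # prints 1: the class-1 density is higher at x = 171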
Likelihood Vs Probability
• Probability is computed from fixed, known parameters, while likelihood is evaluated from observed data.
• P(data; μ, σ) means “the probability density of observing the data with model parameters μ and σ”. It is worth noting that we can generalise this to any number of parameters and any distribution.
• On the other hand, L(μ, σ; data) means “the likelihood of the parameters μ and σ taking certain values given that we have observed a bunch of data.”
• Numerically the two are equal, L(μ, σ; data) = P(data; μ, σ), but despite this the likelihood and the probability density are fundamentally asking different questions — one is asking about the data and the other is asking about the parameter values.
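A small illustration of this symmetry, assuming a normal model (the numbers are arbitrary):

from scipy.stats import norm

x = 172.0                # one observed data point
mu, sigma = 170.0, 3.5   # candidate parameter values

# Same number, two readings:
density    = norm(mu, sigma).pdf(x)   # P(data; mu, sigma): fix parameters, ask about the data
likelihood = norm(mu, sigma).pdf(x)   # L(mu, sigma; data): fix the data, ask about the parameters

print(density, likelihood)   # identical values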
Example of Probability
• Consider a dataset containing the heights of the people of a particular country. Let’s say the mean of the data is 170 cm and the standard deviation is 3.5 cm.
• When a probability has to be calculated for any situation using this dataset, the dataset features are held constant, i.e. the mean and standard deviation of the dataset are fixed and not altered.
• Let’s say the probability of height > 170 cm has to be calculated for a random record in the dataset. That is P(height > 170; μ = 170, σ = 3.5), as sketched below:
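A minimal sketch of that calculation, assuming heights are normally distributed with the stated parameters:

from scipy.stats import norm

mu, sigma = 170.0, 3.5   # fixed characteristics of the distribution

# P(height > 170; mu = 170, sigma = 3.5): survival function = 1 - CDF
p = norm(mu, sigma).sf(170)
print(p)   # 0.5, since 170 cm is the mean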
• While calculating probability, the feature value can be varied, but the characteristics (mean and standard deviation) of the data distribution cannot be altered.
Example of Likelihood
• Likelihood calculation involves finding the best distribution, or the best characteristics of the data, given a particular feature value or situation.
• Consider exactly the same dataset example as provided above for probability; if the likelihood for height > 170 cm has to be calculated, it will be done using the information sketched below:
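A minimal sketch of that search, assuming a normal model and a small hypothetical grid of candidate parameters; here the observed event stays fixed while (μ, σ) vary:

from scipy.stats import norm

# Candidate (mean, std) pairs -- the dataset characteristics are now varied
candidates = [(168.0, 3.5), (170.0, 3.5), (172.0, 3.5), (172.0, 2.0)]

# L(mu, sigma; height > 170): probability of the observed event under each candidate
best = max(candidates, key=lambda p: norm(*p).sf(170))
print(best)   # (172.0, 2.0): the parameters that make "height > 170 cm" most likely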
• In the calculation of the likelihood, the conditional probability equation flips compared to the equation in the probability calculation.
• Here, the dataset features are varied, i.e. the mean and standard deviation of the dataset are varied to get the maximum likelihood for height > 170 cm.
• Likelihood, in very simple terms, means increasing the chances of a particular situation occurring by varying the characteristics of the dataset distribution.
Logistic Regression Implementation
Hypothesis function
• In logistic regression, we apply the sigmoid activation function to the hypothesis function of linear regression.
• So the resultant hypothesis function for logistic regression is given below:
h( x ) = sigmoid( wx + b )
Here, w is the weight vector.
x is the feature vector.
b is the bias.
sigmoid( z ) = 1 / ( 1 + e^( -z ) )
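A minimal NumPy sketch of this hypothesis (the function names are ours):

import numpy as np

def sigmoid(z):
    # sigmoid(z) = 1 / (1 + e^(-z))
    return 1.0 / (1.0 + np.exp(-z))

def h(x, w, b):
    # h(x) = sigmoid(wx + b), with w and x as vectors and b a scalar bias
    return sigmoid(np.dot(w, x) + b)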
Cost function
• The cost function of linear regression (mean squared error) can’t be used in logistic regression because it is a non-convex function of the weights.
• Optimization algorithms such as gradient descent are only guaranteed to converge to the global minimum for convex functions.
• So, the simplified cost function we use is:
J = - y log( h(x) ) - ( 1 - y ) log( 1 - h(x) ) (derived in the last class)
here, y is the real target value
h( x ) = sigmoid( wx + b )
For y = 0, J = - log( 1 - h(x) )
and for y = 1, J = - log( h(x) )
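A minimal sketch of this cost averaged over a batch of examples (the clipping against log(0) is our addition, not from the slides):

import numpy as np

def cost(y, y_hat, eps=1e-12):
    # J = -y*log(h(x)) - (1 - y)*log(1 - h(x)), averaged over the batch
    y_hat = np.clip(y_hat, eps, 1.0 - eps)   # avoid log(0)
    return np.mean(-y * np.log(y_hat) - (1.0 - y) * np.log(1.0 - y_hat))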
Gradient Descent Calculation
repeat until convergence {
    tmp_i = w_i - alpha * dw_i
    w_i = tmp_i
}
where alpha is the learning rate.
• The chain rule is used to calculate the gradients such as dw_i.
• Here, a = sigmoid( z ) and z = wx + b; applying the chain rule to J gives dz = a - y, dw_i = x_i * dz, and db = dz.
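A minimal vectorized training-loop sketch under these update rules (array shapes and the fixed iteration count are illustrative):

import numpy as np

def train(X, y, alpha=0.1, iterations=1000):
    # X: (m, n) feature matrix, y: (m,) labels in {0, 1}
    m, n = X.shape
    w, b = np.zeros(n), 0.0
    for _ in range(iterations):
        z = X @ w + b                   # z = wx + b
        a = 1.0 / (1.0 + np.exp(-z))    # a = sigmoid(z)
        dz = a - y                      # chain rule: dJ/dz = a - y
        dw = X.T @ dz / m               # dw_i averaged over the batch
        db = dz.mean()                  # db averaged over the batch
        w -= alpha * dw                 # w_i = w_i - alpha * dw_i
        b -= alpha * db
    return w, b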
Next?
• Update weights in an iterative process
• After completing all iterations, calculate the hypothesis function h( x )
Threshold classifier output h( x ) at 0.5:
If h( x ) ≥ 0.5, predict “y = 1”
If h( x ) < 0.5, predict “y = 0”
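The same thresholding as a short sketch, reusing the shapes from the training loop above:

import numpy as np

def predict(X, w, b, threshold=0.5):
    # Predict y = 1 where h(x) >= 0.5, otherwise y = 0
    a = 1.0 / (1.0 + np.exp(-(X @ w + b)))
    return (a >= threshold).astype(int)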
Logistic Regression Numerical
Example 1
• Some samples of two classes of articles, Technical (1) and Non-technical (0), are given.
• Each sample has two features:
• Time, which represents the average time required to read an article, in hours,
• Sentences, representing the number of sentences in an article.
• First, we need to train our logistic regression model.
Test sample: Time = 1.9, Sentences = 3.1, Class = ?
Example 1
• Training involves finding the optimal values of the coefficients B0, B1, and B2.
• While training, we find some values of the coefficients in one step and use those coefficients in the next step to optimize their values.
• We continue to do this until we get consistent accuracy from the model.
Example 1
• After 20 iterations, we get:
B0 = -0.1068913
B1 = 0.41444855
B2 = -0.2486209
• Thus, the decision boundary is given as:
Z = B0 + B1*X1 + B2*X2
Z = -0.1068913 + 0.41444855*Time - 0.2486209*Sentences
Example 1
• For X1 = 1.9 and X2 = 3.1, we get:
Z = -0.1068913 + 0.41444855*1.9 - 0.2486209*3.1
Z = -0.0901638
• Now, we use the sigmoid function to find the probability and thus predict the class of the given sample:
y = sigmoid( Z ) = 1 / ( 1 + e^( 0.0901638 ) ) ≈ 0.477
• Since y = 0.477 is less than 0.5, we can safely classify the given sample as class Non-technical (0).
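A quick check of this arithmetic in code, plugging in the trained coefficients:

import math

b0, b1, b2 = -0.1068913, 0.41444855, -0.2486209
time, sentences = 1.9, 3.1

z = b0 + b1 * time + b2 * sentences
y = 1.0 / (1.0 + math.exp(-z))     # sigmoid

print(round(z, 7), round(y, 3))    # -0.0901638 0.477 -> class 0 (Non-technical)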
Examples 2 & 3
• Can be seen at the following links:
• https://machinelearningmastery.com/logistic-regression-tutorial-for-machine-learning/
• https://courses.lumenlearning.com/introstats1/chapter/introduction-to-logistic-regression/