Chapter 8 Logistic Regression

Logistic Regression is a binary classification algorithm used for predicting dichotomous outcomes, such as determining if an email is spam or if a loan application will be approved. It models the probability of an event occurring using the logistic function, which transforms probabilities to odds and then takes the logarithm to create a model that is unbounded. The parameters of the model are estimated using maximum likelihood estimation, which maximizes the probability of the observed data.

Logistic Regression

“Machine Learning”
by Anuradha Srinivasaraghavan & Vincy Joseph
Copyright © 2019 Wiley India Pvt. Ltd. All rights reserved.
• In linear regression, the response Y is continuous.
• If the output is discrete, it is a classification problem, e.g. predicting whether a person is male or female based on height.
• Logistic Regression is a binary classification algorithm used when the response variable is dichotomous (1 or 0).
• Output: a random variable Yi that takes values 1 and 0 with probabilities pi and 1 − pi, respectively.
• Let p denote the probability that Y = 1 when X = x.
• For a linear model to describe p, the model for the probability would be

p = Pr(Y = 1 | X = x) = β0 + β1x

• Since p is a probability, it must lie between 0 and 1.
• The linear function is unbounded, and hence cannot be used to model a probability.
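The boundedness problem can be seen numerically. The sketch below (with made-up coefficients β0 = −2, β1 = 0.05, chosen purely for illustration) evaluates the linear model over a range of x; its output escapes the [0, 1] interval on both sides:

```python
# Illustration: a linear "probability" model is unbounded.
# The coefficients below are made up for demonstration only.
b0, b1 = -2.0, 0.05

for x in [0, 20, 40, 60, 80, 100]:
    p = b0 + b1 * x  # "probability" under the linear model
    print(x, p)      # values fall below 0 and rise above 1
```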
• Spam Detection: predicting whether an email is spam or not
• Credit Card Fraud: predicting whether a given credit card transaction is fraudulent or not
• Health: predicting whether a given mass of tissue is benign or malignant
• Marketing: predicting whether a given user will buy an insurance product or not
• Banking: predicting whether a customer will default on a loan
• The odds of an event is the ratio of the expected number of times that the event will occur to the expected number of times it will not occur.
• If p is the probability of an event and O is the odds of the event, then

O = p / (1 − p)

• Unlike probabilities, there is no upper bound on the odds.
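A quick sketch of the probability-to-odds mapping (plain Python, assuming nothing beyond the formula above):

```python
def odds(p):
    """Odds of an event with probability p: O = p / (1 - p)."""
    return p / (1.0 - p)

# Probabilities are bounded by 1, but the odds grow without bound as p -> 1.
print(odds(0.5))   # 1.0 (even odds)
print(odds(0.8))   # ~4.0
print(odds(0.99))  # ~99
```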
• Transforming the probability to odds removes the upper bound.
• If we then take the logarithm of the odds, we also remove the lower bound.
• Thus, we get the logistic (logit) model as

ln(p / (1 − p)) = β0 + β1x

• Solving for p gives back a value between 0 and 1:

p = e^(β0 + β1x) / (1 + e^(β0 + β1x))

• This function is the Logistic Regression Function.
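A minimal sketch of the logistic regression function, which maps the unbounded linear term back to a probability; the coefficient values passed in below are assumptions for illustration only:

```python
import math

def logistic(x, b0, b1):
    """Logistic regression function: p = e^(b0 + b1*x) / (1 + e^(b0 + b1*x))."""
    z = b0 + b1 * x
    return math.exp(z) / (1.0 + math.exp(z))

# Unlike the linear model, the output always lies strictly between 0 and 1.
for x in [-100, 0, 100]:
    print(x, logistic(x, b0=0.0, b1=0.1))
```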
• In linear regression we used the method of least squares to estimate the regression coefficients.
• In logistic regression we use another approach, called maximum likelihood estimation.
• The maximum likelihood estimate of a parameter is the value that maximizes the probability of the observed data.
• Let us consider the example of predicting whether a home loan application will be approved based on the credit score of the applicant.

• For a continuous independent variable, the odds ratio is the factor by which the odds change when the variable increases by one unit:

OR = odds(x + 1) / odds(x) = e^β1

• For the loan example, Odds Ratio = 3.337/3.289 = 1.01
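For a continuous predictor, the odds ratio for a one-unit increase works out to e^β1, the same at every x. The sketch below checks this numerically; the coefficients are made up for illustration and are not the ones behind the 3.337/3.289 figures above:

```python
import math

def odds(x, b0, b1):
    """Odds of the event at predictor value x under the logistic model:
    p / (1 - p) = e^(b0 + b1*x)."""
    return math.exp(b0 + b1 * x)

# Hypothetical coefficients for illustration only.
b0, b1 = -5.0, 0.0145

# The ratio of odds at x+1 to odds at x is the same at every x: e^b1.
for x in [600, 700, 800]:
    print(x, odds(x + 1, b0, b1) / odds(x, b0, b1))  # ~e^0.0145 ~ 1.0146
```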
• Maximum likelihood estimation is used to estimate the regression coefficients.
• The maximum likelihood estimate is the value that maximizes the probability of the observed data.
• The likelihood function is the probability that the observed values of the dependent variable may be predicted from the observed values of the independent variables.
• The likelihood varies from 0 to 1.
• It is easier to work with the logarithm of the likelihood function, known as the log-likelihood.
• In logistic regression, we observe a binary outcome.
• Suppose that in a population each individual has the same probability p that an event occurs.
• For a sample of size n, Yi = 1 indicates that the event occurs for the ith subject; otherwise Yi = 0.
• The observed data are Y1, . . . , Yn and X1, . . . , Xn.
• The joint probability of the data (the likelihood) is given by

L = ∏ pi^Yi (1 − pi)^(1 − Yi)

• The natural logarithm of the likelihood is

ln L = Σ [Yi ln pi + (1 − Yi) ln(1 − pi)]

• in which

pi = e^(α + βxi) / (1 + e^(α + βxi))
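The log-likelihood above can be computed directly. This sketch evaluates it on a tiny made-up data set (the values and coefficients are assumptions for illustration):

```python
import math

def log_likelihood(xs, ys, a, b):
    """ln L = sum_i [ y_i*ln(p_i) + (1 - y_i)*ln(1 - p_i) ],
    with p_i = e^(a + b*x_i) / (1 + e^(a + b*x_i))."""
    total = 0.0
    for x, y in zip(xs, ys):
        p = math.exp(a + b * x) / (1.0 + math.exp(a + b * x))
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return total

# Tiny made-up sample: the event tends to occur at larger x.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [0, 0, 1, 1]

# A slope that matches the data pattern yields a higher log-likelihood
# than coefficients of zero (which assign p = 0.5 to every observation).
print(log_likelihood(xs, ys, a=-2.5, b=1.0))
print(log_likelihood(xs, ys, a=0.0, b=0.0))  # 4*ln(0.5) ~ -2.77
```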
• Estimating the parameters α and β is done by taking the first derivatives of the log-likelihood and solving them for α and β.
• Since these equations have no closed-form solution, iterative computing is used.
• An arbitrary value for the coefficients (usually 0) is chosen first.
• The log-likelihood is then computed and the variation of the coefficient values is observed.
• Iteration is repeated until L is maximized.
• The results are the maximum likelihood estimates of α and β.
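The slides do not prescribe a particular update rule for the iteration; one simple choice is gradient ascent on the log-likelihood. In the sketch below, the data set, learning rate, and iteration count are all assumptions for illustration:

```python
import math

def sigmoid(z):
    """Logistic function: maps any real z to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

# Tiny made-up data set (for illustration only).
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [0, 0, 1, 0, 1, 1]

# Start the coefficients at 0 (as the slides suggest) and repeatedly step
# in the direction of the log-likelihood gradient:
#   d(lnL)/d(alpha) = sum(y_i - p_i)
#   d(lnL)/d(beta)  = sum((y_i - p_i) * x_i)
a, b = 0.0, 0.0
lr = 0.01                      # small step size keeps the ascent stable
for _ in range(50_000):
    grad_a = sum(y - sigmoid(a + b * x) for x, y in zip(xs, ys))
    grad_b = sum((y - sigmoid(a + b * x)) * x for x, y in zip(xs, ys))
    a += lr * grad_a
    b += lr * grad_b

# The fitted slope is positive: the event becomes more likely as x grows,
# and the predicted probabilities rise with x, roughly tracking the labels.
print(a, b)
print([round(sigmoid(a + b * x), 2) for x in xs])
```

In practice the maximization is usually done with Newton-type methods (iteratively reweighted least squares), which converge in far fewer steps; plain gradient ascent is shown here only because it makes the "compute, observe, reiterate" loop explicit.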
