Logistic Regression

Logistic regression is used for classification problems where the response variable is categorical. It models the probability that an event occurs versus does not occur. Examples include predicting loan defaults, fraud detection, customer churn, and propensity-to-buy models. Unlike linear regression, which predicts an absolute value, logistic regression predicts a probability: a sigmoid function maps a score built from the predictor variables to a value between 0 and 1. Model parameters are estimated by maximum likelihood, that is, by choosing the coefficients under which the observed outcomes are most probable. A classification threshold can then be selected with tools such as the KS chart or the ROC curve to balance sensitivity and specificity.

Uploaded by

Saket Anand


Logistic regression model

Case studies for choice models

Choice models cater to cases where the response variable is categorical:
- Home loan / credit card / consumer loan defaults {default vs. no default}
- Fraud detection {fraud vs. no fraud}
- Customer churn analysis {churn vs. no churn}
- Propensity-to-buy models {buy vs. no buy}

Linear regression is a bad choice when response variables are categorical

- Clearly, the simplest model could be y = 1 when tumor size is greater than 5.
- In the first model one could do that by predicting y = 1 whenever y_predicted > 0.5.
- Adding a few more grey points should not result in a new model or a new line, because in reality the cut has not changed.
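The point about the shifting line can be checked numerically. The sketch below is only an illustration: the tumor sizes and labels are made-up values, and `fit_line` and `cut_point` are hypothetical helper names, not anything from the slides. It fits a straight line by least squares, finds where the line crosses 0.5, then repeats after adding a few very large tumors.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x with a single predictor."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


def cut_point(a, b, level=0.5):
    """Tumor size at which the fitted line crosses `level`."""
    return (level - a) / b


# Made-up data: tumors larger than 5 are labelled 1.
sizes = [1, 2, 3, 4, 6, 7, 8, 9]
labels = [0, 0, 0, 0, 1, 1, 1, 1]
cut_before = cut_point(*fit_line(sizes, labels))

# Add a few very large tumors. The real cut (size 5) has not changed,
# but the least-squares line tilts and its 0.5 crossing moves right.
sizes_more = sizes + [20, 25, 30]
labels_more = labels + [1, 1, 1]
cut_after = cut_point(*fit_line(sizes_more, labels_more))

print(f"cut before: {cut_before:.2f}, cut after: {cut_after:.2f}")
```

With these values the crossing moves from size 5 to a little above 6, so a tumor of size 6 that was classified correctly before would now be missed.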

General structure for choice models

Example: home loan default
- Inputs (X): Income, Debt to Income, Default on other loans, Salaried vs. Business, Expense to Income, Credit Score
- Output: Probability of default

Logistic regression model

Instead of predicting an absolute value, we predict the probability of an event, using the sigmoid function:

$P(z) = 1 / (1 + \exp(-z))$

[Figure: sigmoid curve; x-axis: Tumor Size, y-axis: Probability of Cancer]
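A minimal sketch of the sigmoid in Python, using only the standard library. The two-branch form is a design choice of this sketch (it avoids overflow in `math.exp` for large negative `z`); it is not something the slides specify.

```python
import math


def sigmoid(z):
    """Map any real score z to a probability in [0, 1]."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    # For negative z, exp(-z) could overflow, so use the equivalent form.
    ez = math.exp(z)
    return ez / (1.0 + ez)


# The curve is 0.5 at z = 0 and flattens toward 0 and 1 at the extremes.
probabilities = [sigmoid(z) for z in (-6, 0, 6)]
print(probabilities)
```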

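Error function (analogy)

- When Y = 0, the error is (p - 0) = p.
- When Y = 1, the error is (1 - p).

Both cases combine into one expression:
- Error: $p^{1-y} (1-p)^{y}$, to be minimized.
- Likelihood: $p^{y} (1-p)^{1-y}$, to be maximized. This is, roughly, MLE (maximum likelihood).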

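Estimate parameters using maximum likelihood

$\max \sum_i \left[ y_i \ln p(z_i) + (1 - y_i) \ln(1 - p(z_i)) \right]$

where $z_i = \beta^\top x_i$, with $\beta$ denoting the coefficients to be estimated.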
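The maximization can be sketched with plain gradient ascent. Everything specific here is an assumption made for illustration: a single predictor with an intercept (so z = b0 + b1*x), the learning rate, the step count, and the toy data. Real fitting routines typically use faster optimizers.

```python
import math


def sigmoid(z):
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def log_likelihood(b0, b1, xs, ys):
    """Sum of y*ln(p) + (1-y)*ln(1-p) with p = sigmoid(b0 + b1*x)."""
    total = 0.0
    for x, y in zip(xs, ys):
        p = sigmoid(b0 + b1 * x)
        p = min(max(p, 1e-12), 1.0 - 1e-12)  # guard against log(0)
        total += y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total


def fit(xs, ys, lr=0.05, steps=5000):
    """Gradient ascent on the average log-likelihood."""
    b0 = b1 = 0.0
    n = len(xs)
    for _ in range(steps):
        g0 = g1 = 0.0
        for x, y in zip(xs, ys):
            err = y - sigmoid(b0 + b1 * x)  # gradient of the log-likelihood
            g0 += err
            g1 += err * x
        b0 += lr * g0 / n
        b1 += lr * g1 / n
    return b0, b1


# Toy data with overlapping classes, so the maximum is finite.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [0, 0, 1, 0, 1, 0, 1, 1]
b0, b1 = fit(xs, ys)
ll_start = log_likelihood(0.0, 0.0, xs, ys)
ll_fitted = log_likelihood(b0, b1, xs, ys)
print(b0, b1, ll_start, ll_fitted)
```

The fitted coefficients should give a higher log-likelihood than the starting point (0, 0), where every probability is 0.5.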

Churn Model Example

Setting threshold for classification

[Figure: a threshold separating Positive from Negative predictions]

- High threshold -> high accuracy, low capture
- Low threshold -> low accuracy, high capture
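The trade-off can be illustrated with a small sketch. It assumes that "accuracy" on the slide means the share of flagged cases that are truly positive (precision) and that "capture" means the share of all positives that get flagged (recall). The scores and labels are made up, and `precision_and_capture` is a hypothetical helper name.

```python
def precision_and_capture(scores, labels, threshold):
    """Flag every case with score >= threshold and measure the result."""
    flagged = [y for s, y in zip(scores, labels) if s >= threshold]
    true_pos = sum(flagged)
    total_pos = sum(labels)
    precision = true_pos / len(flagged) if flagged else 0.0
    capture = true_pos / total_pos if total_pos else 0.0
    return precision, capture


# Made-up model scores and true outcomes (1 = positive).
scores = [0.95, 0.90, 0.80, 0.70, 0.60, 0.40, 0.30, 0.20, 0.10, 0.05]
labels = [1, 1, 1, 0, 1, 0, 1, 0, 0, 0]

high = precision_and_capture(scores, labels, 0.75)
low = precision_and_capture(scores, labels, 0.25)
print("high threshold:", high)
print("low threshold:", low)
```

On this data the high threshold flags only true positives but misses two of the five, while the low threshold finds all five at the cost of two false alarms.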

Picking a threshold: KS chart
- Divide the population into deciles.
- Take the upper limit of all deciles and plot the cumulative percentage of good and bad examples.
- Pick the score/threshold of the decile where the separation between good and bad is the maximum.
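The three steps above can be sketched as follows. This is one possible reading, not a reference implementation: cases are sorted by ascending score, label 1 is taken to mean "bad", and the data and the name `ks_by_decile` are invented for the example.

```python
def ks_by_decile(scores, labels, n_bins=10):
    """Return (max separation, score at the upper limit of that decile).

    labels: 1 = bad, 0 = good (an assumption of this sketch).
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0])
    n = len(ranked)
    total_bad = sum(y for _, y in ranked)
    total_good = n - total_bad
    cum_bad = cum_good = 0
    best_gap, best_threshold = -1.0, None
    for b in range(n_bins):
        chunk = ranked[b * n // n_bins:(b + 1) * n // n_bins]
        if not chunk:
            continue
        bad_here = sum(y for _, y in chunk)
        cum_bad += bad_here
        cum_good += len(chunk) - bad_here
        # Separation between the cumulative shares of good and bad.
        gap = abs(cum_good / total_good - cum_bad / total_bad)
        if gap > best_gap:
            best_gap = gap
            best_threshold = chunk[-1][0]  # upper limit of this decile
    return best_gap, best_threshold


# Made-up scores (0.05 to 1.00) with bad cases concentrated at the top.
scores = [i / 20 for i in range(1, 21)]
labels = [0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 1, 1, 0, 1, 1, 1, 1, 1, 1, 1]
ks, threshold = ks_by_decile(scores, labels)
print(ks, threshold)
```

Cases scoring above the returned threshold would then be classified as bad.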

Truth table to measure accuracy

True Negative Rate = True Negative / Total Actual False (specificity)
True Positive Rate = True Positive / Total Actual True (sensitivity)

                    Actual True       Actual False
Predicted True      True Positive     False Positive
Predicted False     False Negative    True Negative
Predicted

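Max sensitivity and specificity

Raising the threshold lowers sensitivity and raises specificity, so the two cannot both reach their maximum at once. Choose the threshold at which sensitivity and specificity are jointly highest.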
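One common way to make "jointly highest" concrete is to maximize sensitivity + specificity - 1, known as Youden's J statistic. The slides do not name a criterion, so this choice is an assumption of the sketch, as are the data and the name `best_threshold`.

```python
def best_threshold(scores, labels):
    """Scan candidate thresholds and return (threshold, J) with the largest J.

    A case is predicted positive when its score >= threshold.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    best_t, best_j = None, -1.0
    for t in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 1)
        tn = sum(1 for s, y in zip(scores, labels) if s < t and y == 0)
        j = tp / positives + tn / negatives - 1.0  # sens + spec - 1
        if j > best_j:
            best_t, best_j = t, j
    return best_t, best_j


# Made-up scores and outcomes with some overlap between the classes.
scores = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
labels = [0, 0, 0, 1, 0, 1, 0, 1, 1]
threshold, j_value = best_threshold(scores, labels)
print(threshold, j_value)
```

Other criteria, such as the point where the sensitivity and specificity curves cross, can give a different threshold on the same data.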

Goodness of fit: ROC curve

- The dotted line represents the case where the model has not learnt anything, i.e. it picks the same percentage of false positives and true positives.
- The area under the blue curve therefore represents the goodness of fit (0.5 < Area < 1).
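The area under the ROC curve equals the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative one, with ties counted as half. The sketch below uses that pairwise definition rather than integrating the curve; the data and the name `roc_auc` are made up for illustration, and the double loop is only suitable for small samples.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve via pairwise comparison of scores."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half
    return wins / (len(pos) * len(neg))


labels = [0, 0, 0, 1, 0, 1, 0, 1, 1]

# A model with some discriminating power.
auc_model = roc_auc([0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9], labels)

# A model that has learnt nothing: every case gets the same score.
auc_constant = roc_auc([0.5] * len(labels), labels)

print(auc_model, auc_constant)
```

The constant-score model lands on 0.5, which corresponds to the dotted line.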
