Logistic Regression
Why do we need Logistic Regression at all?
Fitting a linear regression to a binary outcome violates the assumptions of Linear Regression!
One assumption says that the residuals should be normally distributed.
With a binary outcome, the error term can only take on two values, so it is impossible for it
to have a normal distribution.
It also violates the assumption of Homoscedasticity!
Homoscedasticity describes a situation in which the variance of the error term is the
same across all values of the independent variables; with a binary outcome, that
variance changes with the predicted probability.
Logistic Regression
Odds
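The definitions behind this slide (standard ones; the formula itself did not survive extraction): the odds of an event with probability p, and the logit that logistic regression models as a linear function of the predictors.

\[
\text{odds} = \frac{p}{1-p},
\qquad
\operatorname{logit}(p) = \ln\!\left(\frac{p}{1-p}\right) = \beta_0 + \beta_1 x_1 + \dots + \beta_k x_k
\]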
Weight of Evidence (WoE) and Information Value (IV)
Weight of Evidence
The Weight of Evidence or WoE value is a widely used measure of the “strength” of a
grouping for separating good and bad risk (default). It is computed from the basic
odds ratio:
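In the standard formulation (where group i is one bin of the predictor, and %Goods_i / %Bads_i are the shares of all Goods / Bads falling into that group):

\[
\mathrm{WoE}_i = \ln\!\left(\frac{\%\,\mathrm{Goods}_i}{\%\,\mathrm{Bads}_i}\right)
\]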
Information Value (IV)
The Information Value (IV) of a predictor is a weighted sum of its WoE values
over all groups; it summarizes the predictor's overall separating power.
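Concretely, the usual definition weights each group's WoE by the gap between its Goods and Bads shares:

\[
\mathrm{IV} = \sum_i \left(\%\,\mathrm{Goods}_i - \%\,\mathrm{Bads}_i\right) \times \mathrm{WoE}_i
\]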
Weight of Evidence (WoE) and Information Value (IV)
According to Siddiqi (2006), by convention the values of the IV statistic can be interpreted as follows. If
the IV statistic is:
• Less than 0.02, then the predictor is not useful for modeling (separating the Goods from the Bads)
• 0.02 to 0.1, then the predictor has only a weak relationship to the Goods/Bads odds ratio
• 0.1 to 0.3, then the predictor has a medium-strength relationship to the Goods/Bads odds ratio
• 0.3 or higher, then the predictor has a strong relationship to the Goods/Bads odds ratio.
An IV in the 0.02 to 0.1 range, for example, indicates only a weak relationship to the binary dependent variable.
What are Dummy Variables, Design Variables, Boolean
Indicators, and Proxies?
These are all synonyms for dummy variables.
They encode categorical variables – Male/Female, High/Low bank balance, etc.
They are coded as 1 and 0, as in the table below.
Class   Class_Dummy1   Class_Dummy2
  1          1              0
  1          1              0
  1          1              0
  2          0              1
  2          0              1
  2          0              1
  3          0              0
  3          0              0
  3          0              0
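A minimal sketch of this coding in Python with pandas (the data below just mirror the table above):

```python
import pandas as pd

# Toy data matching the table above: a 3-level Class variable
df = pd.DataFrame({"Class": [1, 1, 1, 2, 2, 2, 3, 3, 3]})

# One dummy per level, then drop the Class 3 dummy so that
# Class 3 becomes the reference (all-zero) category
dummies = pd.get_dummies(df["Class"], prefix="Class_Dummy", prefix_sep="")
dummies = dummies.drop(columns="Class_Dummy3").astype(int)
print(pd.concat([df, dummies], axis=1))
```

With k levels you keep k - 1 dummies; the dropped level becomes the baseline that the coefficients are compared against.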
Results and Interpretation
Interpreting the p-values of the independent variables: a predictor with a p-value less than 0.05 (alpha)
should be retained in the model; otherwise, remove it from the model!
Analysis of Maximum Likelihood Estimates
Parameter   DF   Estimate   Standard Error   Wald Chi-Square   Pr > ChiSq
Intercept    1    -2.6516           0.6748           15.4424       <.0001
blackd       1     0.5952           0.3939            2.2827       0.1308
whitvic      1     0.2565           0.4002            0.4107       0.5216
serious      1     0.1871           0.0612            9.3342       0.0022
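A hedged sketch of how output like this could be produced in Python with statsmodels; the file name penalty.csv and the outcome column death are assumptions, not given on the slide:

```python
import pandas as pd
import statsmodels.api as sm

# Assumed file and outcome column names (hypothetical)
df = pd.read_csv("penalty.csv")

X = sm.add_constant(df[["blackd", "whitvic", "serious"]])
model = sm.Logit(df["death"], X).fit()

# Prints estimates, standard errors, z statistics, and p-values,
# analogous to the SAS table above
print(model.summary())
```

By the p-value rule above, only serious (p = 0.0022) clears the 0.05 threshold; blackd and whitvic would be candidates for removal.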
Baseline, R Square, Max-rescaled R Square, and the C Statistic
What is R square, and what does it mean in Logistic Regression?
Logistic regression has no true R square; pseudo-R-square measures (such as the
max-rescaled R square) tell you how much the goodness of fit improves over the
baseline, intercept-only model!
C statistic – based on the area under the receiver operating characteristic (ROC)
curve
Ranges from 0.5 to 1; the closer to 1, the better the model
Gini – 2 × C statistic - 1
Ranges from 0 to 1; the closer to 1, the better the model
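A small sketch computing the C statistic (area under the ROC curve) and the Gini from it with scikit-learn; the labels and probabilities below are made up:

```python
from sklearn.metrics import roc_auc_score

# Made-up observed outcomes and predicted probabilities
y_true = [0, 0, 1, 1, 0, 1]
y_prob = [0.2, 0.4, 0.7, 0.6, 0.3, 0.8]

c_stat = roc_auc_score(y_true, y_prob)  # C statistic = area under ROC curve
gini = 2 * c_stat - 1                   # Gini = 2*C - 1
print(c_stat, gini)
```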
Check for Multicollinearity!!
Check the VIF / Tolerance to detect multicollinearity
(a VIF above 10, i.e., a Tolerance below 0.1, is a common warning threshold)!!
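One way to compute VIF (and hence Tolerance = 1/VIF) in Python, assuming statsmodels; the toy predictors are made up, with x2 nearly duplicating x1 to trigger a high VIF:

```python
import pandas as pd
from statsmodels.stats.outliers_influence import variance_inflation_factor
from statsmodels.tools.tools import add_constant

# Made-up predictors; x2 is almost a copy of x1, so both get high VIFs
X = add_constant(pd.DataFrame({
    "x1": [1, 2, 3, 4, 5, 6],
    "x2": [1.1, 2.0, 2.9, 4.2, 5.1, 5.9],
    "x3": [3, 1, 4, 1, 5, 9],
}))

for i, col in enumerate(X.columns):
    # Tolerance is simply 1 / VIF
    print(col, variance_inflation_factor(X.values, i))
```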
Results and Interpretation – Classification Table
               Correct            Incorrect                        Percentages
Prob Level   Event  Non-Event   Event  Non-Event   Correct  Sensitivity  Specificity  False POS  False NEG
0.05            30         47      23          0        77        100.0         67.1       43.4        0.0
0.10            30         53      17          0        83        100.0         75.7       36.2        0.0
0.15            30         55      15          0        85        100.0         78.6       33.3        0.0
0.20            30         60      10          0        90        100.0         85.7       25.0        0.0
0.25            29         61       9          1        90         96.7         87.1       23.7        1.6
0.30            25         62       8          5        87         83.3         88.6       24.2        7.5
0.35            23         62       8          7        85         76.7         88.6       25.8       10.1
0.40            23         63       7          7        86         76.7         90.0       23.3       10.0
0.45            23         63       7          7        86         76.7         90.0       23.3       10.0
0.50            23         63       7          7        86         76.7         90.0       23.3       10.0
Higher sensitivity and specificity at a given cutoff indicate a better fit; a computation sketch follows.
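A sketch of how one row of such a table is computed at a single probability cutoff; the data are made up:

```python
import numpy as np

# Made-up observed outcomes and predicted probabilities
y_true = np.array([0, 0, 1, 1, 0, 1])
y_prob = np.array([0.2, 0.4, 0.7, 0.6, 0.3, 0.8])

cutoff = 0.5
y_hat = (y_prob >= cutoff).astype(int)  # predict an event when prob >= cutoff

sensitivity = ((y_hat == 1) & (y_true == 1)).sum() / (y_true == 1).sum()
specificity = ((y_hat == 0) & (y_true == 0)).sum() / (y_true == 0).sum()
print(sensitivity, specificity)
```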
Results and Interpretation – Predicted Probability
Obs   CURED   INTERVENTION   DURATION   _LEVEL_   pred
  1       0              0          7         1   0.42812
  2       0              0          7         1   0.42812
  3       0              0          6         1   0.43004
  4       1              0          8         1   0.42621
  5       1              1          7         1   0.71991
  6       1              0          6         1   0.43004
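Each pred value is the inverse logit of the linear predictor. A sketch with hypothetical coefficients (chosen here to roughly reproduce the pred column; the slide does not give the actual estimates):

```python
import numpy as np

# Hypothetical coefficients: intercept, INTERVENTION, DURATION
beta = np.array([-0.22, 1.23, -0.01])

# Observation 5 above: INTERVENTION = 1, DURATION = 7
x = np.array([1, 1, 7])          # leading 1 for the intercept
eta = beta @ x                   # linear predictor x'beta
pred = 1 / (1 + np.exp(-eta))    # inverse logit -> predicted probability
print(pred)                      # ~0.72, close to the 0.71991 shown above
```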
Logistic Regression – KS Statistic
KS lies between 0 and 1
The closer to 1, the better the model separates events from non-events
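A sketch of the KS statistic via the ROC curve, using the identity KS = max(TPR - FPR); the data are made up:

```python
import numpy as np
from sklearn.metrics import roc_curve

# Made-up observed outcomes and predicted probabilities
y_true = [0, 0, 1, 1, 0, 1]
y_prob = [0.2, 0.4, 0.7, 0.6, 0.3, 0.8]

# KS = maximum gap between the cumulative score distributions of
# events and non-events, i.e. max(TPR - FPR) along the ROC curve
fpr, tpr, _ = roc_curve(y_true, y_prob)
print(np.max(tpr - fpr))
```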