
Military Institute of Science and Technology

Department of Electrical, Electronic & Communication Engineering

Course Code: 408


Course: Artificial Intelligence & Machine Learning Laboratory

Exp. No.-2

Name of the Exp.: A Theoretical Deep Dive into Logistic Regression.

Introduction
In the field of machine learning, supervised learning is a fundamental category where an algorithm
learns from a labeled dataset. This means each data point is tagged with a correct output, or "label."
The goal is to learn a mapping function that can predict the output for new, unseen data. Supervised
learning problems are primarily divided into two types: regression, which predicts continuous numerical
values, and classification, which assigns data to discrete categories. This experiment provides a detailed
theoretical exploration of Logistic Regression, a cornerstone algorithm for solving binary classification
problems, where the goal is to determine which of two groups a data point belongs to.

Objective of the Experiment


1. To understand the fundamental concepts of supervised machine learning for classification tasks.
2. To learn the detailed theory behind the Logistic Regression model, including its mathematical
formulation.
3. To deeply understand the role and properties of the sigmoid activation function.

4. To perform a technical analysis of the Log-Loss (Binary Cross-Entropy) cost function.


5. To learn how to interpret the coefficients of logistic regression models with continuous features, binary features, and multiple features.
6. To understand the optimization process for finding the model’s parameters.

Theory
1. The Logistic Regression Model
Despite its name, Logistic Regression is a supervised learning algorithm used for classification. It works
by predicting the probability that an observation falls into a particular class based on its features. While
it can be extended for more than two categories, its primary use is for binary classification.
The model’s operation is a two-step process. First, it calculates a linear predictor, denoted as z,
which is a weighted sum of the input features. This is mathematically identical to the equation used in
linear regression:
z = β̂0 + β̂1x1 + β̂2x2 + ... + β̂kxk
Second, this linear predictor z is transformed by the sigmoid function to produce an output between
0 and 1, which can be interpreted as a probability. The final equation gives the probability that the
outcome y belongs to class 1, given the input features X:
P(y = 1 | X) = sigmoid(z) = 1 / (1 + e^(−z))
This resulting probability is then compared to a classification threshold (typically 0.5) to assign the
observation to a class. If the probability is higher than the threshold, the model predicts class 1;
otherwise, it predicts class 0.
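As a minimal numerical sketch of this two-step process (the coefficient values and feature vector below are made up purely for illustration):

import numpy as np

# illustrative (made-up) coefficients: intercept beta_0 and weights beta_1, beta_2
beta = np.array([-1.0, 0.8, 0.3])
# one observation, with a leading 1 so the intercept is handled by the dot product
x = np.array([1.0, 2.5, -0.4])

# Step 1: linear predictor z = beta_0 + beta_1*x1 + beta_2*x2
z = np.dot(beta, x)

# Step 2: squash z through the sigmoid to obtain a probability
p = 1.0 / (1.0 + np.exp(-z))

# apply the 0.5 classification threshold
predicted_class = 1 if p > 0.5 else 0
print(f"z = {z:.3f}, P(y=1|X) = {p:.3f}, predicted class = {predicted_class}")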

2. Activation Function: The Sigmoid


The sigmoid function is the characteristic activation function of logistic regression. It is responsible for
converting the unbounded output of the linear predictor, z, into a bounded probability. The formula for
the sigmoid function is:
f(x) = 1 / (1 + e^(−x))
Key properties of the sigmoid function include:
• Output Range: It squashes any real-valued input into a range between 0 and 1, which is essential
for interpreting the output as a probability.
• Asymptotic Behavior: As the input x approaches positive infinity, the output f (x) approaches
1. Conversely, as x approaches negative infinity, f (x) approaches 0.
• Monotonicity: The function is strictly increasing, meaning a higher input value z always results in a higher predicted probability.
While other activation functions like ReLU (f(x) = max(0, x)) and Tanh (f(x) = (e^x − e^(−x)) / (e^x + e^(−x))) are common in more complex models like neural networks, the sigmoid function is fundamental to standard logistic regression.
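These properties can be checked numerically with a short sketch (the sample inputs are arbitrary):

import numpy as np

def sigmoid(x):
    # f(x) = 1 / (1 + e^(-x))
    return 1.0 / (1.0 + np.exp(-x))

z = np.array([-20.0, -2.0, 0.0, 2.0, 20.0])
p = sigmoid(z)

print(p)                        # every output lies strictly between 0 and 1
print(p[0], p[-1])              # close to 0 and close to 1 at the extremes
print(np.all(np.diff(p) > 0))   # True: monotonically increasing on these inputs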

3. The Cost Function: Log-Loss (Binary Cross-Entropy)


To train the model, we need a way to measure how well it is performing. This is done using a cost
function (or loss function). For logistic regression, the appropriate cost function is called Log-Loss or
Binary Cross-Entropy. The goal is to find the model parameters (βi ) that minimize this function.
The combined Log-Loss function for a single prediction is:

Loss = −[y log(p̂) + (1 − y) log(1 − p̂)]

Where y is the true label (0 or 1) and p̂ is the predicted probability. Let’s analyze its two parts:

• Case 1: True Class is 1 (y = 1): The loss function simplifies to Loss = − log(p̂).
– If the model correctly predicts a probability p̂ close to 1, the loss, − log(p̂), approaches 0.
– If the model incorrectly predicts a probability p̂ close to 0, the loss, − log(p̂), approaches
infinity. This heavily penalizes confident wrong predictions.

• Case 2: True Class is 0 (y = 0): The loss function simplifies to Loss = − log(1 − p̂).
– If the model correctly predicts a probability p̂ close to 0, the term (1 − p̂) is close to 1, and
the loss, − log(1 − p̂), approaches 0.
– If the model incorrectly predicts a probability p̂ close to 1, the term (1 − p̂) is close to 0, and
the loss approaches infinity.

The total cost over all n samples in the dataset is the sum of the individual losses:

Log-Loss = − Σ (i = 1 to n) [yi log(p̂i) + (1 − yi) log(1 − p̂i)]

Minimizing this Log-Loss function is equivalent to maximizing the Log-Likelihood of the parameters, a
concept from statistical estimation.
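As a small hedged sketch, the per-sample and total Log-Loss can be computed directly and checked against scikit-learn's log_loss (the labels and probabilities below are invented toy values):

import numpy as np
from sklearn.metrics import log_loss

# toy true labels and predicted probabilities (made up for illustration)
y_true = np.array([1, 0, 1, 1, 0])
p_hat  = np.array([0.9, 0.2, 0.6, 0.4, 0.1])

# per-sample loss: -[y*log(p_hat) + (1 - y)*log(1 - p_hat)]
per_sample = -(y_true * np.log(p_hat) + (1 - y_true) * np.log(1 - p_hat))
total = per_sample.sum()

print(per_sample.round(3))
print("total Log-Loss:", round(total, 3))
# sklearn's log_loss with normalize=False returns the same sum
print("sklearn:", round(log_loss(y_true, p_hat, normalize=False), 3))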

4. Model Training and Optimization


The process of finding the optimal coefficients (β̂0, β̂1, ..., β̂k) that minimize the Log-Loss function is called
training or optimization. The two primary methods are Gradient Descent and Maximum Likelihood
Estimation (MLE).
• Gradient Descent: This is a common iterative optimization algorithm used in machine learning.
The process involves:
1. Selecting initial values for the parameters.
2. Calculating the gradient of the Log-Loss cost function with respect to each parameter. The
gradient is a vector that points in the direction of the steepest increase of the function.
3. Updating the parameters by taking a small step in the direction opposite to the gradient.
This step size is controlled by a value called the learning rate.
4. Repeating this process until the parameters converge to values that minimize the cost function,
resulting in a sigmoid curve that best fits the data.
• Maximum Likelihood Estimation (MLE): This is a statistical method that aims to find the
parameter values that maximize the likelihood of observing the actual data. For logistic regres-
sion, minimizing the Log-Loss is equivalent to maximizing the Log-Likelihood, making these two
approaches two sides of the same coin.
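The following is a minimal gradient-descent sketch on a small synthetic dataset; it is not the optimizer used by scikit-learn or statsmodels, but it illustrates steps 1 to 4 above (the data, learning rate, and iteration count are assumptions for demonstration):

import numpy as np

rng = np.random.default_rng(0)

# synthetic dataset: 200 samples, 2 features, labels generated from a simple rule
X = rng.normal(size=(200, 2))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(float)

# prepend a column of ones so beta[0] plays the role of the intercept
Xb = np.hstack([np.ones((X.shape[0], 1)), X])

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

beta = np.zeros(Xb.shape[1])    # step 1: initial parameter values
learning_rate = 0.1

for _ in range(1000):
    p = sigmoid(Xb @ beta)                  # current predicted probabilities
    gradient = Xb.T @ (p - y) / len(y)      # step 2: gradient of the (mean) Log-Loss
    beta = beta - learning_rate * gradient  # step 3: step opposite to the gradient

print("estimated coefficients:", beta.round(3))  # step 4: approximately converged values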

5. Interpreting Model Coefficients


The coefficients (β) in a logistic regression model have a specific and powerful interpretation related to
the "log-odds." The odds are the ratio of the probability of an event occurring to the probability of it
not occurring (p/(1 − p)).

Model with One Continuous Feature


Consider a model predicting a sunny day based on temperature:

P(Day = Sunny | Temperature) = 1 / (1 + e^(−(β̂0 + β̂1·Temperature)))

• Intercept (β̂0): This is the log-odds of a sunny day when the temperature is 0 degrees. The probability at 0 degrees can be calculated as p = e^(β̂0) / (1 + e^(β̂0)).

• Weight (β̂1 ): This is the change in the log-odds for a one-unit increase in the feature (e.g., a
1-degree increase in temperature). The exponentiated coefficient, eβ̂1 , is the odds ratio. It tells
us how the odds of the outcome are multiplied for every one-unit increase in the feature. For
example, if β̂1 = 0.7, then e0.7 ≈ 2.01. This means that for each additional degree of temperature,
the odds of it being a sunny day are multiplied by 2.01 (i.e., they approximately double).
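A quick numerical check of this odds-ratio interpretation, using the β̂1 = 0.7 example (the intercept value is an arbitrary assumption):

import numpy as np

beta_0, beta_1 = -2.0, 0.7   # beta_0 is an arbitrary illustrative intercept

def prob_sunny(temp):
    return 1.0 / (1.0 + np.exp(-(beta_0 + beta_1 * temp)))

def odds(p):
    return p / (1.0 - p)

# the odds ratio for a one-degree increase equals e^beta_1 regardless of the temperature
print(odds(prob_sunny(11)) / odds(prob_sunny(10)))   # ~2.01
print(np.exp(beta_1))                                # ~2.01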

Model with One Binary Feature


Consider a model predicting a sunny day based on whether it is foggy (where Foggy = 1 if true, 0 if false):

P(Day = Sunny | Foggy) = 1 / (1 + e^(−(β̂0 + β̂1·Foggy)))

• Intercept (β̂0 ): This is the log-odds of a sunny day when the feature is 0 (i.e., when it is not
foggy).
• Weight (β̂1 ): This is the change in the log-odds when the day is foggy relative to when it is not
foggy. The odds ratio, eβ̂1 , indicates how the odds of a sunny day change if it is foggy. For example,
if β̂1 = −0.7, then e−0.7 ≈ 0.50. This means the odds of it being a sunny day are halved if it is
foggy compared to if it is not.

Multivariate Logistic Regression Model
Typically, a model will include multiple features. Example:

P(Day = Sunny | Temp, Foggy) = 1 / (1 + e^(−(β̂0 + β̂1·Temp + β̂2·Foggy)))

• Intercept (β̂0 ): This represents the log-odds of the outcome when all predictor variables are zero
(e.g., a non-foggy day with a temperature of 0 degrees).

• Weights (β̂1 , β̂2 , ...): Each coefficient, say β̂j , is the change in the log-odds for a one-unit change
in its corresponding feature xj , holding all other features constant. For example, β̂1 is the change
in the log-odds of a sunny day for a one-unit change in temperature, assuming the foggy/not-foggy
status does not change.
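To make the "holding all other features constant" interpretation concrete, here is a small sketch that fits a two-feature model with the statsmodels formula API (imported as smf in the Codes section below) on an invented weather-like dataset and exponentiates the fitted coefficients into odds ratios; the data, column names, and "true" coefficients are all assumptions for illustration:

import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(1)

# invented weather-like data: temperature in degrees and a 0/1 fog indicator
n = 500
weather = pd.DataFrame({
    "temp": rng.normal(25, 5, n),
    "foggy": rng.integers(0, 2, n),
})

# generate sunny/not-sunny labels from an assumed "true" model
true_logit = -6 + 0.25 * weather["temp"] - 0.7 * weather["foggy"]
weather["sunny"] = (rng.random(n) < 1 / (1 + np.exp(-true_logit))).astype(int)

# fit the multivariate logistic regression: sunny ~ temp + foggy
model = smf.logit("sunny ~ temp + foggy", data=weather).fit(disp=0)

print(model.params)          # estimated log-odds coefficients (intercept and weights)
print(np.exp(model.params))  # odds ratios: multiplicative change in the odds per one-unit increase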

Codes
# import dependencies
# data cleaning and manipulation
import pandas as pd
import numpy as np

# data visualization
import matplotlib.pyplot as plt
import seaborn as sns

# machine learning
from sklearn.preprocessing import StandardScaler

import sklearn.linear_model as skl_lm
from sklearn import preprocessing
from sklearn import neighbors
from sklearn.metrics import confusion_matrix, classification_report, precision_score
from sklearn.model_selection import train_test_split

import statsmodels.api as sm
import statsmodels.formula.api as smf

# initialize some package settings
sns.set(style="whitegrid", color_codes=True, font_scale=1.3)

%matplotlib inline

# read in the data and check the first 5 rows
df = pd.read_csv('../input/data.csv', index_col=0)
df.head()

# general summary of the dataframe
df.info()

# remove the 'Unnamed: 32' column
df = df.drop('Unnamed: 32', axis=1)

# check the data type of each column
df.dtypes

# visualize distribution of classes
plt.figure(figsize=(8, 4))
sns.countplot(x='diagnosis', data=df, palette='RdBu')

# count number of observations in each class
benign, malignant = df['diagnosis'].value_counts()
print('Number of cells labeled Benign: ', benign)
print('Number of cells labeled Malignant: ', malignant)
print('')
print('% of cells labeled Benign', round(benign / len(df) * 100, 2), '%')
print('% of cells labeled Malignant', round(malignant / len(df) * 100, 2), '%')

# generate a scatter plot matrix with the "mean" columns
cols = ['diagnosis',
        'radius_mean',
        'texture_mean',
        'perimeter_mean',
        'area_mean',
        'smoothness_mean',
        'compactness_mean',
        'concavity_mean',
        'concave points_mean',
        'symmetry_mean',
        'fractal_dimension_mean']

sns.pairplot(data=df[cols], hue='diagnosis', palette='RdBu')

# generate a correlation matrix (numeric_only excludes the 'diagnosis' text column)
corr = df.corr(numeric_only=True).round(2)

# mask for the upper triangle
mask = np.zeros_like(corr, dtype=bool)
mask[np.triu_indices_from(mask)] = True

# set up the figure
f, ax = plt.subplots(figsize=(20, 20))

# generate a custom diverging colormap
cmap = sns.diverging_palette(220, 10, as_cmap=True)

# draw the heatmap
sns.heatmap(corr, mask=mask, cmap=cmap, vmin=-1, vmax=1, center=0,
            square=True, linewidths=.5, cbar_kws={"shrink": .5}, annot=True)

plt.tight_layout()

# create a new dataframe with the "mean" columns
df_mean = df[['diagnosis', 'radius_mean', 'texture_mean', 'perimeter_mean', 'area_mean',
              'smoothness_mean', 'compactness_mean', 'concavity_mean', 'concave points_mean',
              'symmetry_mean', 'fractal_dimension_mean']]

# create a new dataframe with the "se" columns
df_se = df[['diagnosis', 'radius_se', 'texture_se', 'perimeter_se', 'area_se', 'smoothness_se',
            'compactness_se', 'concavity_se', 'concave points_se', 'symmetry_se',
            'fractal_dimension_se']]

# create a new dataframe with the "worst" columns
df_worst = df[['diagnosis', 'radius_worst', 'texture_worst', 'perimeter_worst', 'area_worst',
               'smoothness_worst', 'compactness_worst', 'concavity_worst', 'concave points_worst',
               'symmetry_worst', 'fractal_dimension_worst']]

# create a correlation matrix for the "mean" columns
corr_mean = df_mean.corr(numeric_only=True).round(2)

# mask for the upper triangle
mask_mean = np.zeros_like(corr_mean, dtype=bool)
mask_mean[np.triu_indices_from(mask_mean)] = True
# set up the figure
f, ax = plt.subplots(figsize=(10, 10))

# generate a custom diverging colormap
cmap = sns.diverging_palette(220, 10, as_cmap=True)

# draw the heatmap
sns.heatmap(corr_mean, mask=mask_mean, cmap=cmap, vmin=-1, vmax=1, center=0,
            square=True, linewidths=.5, cbar_kws={"shrink": .5}, annot=True)

plt.tight_layout()

# create a correlation matrix for the "se" columns
corr_se = df_se.corr(numeric_only=True).round(2)

# mask for the upper triangle
mask_se = np.zeros_like(corr_se, dtype=bool)
mask_se[np.triu_indices_from(mask_se)] = True

# set up the figure
f, ax = plt.subplots(figsize=(10, 10))

# generate a custom diverging colormap
cmap = sns.diverging_palette(220, 10, as_cmap=True)

# draw the heatmap
sns.heatmap(corr_se, mask=mask_se, cmap=cmap, vmin=-1, vmax=1, center=0,
            square=True, linewidths=.5, cbar_kws={"shrink": .5}, annot=True)

plt.tight_layout()

# create a correlation matrix for the "worst" columns
corr_worst = df_worst.corr(numeric_only=True).round(2)

# mask for the upper triangle
mask_worst = np.zeros_like(corr_worst, dtype=bool)
mask_worst[np.triu_indices_from(mask_worst)] = True

# set up the figure
f, ax = plt.subplots(figsize=(10, 10))

# generate a custom diverging colormap
cmap = sns.diverging_palette(220, 10, as_cmap=True)

# draw the heatmap
sns.heatmap(corr_worst, mask=mask_worst, cmap=cmap, vmin=-1, vmax=1, center=0,
            square=True, linewidths=.5, cbar_kws={"shrink": .5}, annot=True)

plt.tight_layout()

# create X and y
X = df.drop('diagnosis', axis=1)
y = df['diagnosis']

# create a LabelEncoder object
le = preprocessing.LabelEncoder()

# fit and transform the y variable
y = le.fit_transform(y)

# create training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# create a StandardScaler object
scaler = StandardScaler()

# fit and transform the training data
X_train_scaled = scaler.fit_transform(X_train)

# transform the testing data
X_test_scaled = scaler.transform(X_test)

# create a logistic regression model
log_reg = skl_lm.LogisticRegression()

# fit the model
log_reg.fit(X_train_scaled, y_train)

# make predictions
y_pred = log_reg.predict(X_test_scaled)

# print the accuracy score
print("Accuracy: ", log_reg.score(X_test_scaled, y_test))

# print the confusion matrix
print(confusion_matrix(y_test, y_pred))

# print the classification report
print(classification_report(y_test, y_pred))

# plot the ROC curve
from sklearn.metrics import roc_curve

# get the predicted probabilities
y_pred_prob = log_reg.predict_proba(X_test_scaled)[:, 1]

# get the fpr, tpr, and thresholds
fpr, tpr, thresholds = roc_curve(y_test, y_pred_prob)

# plot the ROC curve
plt.plot(fpr, tpr)
plt.xlim([0.0, 1.0])
plt.ylim([0.0, 1.0])
plt.title('ROC curve for breast cancer classifier')
plt.xlabel('False Positive Rate (1 - Specificity)')
plt.ylabel('True Positive Rate (Sensitivity)')
plt.grid(True)

# create a logistic regression model with statsmodels
log_reg2 = sm.Logit(y_train, X_train_scaled)

# fit the model
log_reg2_res = log_reg2.fit()

# print the summary
print(log_reg2_res.summary())
