Deep Learning Assignment 1: Logistic Regression - Solutions
Problem 1: Sigmoid Function and Basic Computations (15 Points)
Part A (5 points) - Sigmoid Values
Formula: σ(z) = 1/(1 + e^(-z))
(i) σ(2.5)
σ(2.5) = 1/(1 + e^(-2.5)) = 1/(1 + 0.082) = 1/1.082 = 0.924
(ii) σ(-1.8)
σ(-1.8) = 1/(1 + e^(1.8)) = 1/(1 + 6.050) = 1/7.050 = 0.142
(iii) σ(0)
σ(0) = 1/(1 + e^0) = 1/(1 + 1) = 1/2 = 0.5
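These values are easy to sanity-check numerically. A minimal sketch in Python (my choice of language; the assignment does not prescribe one):

```python
import math

def sigmoid(z):
    # sigma(z) = 1 / (1 + e^(-z))
    return 1.0 / (1.0 + math.exp(-z))

for z in (2.5, -1.8, 0.0):
    print(f"sigma({z}) = {sigmoid(z):.3f}")  # 0.924, 0.142, 0.500
```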
Part B (5 points) - Derivative Values
Formula: σ'(z) = σ(z)(1 - σ(z))
(i) σ'(1.2)
First: σ(1.2) = 1/(1 + e^(-1.2)) = 1/(1 + 0.301) = 1/1.301 = 0.769
Then: σ'(1.2) = 0.769 × (1 - 0.769) = 0.769 × 0.231 = 0.178
(ii) σ'(-0.5)
First: σ(-0.5) = 1/(1 + e^(0.5)) = 1/(1 + 1.649) = 1/2.649 = 0.378
Then: σ'(-0.5) = 0.378 × (1 - 0.378) = 0.378 × 0.622 = 0.235
(iii) Maximum of σ'(z)
σ'(z) = σ(z)(1 - σ(z)) is maximized when σ(z) = 0.5, since p(1 - p) is a downward-opening parabola in p with its vertex at p = 1/2
This occurs at z = 0, the only point where σ(z) = 0.5
Maximum value: σ'(0) = 0.5 × 0.5 = 0.25
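A quick numerical check of the derivative values (again a sketch, not part of the required solution):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def sigmoid_prime(z):
    s = sigmoid(z)
    return s * (1.0 - s)  # sigma'(z) = sigma(z)(1 - sigma(z))

for z in (1.2, -0.5, 0.0):
    print(f"sigma'({z}) = {sigmoid_prime(z):.3f}")  # 0.178, 0.235, 0.250
```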
Part C (5 points) - Solving for z
(i) σ(z) = 0.75
0.75 = 1/(1 + e^(-z))
0.75(1 + e^(-z)) = 1
0.75 + 0.75e^(-z) = 1
0.75e^(-z) = 0.25
e^(-z) = 1/3
-z = ln(1/3) = -ln(3) = -1.099
z = 1.099
(ii) σ(z) = 0.1
0.1 = 1/(1 + e^(-z))
0.1(1 + e^(-z)) = 1
0.1 + 0.1e^(-z) = 1
0.1e^(-z) = 0.9
e^(-z) = 9
-z = ln(9) = 2ln(3) = 2.197
z = -2.197
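Both parts of Part C invert the sigmoid; the closed form is the logit, z = ln(p/(1 - p)). A minimal check:

```python
import math

def logit(p):
    # Inverse sigmoid: solves sigma(z) = p for z.
    return math.log(p / (1.0 - p))

print(f"{logit(0.75):.3f}")  #  1.099
print(f"{logit(0.1):.3f}")   # -2.197
```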
Problem 2: Logistic Regression Forward Propagation (20 Points)
Given:
w = [1.5, 2.3, -0.8]ᵀ, b = -0.5
Part A (10 points) - Single Email
Email features: x = [0.6, 1, 0.4]ᵀ
(i) Linear combination z
z = wᵀx + b = 1.5×0.6 + 2.3×1 + (-0.8)×0.4 + (-0.5)
z = 0.9 + 2.3 - 0.32 - 0.5 = 2.38
(ii) Predicted probability ŷ
ŷ = σ(2.38) = 1/(1 + e^(-2.38)) = 1/(1 + 0.093) = 1/1.093 = 0.915
(iii) Classification with threshold 0.5
Since ŷ = 0.915 > 0.5, classify as SPAM
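The same forward pass in code (a sketch; NumPy is my choice here, not required by the assignment):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

w = np.array([1.5, 2.3, -0.8])
b = -0.5
x = np.array([0.6, 1.0, 0.4])

z = w @ x + b                 # w^T x + b
y_hat = sigmoid(z)
print(f"z = {z:.2f}, y_hat = {y_hat:.3f}")    # z = 2.38, y_hat = 0.915
print("SPAM" if y_hat > 0.5 else "NOT SPAM")  # SPAM
```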
Part B (10 points) - Three Emails
Email 1: x⁽¹⁾ = [0.2, 0, 0.8]ᵀ
z⁽¹⁾ = 1.5×0.2 + 2.3×0 + (-0.8)×0.8 + (-0.5) = 0.3 + 0 - 0.64 - 0.5 = -0.84
ŷ⁽¹⁾ = σ(-0.84) = 1/(1 + e^(0.84)) = 1/(1 + 2.316) = 0.302
Email 2: x⁽²⁾ = [1.1, 1, 0.3]ᵀ
z⁽²⁾ = 1.5×1.1 + 2.3×1 + (-0.8)×0.3 + (-0.5) = 1.65 + 2.3 - 0.24 - 0.5 = 3.21
ŷ⁽²⁾ = σ(3.21) = 1/(1 + e^(-3.21)) = 1/(1 + 0.040) = 0.961
Email 3: x⁽³⁾ = [0.0, 0, 1.2]ᵀ
z⁽³⁾ = 1.5×0 + 2.3×0 + (-0.8)×1.2 + (-0.5) = 0 + 0 - 0.96 - 0.5 = -1.46
ŷ⁽³⁾ = σ(-1.46) = 1/(1 + e^(1.46)) = 1/(1 + 4.306) = 0.188
Most likely spam: Email 2 with ŷ⁽²⁾ = 0.961
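Stacking the three emails as rows of a matrix X turns Part B into a single vectorized computation, which is how logistic regression is implemented in practice. A sketch:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

w = np.array([1.5, 2.3, -0.8])
b = -0.5
X = np.array([[0.2, 0.0, 0.8],   # email 1
              [1.1, 1.0, 0.3],   # email 2
              [0.0, 0.0, 1.2]])  # email 3

y_hat = sigmoid(X @ w + b)       # all three probabilities at once
print(np.round(y_hat, 3))        # [0.302 0.961 0.188]
print("most likely spam: email", np.argmax(y_hat) + 1)  # email 2
```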
Problem 3: Cost Function Calculation (15 Points)
Given Loss Function: L(ŷ, y) = -y log(ŷ) - (1-y) log(1-ŷ), where log denotes the natural logarithm (as in all numerical values below)
Part A (10 points) - Individual Losses
Example 1: ŷ⁽¹⁾ = 0.9, y⁽¹⁾ = 1
L⁽¹⁾ = -1×log(0.9) - 0×log(0.1) = -log(0.9) = 0.105
Example 2: ŷ⁽²⁾ = 0.2, y⁽²⁾ = 0
L⁽²⁾ = -0×log(0.2) - 1×log(0.8) = -log(0.8) = 0.223
Example 3: ŷ⁽³⁾ = 0.7, y⁽³⁾ = 1
L⁽³⁾ = -1×log(0.7) - 0×log(0.3) = -log(0.7) = 0.357
Example 4: ŷ⁽⁴⁾ = 0.4, y⁽⁴⁾ = 0
L⁽⁴⁾ = -0×log(0.4) - 1×log(0.6) = -log(0.6) = 0.511
Part B (5 points) - Average Cost
J = (1/4) × (L⁽¹⁾ + L⁽²⁾ + L⁽³⁾ + L⁽⁴⁾)
J = (1/4) × (0.105 + 0.223 + 0.357 + 0.511) = 0.299
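These four losses and their average can be computed in one vectorized expression (a sketch; np.log is the natural log, matching the values above):

```python
import numpy as np

y_hat = np.array([0.9, 0.2, 0.7, 0.4])
y = np.array([1, 0, 1, 0])

losses = -(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))
print(np.round(losses, 3))         # [0.105 0.223 0.357 0.511]
print(f"J = {losses.mean():.3f}")  # J = 0.299
```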
Problem 4: Gradient Computation and Parameter Updates (25 Points)
Given:
x = [2.1, -1.3]ᵀ, y = 1
w = [0.4, -0.7]ᵀ, b = 0.2
α = 0.3
Part A (10 points) - Forward Propagation
(i) Calculate z
z = wᵀx + b = 0.4×2.1 + (-0.7)×(-1.3) + 0.2 = 0.84 + 0.91 + 0.2 = 1.95
(ii) Compute ŷ
ŷ = σ(1.95) = 1/(1 + e^(-1.95)) = 1/(1 + 0.142) = 1/1.142 = 0.876
(iii) Calculate loss
L = -y×log(ŷ) - (1-y)×log(1-ŷ) = -1×log(0.876) - 0×log(0.124) = 0.132
Part B (10 points) - Gradients
(i) ∂L/∂z
∂L/∂z = ŷ - y = 0.876 - 1 = -0.124
(ii) ∂L/∂w₁
∂L/∂w₁ = (ŷ - y) × x₁ = (-0.124) × 2.1 = -0.260
(iii) ∂L/∂w₂
∂L/∂w₂ = (ŷ - y) × x₂ = (-0.124) × (-1.3) = 0.161
(iv) ∂L/∂b
∂L/∂b = ŷ - y = -0.124
Part C (5 points) - Parameter Updates
(i) New w₁
w₁_new = w₁ - α × (∂L/∂w₁) = 0.4 - 0.3 × (-0.260) = 0.4 + 0.078 = 0.478
(ii) New w₂
w₂_new = w₂ - α × (∂L/∂w₂) = -0.7 - 0.3 × 0.161 = -0.7 - 0.048 = -0.748
(iii) New b
b_new = b - α × (∂L/∂b) = 0.2 - 0.3 × (-0.124) = 0.2 + 0.037 = 0.237
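The whole forward-backward-update cycle of Problem 4 fits in a few lines. In the sketch below the final w₂ comes out as -0.749 rather than -0.748, because the hand solution rounds ŷ to three decimals along the way:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x, y = np.array([2.1, -1.3]), 1
w, b = np.array([0.4, -0.7]), 0.2
alpha = 0.3

y_hat = sigmoid(w @ x + b)   # forward pass
dz = y_hat - y               # dL/dz = y_hat - y
dw, db = dz * x, dz          # dL/dw, dL/db

w, b = w - alpha * dw, b - alpha * db
print(np.round(w, 3), round(b, 3))  # [ 0.478 -0.749] 0.237
```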
Problem 5: Multiple Training Examples (20 Points)
Given Data:
Example    x⁽ⁱ⁾             y⁽ⁱ⁾    ŷ⁽ⁱ⁾
1          [1.0, 0.5]ᵀ      1       0.8
2          [-0.5, 1.2]ᵀ     0       0.3
3          [0.8, -0.3]ᵀ     1       0.6
Part A (10 points) - Cost Function
(i) Individual Losses
L⁽¹⁾ = -1×log(0.8) - 0×log(0.2) = -log(0.8) = 0.223
L⁽²⁾ = -0×log(0.3) - 1×log(0.7) = -log(0.7) = 0.357
L⁽³⁾ = -1×log(0.6) - 0×log(0.4) = -log(0.6) = 0.511
(ii) Average Cost
J = (1/3) × (0.223 + 0.357 + 0.511) = 0.364
Part B (10 points) - Average Gradients
(i) ∂J/∂w₁
∂J/∂w₁ = (1/3) × [(0.8-1)×1.0 + (0.3-0)×(-0.5) + (0.6-1)×0.8]
= (1/3) × [(-0.2)×1.0 + (0.3)×(-0.5) + (-0.4)×0.8]
= (1/3) × [-0.2 - 0.15 - 0.32] = (1/3) × (-0.67) = -0.223
(ii) ∂J/∂w₂
∂J/∂w₂ = (1/3) × [(0.8-1)×0.5 + (0.3-0)×1.2 + (0.6-1)×(-0.3)]
= (1/3) × [(-0.2)×0.5 + (0.3)×1.2 + (-0.4)×(-0.3)]
= (1/3) × [-0.1 + 0.36 + 0.12] = (1/3) × 0.38 = 0.127
(iii) ∂J/∂b
∂J/∂b = (1/3) × [(0.8-1) + (0.3-0) + (0.6-1)]
= (1/3) × [-0.2 + 0.3 - 0.4] = (1/3) × (-0.3) = -0.1
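Vectorized, the averaged gradients of Part B reduce to one matrix product (a sketch; the rows of X are the examples):

```python
import numpy as np

X = np.array([[1.0, 0.5],
              [-0.5, 1.2],
              [0.8, -0.3]])
y = np.array([1, 0, 1])
y_hat = np.array([0.8, 0.3, 0.6])

dz = y_hat - y               # per-example (y_hat - y)
dw = X.T @ dz / len(y)       # averaged gradient w.r.t. w
db = dz.mean()               # averaged gradient w.r.t. b
print(np.round(dw, 3), round(db, 3))  # [-0.223  0.127] -0.1
```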
Problem 6: Complete Logistic Regression Implementation (20 Points)
Given:
Training data (x = study hours, y = pass label): (0.7, 1), (0.3, 0)
Initial: w = 0.5, b = 0
Learning rate: α = 0.4
Part A (15 points) - One Complete Iteration
For Example 1: (x⁽¹⁾, y⁽¹⁾) = (0.7, 1)
(i) Calculate z⁽¹⁾
z⁽¹⁾ = w×x⁽¹⁾ + b = 0.5×0.7 + 0 = 0.35
(ii) Compute ŷ⁽¹⁾
ŷ⁽¹⁾ = σ(0.35) = 1/(1 + e^(-0.35)) = 1/(1 + 0.705) = 1/1.705 = 0.587
(iii) Gradients for Example 1
∂L⁽¹⁾/∂w = (ŷ⁽¹⁾ - y⁽¹⁾) × x⁽¹⁾ = (0.587 - 1) × 0.7 = -0.289
∂L⁽¹⁾/∂b = ŷ⁽¹⁾ - y⁽¹⁾ = 0.587 - 1 = -0.413
For Example 2: (x⁽²⁾, y⁽²⁾) = (0.3, 0)
(i) Calculate z⁽²⁾
z⁽²⁾ = w×x⁽²⁾ + b = 0.5×0.3 + 0 = 0.15
(ii) Compute ŷ⁽²⁾
ŷ⁽²⁾ = σ(0.15) = 1/(1 + e^(-0.15)) = 1/(1 + 0.861) = 1/1.861 = 0.537
(iii) Gradients for Example 2
∂L⁽²⁾/∂w = (ŷ⁽²⁾ - y⁽²⁾) × x⁽²⁾ = (0.537 - 0) × 0.3 = 0.161
∂L⁽²⁾/∂b = ŷ⁽²⁾ - y⁽²⁾ = 0.537 - 0 = 0.537
Parameter Updates:
(i) Average Gradients
∂J/∂w = (1/2) × [(-0.289) + 0.161] = (1/2) × (-0.128) = -0.064
∂J/∂b = (1/2) × [(-0.413) + 0.537] = (1/2) × 0.124 = 0.062
(ii) Updated Parameters
w_new = w - α × (∂J/∂w) = 0.5 - 0.4 × (-0.064) = 0.5 + 0.026 = 0.526
b_new = b - α × (∂J/∂b) = 0 - 0.4 × 0.062 = -0.025
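All of Part A as runnable code (a sketch in plain Python; no framework assumed):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

data = [(0.7, 1), (0.3, 0)]   # (x, y) pairs
w, b, alpha = 0.5, 0.0, 0.4

# Accumulate per-example gradients, then average over the batch.
dw = db = 0.0
for x, y in data:
    y_hat = sigmoid(w * x + b)
    dw += (y_hat - y) * x
    db += (y_hat - y)
dw /= len(data)
db /= len(data)

w -= alpha * dw               # gradient-descent update
b -= alpha * db
print(f"w = {w:.3f}, b = {b:.3f}")  # w = 0.526, b = -0.025
```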
Part B (5 points) - Prediction
For a student with 0.5 study hours:
z = w_new × 0.5 + b_new = 0.526 × 0.5 + (-0.025) = 0.263 - 0.025 = 0.238
ŷ = σ(0.238) = 1/(1 + e^(-0.238)) = 1/(1 + 0.788) = 1/1.788 = 0.559
The probability that the student will pass is 0.559, or 55.9%.
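And the Part B prediction with the updated parameters (continuing the sketch above; w and b rounded as in the hand solution):

```python
import math

z = 0.526 * 0.5 + (-0.025)       # z = 0.238
p = 1.0 / (1.0 + math.exp(-z))
print(f"P(pass) = {p:.3f}")      # 0.559
```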