Sigmoid Neuron
Parveen Khurana
Jan 3 · 13 min read
This article covers the content discussed in the Sigmoid Neuron module of
the Deep Learning course and all the images are taken from the same
module.
In this article, we discuss the 6 jars of Machine Learning with respect to the Sigmoid model, but before that, let's look at a drawback of the Perceptron model.
Sigmoid Model and a drawback of the Perceptron Model:
The limitation of the perceptron model is that it has a harsh function (boundary) separating the classes on the two sides, as depicted below:
We would like to have a smoother transition curve, which is closer to the way humans make decisions in the sense that something does not change drastically; it slowly changes over a range of values. So, we would like to have something like the S-shaped function (shown in red in the below image).
In Deep Learning, we have the Sigmoid family of functions, many of which are S-shaped. One such function is the logistic function (a smooth, continuous function), and it is defined by the equation below:

y = 1 / (1 + e^(-(wx + b)))
So, we will now approximate the relationship between the input x (which could be n-dimensional) and the output y using this logistic (sigmoid) function. This function has some parameters, and we will try to learn those parameters from the data in such a way that the loss is minimized.
Now, to visualize this function, we can take some values of x and y and plot them to see what the function looks like. For example, in the below case, we are plotting 'wx + b' on the x-axis and the 'y' value on the y-axis.
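To make this concrete, here is a minimal Python sketch of how we could compute and plot the logistic function (the particular values of w and b are illustrative, not taken from the course figures):

```python
import numpy as np
import matplotlib.pyplot as plt

def sigmoid(z):
    """Logistic function: maps any real number z into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

# Illustrative parameter values (not from the course figures)
w, b = 1.0, 0.0

x = np.linspace(-10, 10, 200)
z = w * x + b
plt.plot(z, sigmoid(z))   # 'wx + b' on the x-axis, y on the y-axis
plt.xlabel("wx + b")
plt.ylabel("y")
plt.show()
```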
If 'wx + b' is 0, then the equation for y reduces to:

y = 1 / (1 + e^0) = 1 / (1 + 1) = 0.5
Let's try some other values: if 'wx + b' is large and positive, then e^(-(wx + b)) approaches 0, and y, in this case, would be close to 1; if 'wx + b' is large and negative, y would be close to 0.
Having plotted some points of the function, we can get its general trend. So, we can visualize the function to see what it looks like and how it varies with respect to the input.

This is clearly a smoother function as opposed to the if-else condition that we have in the Perceptron case.
For the 2-input case, the function equation would be:

y = 1 / (1 + e^(-(w1x1 + w2x2 + b)))
And if we plot it, it would look like:
To understand the plot better, we try to look at it from the top:
The dark red region (circled) in the above image is the region where the output value is close to 0; there, w1x1 + w2x2 + b is large and negative, so the denominator 1 + e^(-(w1x1 + w2x2 + b)) becomes very large and the output approaches 0.
The green region corresponds to outputs close to 1, and the middle region (orange) corresponds to outputs around 0.5.
If we have more than 2 inputs, then we would write our equation as:

y = 1 / (1 + e^(-(w1x1 + w2x2 + … + wnxn + b)))

And the summation w1x1 + w2x2 + … + wnxn is the same as the dot product of the two vectors w and x.
The output is going to be a scalar value between 0 and 1 no matter how
many inputs we have.
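As a small sketch (the input and weight values here are made up for illustration), the n-input sigmoid neuron can be written with a dot product:

```python
import numpy as np

def sigmoid_neuron(x, w, b):
    """Sigmoid neuron: the weighted sum over all n inputs is the dot product w . x."""
    return 1.0 / (1.0 + np.exp(-(np.dot(w, x) + b)))

# Illustrative 3-input example; the output is a single scalar in (0, 1)
x = np.array([2.5, 8.0, 1.0])
w = np.array([0.4, -0.3, 0.1])
print(sigmoid_neuron(x, w, b=0.5))
```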
Let's consider the 2-D case, where we have the equation:

y = 1 / (1 + e^(-(w1x1 + w2x2 + b)))
Let the value of w1 be 0.2, w2 be -0.2 and b be equal to -8.
The output of the sigmoid function would be equal to 0.5 when the quantity below is 0, as only then (since e^0 = 1) would the overall denominator be 2:

w1x1 + w2x2 + b = 0

Putting in the values of w1, w2, and b:

0.2x1 - 0.2x2 - 8 = 0

which is the same as:

x1 - x2 = 40

So, for this 2D case, whenever the difference of the two input values is 40, the sum w1x1 + w2x2 + b would be 0 and, in effect, the value of y would be 0.5. And this is how we can go about plotting this function.
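We can verify this numerically; a quick sketch using the same w1, w2, and b values as above:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

w1, w2, b = 0.2, -0.2, -8.0

# Any pair with x1 - x2 = 40 should land exactly on the y = 0.5 boundary
for x1, x2 in [(40, 0), (50, 10), (100, 60)]:
    z = w1 * x1 + w2 * x2 + b
    print(x1, x2, z, sigmoid(z))   # z is 0, so sigmoid(z) is 0.5
```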
How does the model help when the data is not linearly separable?
Let’s consider the below case where we have two inputs: Salary in LPA and
Family Size and based on these inputs, we are going to make a decision
whether that person is going to buy a car or not. We are assuming that there is some relation between the inputs (x1 and x2) and the output y. We don't know the true relation between the input and the output, so we approximate this relation using the sigmoid (logistic) function.
This is a yes/no decision-making process, and the sigmoid function also gives an output between 0 and 1.
If we plot out all the data:
Red points are the points for which the output is 0, and the green points are the ones for which the output is 1. It is clear that no matter how we draw a line, we would not be able to separate the red points from the green points. If we train a perceptron model on this data, it will surely not converge, but we can train it in a way where we are okay with the number of errors it makes (i.e., the misclassification of some points).

And if we plot the perceptron's linear boundary for the above data, we have:
In the above image, in the red region we largely have the red points, and in the green region we largely have the green points, but of course there is some error on both sides.

The important thing to note is that the perceptron does not make any distinction between the two circled points in the below image:
The point circled in yellow in the above image is way inside the decision boundary. For this point, a human decision-maker would be very confident that a person with an annual income of 2.5 Lakhs and a family size of 8 will not buy a car. For the point circled in pink, on the other hand, we would be slightly unsure whether this person may or may not buy a car. But the Perceptron decision boundary is very firm: the model is equally confident for both points (yellow and pink) that the person is not going to buy a car, even though there is a difference between the two; one is near the boundary, almost sitting on the fence, whereas the other is way inside the boundary. The Perceptron decision surface, or the perceptron output, is not able to make these distinctions because the output is either 1 or 0; it is not a smooth number between 0 and 1.
Now let's see what the scenario would be if we try to fit the data using the Sigmoid function:
We will look at the data and, using some learning algorithm and some loss function, find the parameters of the model/function.
If we try to fit the data using Sigmoid, we would get the below kind of plot:
And the equivalent 2D plot would look like:
If we look at the circled point in the above image, for a person with an annual income of 2.5 Lakhs and a family size of 8, the output is close to 0 (as the point lies in the dark red region, for which the output is 0 or close to 0).
And for the pink circled input point in the below image
the output would be close to 0.3 or 0.4, which means the model is not very confident: it is not clearly 1 and not clearly 0, but leaning towards the lower side, as in there is maybe a 30% chance that this person might buy a car. That's the interesting thing about the Sigmoid function: its output lies between 0 and 1, and another quantity of interest that we care about, probability, also lies between 0 and 1. So, we can actually interpret the output of the Sigmoid Neuron as a probability. When the output is 0, we can say there is a 0% chance of this person buying a car, and when the output of the Sigmoid is 1, we can say there is a 100% chance of this person buying a car, and so on.
So, we now have this nice way of interpreting the output. Rather than being very rigid, saying 0 here and 1 there, we can also account for the fence-sitters and say that this person is leaning towards the positive side but not completely at 1. This is how we can interpret the output of the sigmoid function.
We are still not able to separate the green points from the red points, but the non-linearity that we have introduced gives us a graded output, which allows a better interpretation in terms of probability.
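To illustrate this contrast in code (the parameters and points below are hypothetical, chosen only to show the behaviour), compare the perceptron's hard output with the sigmoid's graded output for a point deep inside a region versus a fence-sitter:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def perceptron(z):
    return 1 if z >= 0 else 0

# Hypothetical parameters and points, chosen only to illustrate the contrast
w, b = np.array([0.5, -0.8]), 2.0
deep_inside  = np.array([-6.0, 4.0])   # far inside the negative region
fence_sitter = np.array([1.0, 3.2])    # very close to the boundary

for x in (deep_inside, fence_sitter):
    z = np.dot(w, x) + b
    print(perceptron(z), round(sigmoid(z), 3))
# The perceptron outputs 0 for both points; the sigmoid outputs
# roughly 0.015 vs 0.485, exposing the fence-sitter.
```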
Now, as we keep changing the values of the parameters w and b, we will get different types of sigmoid functions, for example:
We will get different sigmoid plots for different values of the parameters, but none of them would be able to separate the green points from the red points.
How does the function change with the change in w and b?
Let's consider this for only one input. In that case, we have the equation:

y = 1 / (1 + e^(-(wx + b)))
where the parameters w and b are scalar values and x represents the input.
If we take ‘w’ as -0.3 and b as 0, we have the plot as below:
As w is negative, the slope of the sigmoid function is also negative, so as we increase the value of x, the value of the output decreases; this is what the negative slope means.
And as we keep increasing the magnitude of the slope, or rather make it more and more negative, the curve becomes sharper. That's what a large negative slope means: even if we change the value of x slightly, the value of the output drops drastically:
And if we now make the value of w positive, the slope is going to be positive, and the smaller the slope, the less drastic the change in the value of the output.
The next thing to see is how the function changes as we change the value of b:
To start with, we have taken the value of b as 4.9, and if we keep decreasing the value of b (keeping w constant), the function shifts towards the right.
And there is an explanation for why this happens:
We know that the value of the sigmoid function would be 0.5 when:

wx + b = 0

So, the value of the sigmoid is 0.5 when x is equal to:

x = -b/w

As we keep decreasing b, -b keeps increasing, and the boundary shifts towards the right (assuming w is positive).
The implication of all this is that when we are minimizing some loss function and changing the parameters, we have an idea of how the plot of the function is going to change.
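Here is a small sketch that sweeps over a few values of w and b to reproduce these effects (the specific values are illustrative); note that each curve crosses 0.5 at x = -b/w:

```python
import numpy as np
import matplotlib.pyplot as plt

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.linspace(-20, 20, 400)

# Larger |w| -> sharper transition; negative w -> decreasing curve
for w in [-2.0, -0.3, 0.3, 2.0]:
    plt.plot(x, sigmoid(w * x), label=f"w={w}, b=0")

# Decreasing b (with w > 0 fixed) shifts the curve right: it crosses 0.5 at x = -b/w
for b in [4.9, 0.0, -4.9]:
    plt.plot(x, sigmoid(0.5 * x + b), linestyle="--", label=f"w=0.5, b={b}")

plt.legend()
plt.show()
```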
Sigmoid: Data and Tasks
So far we have looked at the MP Neuron and Perceptron models, where our task was Binary Classification (the output could be 0 or 1). We can also use the Sigmoid Neuron for this kind of task, with the exception that instead of getting 0 or 1 as the output, it gives a value between 0 and 1, say 0.7, and we can use that to indicate whether the output is closer to class 1 or class 0. We can take some threshold value, based on the task at hand, to map the output to a particular class: for example, if the threshold is 0.5, then for any value greater than or equal to 0.5 we can say the input belongs to class 1, and any value less than 0.5 we can map to class 0.
Of course, once we put a threshold, it becomes the same as dealing with a Perceptron model, except that now we have more flexibility.
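A minimal sketch of this thresholding step (the helper name is hypothetical):

```python
def classify(y_hat, threshold=0.5):
    """Map the sigmoid output (a value in (0, 1)) to a hard class label."""
    return 1 if y_hat >= threshold else 0

print(classify(0.7))   # 1 -> closer to class 1
print(classify(0.3))   # 0 -> closer to class 0
```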
We could also use this function for a Regression task where the output is going to be between 0 and 1. The data could be a bunch of inputs, say 'n' inputs, and the true output would be some value between 0 and 1.
Sigmoid Loss Function:
We have looked at 3 jars: Model, Data, and Task. We are approximating the relationship between the input and the output using a Sigmoid function. Now we want to compute the loss given the input data, the true output, and the Sigmoid function.
We will first compute the predicted output as per the Sigmoid function for the given input data (let's say we have the parameter values, so we are able to compute the predicted output). Once we have the predicted output, we can use the Squared Error Loss function:

Loss = Σ (y - ŷ)²

where y is the true output and ŷ is the predicted output, and the sum runs over all the data points.
In practice, we might have the true output as binary, and in that case we could still use the Sigmoid function as the approximation between the input and the output, and still compute the loss using the squared error loss.
The point of treating the output as a probability, instead of having a hard value as the predicted output, is that it helps the model understand which data point is contributing more to the loss and then adjust its parameters accordingly. For example, let's say the true output is 1 for two points and the predicted outputs are 0.6 and 0.7 for these two data points. As 0.6 is farther from 1 than 0.7, the first point would contribute more to the loss. This would not have been the case for the Perceptron model, where the predicted output would have been 1 instead of 0.6 and 0.7.
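A quick sketch of the squared error loss, using the 0.6 and 0.7 example from above:

```python
import numpy as np

def squared_error_loss(y_true, y_pred):
    """Total squared error over the dataset."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    return np.sum((y_true - y_pred) ** 2)

# Both true outputs are 1; the sigmoid predicts 0.6 and 0.7
print((1 - 0.6) ** 2)                          # 0.16 -> contributes more to the loss
print((1 - 0.7) ** 2)                          # ~0.09
print(squared_error_loss([1, 1], [0.6, 0.7]))  # ~0.25 total
```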
We are now left with 2 jars for the Sigmoid Neuron model, the Learning Algorithm and the Evaluation metric, which are discussed in a separate article.
Machine Learning Artificial Intelligence Sigmoid Deep Learning Logistic Sigmoid