Activation Functions
• Activation functions are a crucial component of
artificial neural networks, used to introduce non-
linearity into the model.
• They determine whether a neuron should be activated
(fire) or not, based on the weighted sum of inputs.
• There are several types of activation functions, each
with its own characteristics and use cases.
• To put it in simple terms, an artificial neuron calculates
the ‘weighted sum’ of its inputs and adds a bias; this
quantity is called the net input (see the sketch below).
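A minimal sketch of this computation (the input, weight, and bias values below are arbitrary example numbers, not taken from any particular network):
import numpy as np

# Minimal sketch: a single artificial neuron computes the weighted sum of its
# inputs plus a bias (the net input). x, w and b are arbitrary example values.
x = np.array([0.5, -1.2, 3.0])   # inputs
w = np.array([0.4, 0.7, -0.2])   # weights
b = 0.1                          # bias

net_input = np.dot(w, x) + b     # can be any value from -inf to +inf
print("net input:", net_input)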
• Now the value of the net input can be anything from -inf to
+inf.
• The neuron on its own has no way to bound this value, and
thus cannot decide the firing pattern.
• This is why the activation function is an important part of an
artificial neural network.
• It decides whether a neuron should be activated or not, and in
doing so it bounds the value of the net input.
The activation function is a non-linear transformation that we
apply to the net input before sending it to the next layer of
neurons or producing the final output.
Types of Activation Functions
• Several different types of activation functions are
used in Machine Learning.
• Some of them are explained next.
Step Function
• The step function is one of the simplest kinds of
activation functions.
• Here we consider a threshold value: if the net input
(say y) is greater than (or equal to) the threshold,
the neuron is activated; otherwise it is not.
Mathematically,
f(y) = 1 for y >= threshold, f(y) = 0 otherwise
Graphically, the step function stays at 0 below the threshold
and jumps to 1 at the threshold.
Sigmoid Function
• Sigmoid function is a widely used activation
function. It is defined as:
f(x) = 1 / (1 + e^(-x))
• This is a smooth function and is continuously
differentiable.
• The biggest advantage that it has over the step and
linear functions is that it is smooth and non-linear.
• This is an incredibly cool feature of the sigmoid
function.
• This essentially means that when multiple neurons use
the sigmoid function as their activation function, the
output of the network is non-linear as well (a small
sketch follows below).
• The function ranges from 0 to 1 and has an S shape.
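A small sketch of this point (the weight matrices are random example values): stacking two linear layers without an activation collapses to a single linear map, while inserting a sigmoid between them does not.
import numpy as np

# Illustrative sketch: two linear layers with no activation are equivalent to
# one linear layer; adding a sigmoid in between breaks that equivalence.
# The weight matrices are random example values.
def sigmoid(x):
    return 1 / (1 + np.exp(-x))

rng = np.random.default_rng(0)
W1 = rng.normal(size=(4, 3))
W2 = rng.normal(size=(2, 4))
x = rng.normal(size=3)

linear_stack = W2 @ (W1 @ x)          # two linear layers ...
collapsed = (W2 @ W1) @ x             # ... equal a single linear layer
nonlinear = W2 @ sigmoid(W1 @ x)      # no single matrix can reproduce this

print(np.allclose(linear_stack, collapsed))   # True
print(nonlinear)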
ReLU
• The ReLU function is the Rectified Linear Unit.
• It is the most widely used activation function.
It is defined as:
f(x) = max(0, x)
• The main advantage of using the ReLU
function over other activation functions is that
it does not activate all the neurons at the
same time.
• This means that if the input is negative, it is converted
to zero and the neuron is not activated, as the short
sketch below illustrates.
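A short sketch of this behaviour (the pre-activations are random example values for 10 hypothetical neurons): negative net inputs are zeroed, so only a fraction of the neurons fire.
import numpy as np

# Illustrative sketch: ReLU zeroes out negative pre-activations, so only part
# of a layer is active for a given input. The pre-activations are random
# example values.
rng = np.random.default_rng(0)
pre_activations = rng.normal(size=10)
activations = np.maximum(0, pre_activations)

print("pre-activations:", np.round(pre_activations, 2))
print("activations:    ", np.round(activations, 2))
print("fraction active:", np.mean(activations > 0))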
Leaky ReLU
• The Leaky ReLU function is an improved version of the
ReLU function.
• Instead of defining the function as 0 for x less than 0,
we define it as a small linear component of x. It can be
defined as:
f(x) = x for x >= 0, f(x) = 0.01 * x for x < 0
Parametric Rectified Linear Unit (PReLU)
• Similar to Leaky ReLU, but the slope a of the negative
part is learned during training rather than being a fixed
constant (a minimal sketch follows below).
• This provides flexibility to adapt the slope of the
negative part of the activation function.
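A minimal sketch of what "learned during training" means here (not a full training loop; the inputs, upstream gradient, learning rate and initial slope are assumed example values): the gradient of PReLU with respect to a is 0 for x >= 0 and x for x < 0, so a can be updated by gradient descent like any other parameter.
import numpy as np

# Minimal sketch of a learnable PReLU slope `a` (not a full training loop).
# Inputs, upstream gradient, learning rate and initial `a` are example values.
def prelu(x, a):
    return np.where(x >= 0, x, a * x)

def prelu_grad_a(x):
    # d f / d a = 0 for x >= 0, x for x < 0
    return np.where(x >= 0, 0.0, x)

a = 0.25
x = np.array([-2.0, -0.5, 0.0, 1.0, 3.0])
upstream_grad = np.ones_like(x)           # assume dLoss/dOutput = 1

grad_a = np.sum(upstream_grad * prelu_grad_a(x))
a -= 0.01 * grad_a                        # one gradient-descent step on `a`
print("updated a:", a)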
Exponential Linear Unit (ELU)
• Formula:
f(x) = x for x >= 0,
f(x) = a * (e^x - 1) for x < 0 (where a is a positive
constant)
• Output range: (-a, ∞)
• Similar to Leaky ReLU, but smooth for negative inputs,
and its negative outputs push the mean activation closer
to zero.
• It helps address the vanishing gradient problem.
Scaled Exponential Linear Unit (SELU)
• Formula: similar to ELU, but with specific values for
alpha and an extra scale factor (plus a matching scaling
of the weights):
f(x) = scale * x for x >= 0,
f(x) = scale * alpha * (e^x - 1) for x < 0
(where alpha ≈ 1.67326 and scale ≈ 1.0507)
• Designed to have self-normalizing properties
and improve training stability in deep
networks.
Hyperbolic Tangent (Tanh)
• Formula: f(x) = (e^(2x) - 1) / (e^(2x) + 1)
• Output range: (-1, 1)
• Similar to the sigmoid, but centered at 0.
• Being zero-centered helps mitigate the vanishing
gradient problem.
• Used in hidden layers of neural networks,
especially when outputs are normalized.
Swish
• Formula: f(x) = x / (1 + e^(-x)), i.e., x * sigmoid(x)
• Designed to be smoother than ReLU and has
shown promise in some applications.
Implementation
• Step Function
import numpy as np
import matplotlib.pyplot as plt

def step_function(x):
    return np.where(x >= 0, 1, 0)

x = np.linspace(-5, 5, 100)
y = step_function(x)

plt.plot(x, y)
plt.title("Step Function")
plt.grid()
plt.show()
Sigmoid Function
import numpy as np
import matplotlib.pyplot as plt

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

x = np.linspace(-5, 5, 100)
y = sigmoid(x)

plt.plot(x, y)
plt.title("Sigmoid Function")
plt.grid()
plt.show()
Hyperbolic Tangent (Tanh)
import numpy as np
import matplotlib.pyplot as plt

def tanh(x):
    return np.tanh(x)

x = np.linspace(-5, 5, 100)
y = tanh(x)

plt.plot(x, y)
plt.title("Hyperbolic Tangent (Tanh)")
plt.grid()
plt.show()
Rectified Linear Unit (ReLU)
import numpy as np
import matplotlib.pyplot as plt

def relu(x):
    return np.maximum(0, x)

x = np.linspace(-5, 5, 100)
y = relu(x)

plt.plot(x, y)
plt.title("Rectified Linear Unit (ReLU)")
plt.grid()
plt.show()
Leaky Rectified Linear Unit (Leaky ReLU)
import numpy as np
import matplotlib.pyplot as plt

def leaky_relu(x, alpha=0.01):
    return np.where(x >= 0, x, alpha * x)

x = np.linspace(-5, 5, 100)
y = leaky_relu(x)

plt.plot(x, y)
plt.title("Leaky Rectified Linear Unit (Leaky ReLU)")
plt.grid()
plt.show()
Parametric Rectified Linear Unit (PReLU)
import numpy as np
import matplotlib.pyplot as plt

def prelu(x, a=0.01):
    return np.where(x >= 0, x, a * x)

x = np.linspace(-5, 5, 100)
y = prelu(x)

plt.plot(x, y)
plt.title("Parametric Rectified Linear Unit (PReLU)")
plt.grid()
plt.show()
Exponential Linear Unit (ELU)
import numpy as np
import matplotlib.pyplot as plt

def elu(x, alpha=1.0):
    return np.where(x >= 0, x, alpha * (np.exp(x) - 1))

x = np.linspace(-5, 5, 100)
y = elu(x)

plt.plot(x, y)
plt.title("Exponential Linear Unit (ELU)")
plt.grid()
plt.show()
Exponential Linear Unit (ELU)
• In this program, we define the ELU activation
function using the formula
elu(x, alpha) = x for x >= 0,
and elu(x, alpha) = alpha * (exp(x) - 1) for x < 0.
• You can adjust the alpha parameter to control
the slope of the negative part of the curve (see
the usage sketch below).
• The code then creates a range of x values,
computes the corresponding y values using
the ELU function, and plots the result.
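As a usage example of the alpha parameter mentioned above (the particular alpha values chosen here are arbitrary), the same elu function can be plotted for several alphas:
import numpy as np
import matplotlib.pyplot as plt

# Plot ELU for a few alpha values to see how alpha controls the saturation
# level of the negative part. The alpha values are arbitrary examples.
def elu(x, alpha=1.0):
    return np.where(x >= 0, x, alpha * (np.exp(x) - 1))

x = np.linspace(-5, 5, 100)
for alpha in (0.5, 1.0, 2.0):
    plt.plot(x, elu(x, alpha), label=f"alpha={alpha}")
plt.title("ELU for different alpha values")
plt.legend()
plt.grid()
plt.show()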
Scaled Exponential Linear Unit (SELU)
• The Scaled Exponential Linear Unit (SELU) is a
self-normalizing activation function that can
maintain mean activations close to 0 and
standard deviations close to 1 during training.
• Here's a Python implementation of the SELU
activation function:
Scaled Exponential Linear Unit (SELU)
import numpy as np
import matplotlib.pyplot as plt

def selu(x, alpha=1.67326, scale=1.0507):
    return scale * np.where(x > 0, x, alpha * (np.exp(x) - 1))

x = np.linspace(-5, 5, 100)
y = selu(x)

plt.plot(x, y)
plt.title("Scaled Exponential Linear Unit (SELU)")
plt.grid()
plt.show()
Scaled Exponential Linear Unit (SELU)
• In this implementation, we define the SELU
activation function using the formula
selu(x, alpha, scale) = scale * x for x >= 0,
and selu(x, alpha, scale) = scale * (alpha * (exp(x) - 1)) for
x < 0.
• The alpha and scale parameters are specific values
that are part of the SELU definition.
• The code then creates a range of x values,
computes the corresponding y values using the
SELU function, and plots the result.
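A rough sketch of the self-normalizing behaviour described above (the layer width and depth are arbitrary choices; the 1/fan_in weight variance corresponds to the "scaling of weights" mentioned earlier): passing roughly standard-normal inputs through several SELU layers keeps the mean near 0 and the standard deviation near 1.
import numpy as np

# Rough illustration of SELU's self-normalizing behaviour. Layer width, depth
# and the LeCun-style 1/fan_in weight variance are example choices.
def selu(x, alpha=1.67326, scale=1.0507):
    return scale * np.where(x > 0, x, alpha * (np.exp(x) - 1))

rng = np.random.default_rng(0)
h = rng.normal(size=(1000, 256))              # batch of standard-normal inputs

for layer in range(10):
    w = rng.normal(0.0, np.sqrt(1.0 / 256), size=(256, 256))
    h = selu(h @ w)
    print(f"layer {layer + 1}: mean = {h.mean():+.3f}, std = {h.std():.3f}")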
Swish Function
• The Swish activation function is defined as:
swish(x) = x / (1 + exp(-x))
Swish Function
import numpy as np
import matplotlib.pyplot as plt

def swish(x):
    return x / (1 + np.exp(-x))

x = np.linspace(-5, 5, 100)
y = swish(x)

plt.plot(x, y)
plt.title("Swish Activation Function")
plt.grid()
plt.show()
Swish Function
• In this code, we define the Swish function
using the formula provided.
• We create a range of x values, calculate the
corresponding y values using the Swish
function, and plot the curve.
• The Swish function is known for being smooth
and continuous, allowing it to be a viable
choice as an activation function in neural
networks.
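To wrap up, a short comparison sketch (the set of functions and the plotting range are arbitrary choices) that plots several of the activations discussed above on one figure:
import numpy as np
import matplotlib.pyplot as plt

# Plot several of the activation functions discussed above on one figure.
# Definitions are repeated here so the snippet is self-contained.
activations = {
    "Sigmoid": lambda x: 1 / (1 + np.exp(-x)),
    "Tanh": np.tanh,
    "ReLU": lambda x: np.maximum(0, x),
    "Leaky ReLU": lambda x: np.where(x >= 0, x, 0.01 * x),
    "ELU": lambda x: np.where(x >= 0, x, np.exp(x) - 1),
    "Swish": lambda x: x / (1 + np.exp(-x)),
}

x = np.linspace(-5, 5, 200)
for name, f in activations.items():
    plt.plot(x, f(x), label=name)
plt.title("Comparison of activation functions")
plt.legend()
plt.grid()
plt.show()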