CHAPTER 7 | Neural Networks and
Neural Language Models
“[M]achines of this character can behave in a very
complicated manner when the number of units is large.”
Alan Turing (1948) "Intelligent Machinery", page 6
Introduction
They are called neural because:
Their origins lie in the biological neuron
A simplified model of the human neuron as a computing element
Described in terms of propositional logic
Introduction
A neural network is a
Network of small computing units
Each unit takes a vector of input values
And produces a single output value
Called a feedforward network
• Because the computation proceeds iteratively from one layer
of units to the next
Unit
Takes a weighted sum of its inputs
Plus one additional bias term
Using vector notation:
z = w · x + b
[w: weight vector, b: scalar bias, x: input vector]
Activation
Apply a non-linear function f to z
The output of this function is the activation value, a
y = a = f(z)
Different activation functions:
• Sigmoid
• Tanh
• rectified linear unit or ReLU
Sigmoid
σ(z) = 1 / (1 + e^(−z)): it maps the output into the range (0, 1)
The output of a neural unit:
y = σ(w · x + b) = 1 / (1 + exp(−(w · x + b)))
Example:
weight vector: w = [0.2, 0.3, 0.9], bias: b = 0.5 and x = [0.5, 0.6, 0.1]
z = w · x + b = 0.1 + 0.18 + 0.09 + 0.5 = 0.87
y = σ(0.87) ≈ 0.70
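A minimal NumPy sketch of this single sigmoid unit, using the example weights and input above (the sigmoid helper is ours, not from the slides):

    import numpy as np

    def sigmoid(z):
        # Squash a real value into the range (0, 1).
        return 1 / (1 + np.exp(-z))

    w = np.array([0.2, 0.3, 0.9])   # weight vector
    b = 0.5                          # scalar bias
    x = np.array([0.5, 0.6, 0.1])   # input vector

    z = np.dot(w, x) + b             # weighted sum plus bias: 0.87
    y = sigmoid(z)                   # activation value: about 0.70
    print(z, y)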
Sigmoid
Used in the output layer for binary classification
Disadvantage:
• Non-zero centered output
Tanh
Advantage: the mean of the activations is closer to zero (output range (−1, 1))
ReLU (Rectified Linear Unit)
y = ReLU(z) = max(z,0)
Advantages:
• Avoids vanishing gradient problem
• Doesn't become saturated
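A quick NumPy sketch of the three activation functions mentioned above (the sample z values are just for illustration):

    import numpy as np

    def sigmoid(z):
        return 1 / (1 + np.exp(-z))      # range (0, 1), not zero-centered

    def tanh(z):
        return np.tanh(z)                # range (-1, 1), roughly zero-centered

    def relu(z):
        return np.maximum(z, 0)          # max(z, 0); does not saturate for z > 0

    z = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
    print(sigmoid(z))
    print(tanh(z))
    print(relu(z))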
Vanishing Gradient:
Networks are trained by
Propagating an error signal backwards through the layers
Gradients that are almost 0 cause the error signal to become
too small to be useful for training
This is called the vanishing gradient problem
The XOR problem
Can neural units compute simple functions of input?
    x1  x2 | AND | OR | XOR
     0   0 |  0  |  0 |  0
     0   1 |  0  |  1 |  1
     1   0 |  0  |  1 |  1
     1   1 |  1  |  1 |  0
Perceptrons
A very simple neural unit
• Binary output (0 or 1)
• No non-linear activation function
Easy to build AND or OR with perceptrons
AND: weights w1 = 1, w2 = 1, bias b = −1 (output 1 if w1x1 + w2x2 + b > 0, else 0)
• [0, 0]: 0 + 0 − 1 = −1 → output 0
• [0, 1]: 0 + 1 − 1 = 0 → output 0
• [1, 0]: 1 + 0 − 1 = 0 → output 0
• [1, 1]: 1 + 1 − 1 = 1 → output 1
OR: weights w1 = 1, w2 = 1, bias b = 0
• [0, 0]: 0 + 0 + 0 = 0 → output 0
• [0, 1]: 0 + 1 + 0 = 1 → output 1
• [1, 0]: 1 + 0 + 0 = 1 → output 1
• [1, 1]: 1 + 1 + 0 = 2 → output 1
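A small Python sketch of these perceptrons, checking the AND and OR truth tables with the weight and bias values shown above:

    def perceptron(x1, x2, w1, w2, b):
        # Perceptron rule: output 1 if the weighted sum plus bias is positive, else 0.
        return 1 if w1 * x1 + w2 * x2 + b > 0 else 0

    for x1 in (0, 1):
        for x2 in (0, 1):
            and_out = perceptron(x1, x2, w1=1, w2=1, b=-1)  # AND
            or_out = perceptron(x1, x2, w1=1, w2=1, b=0)    # OR
            print(x1, x2, and_out, or_out)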
Not possible to capture XOR with a perceptron!
The perceptron equation, given inputs x1 and x2, is the equation of a line:
w1x1 + w2x2 + b = 0
in standard linear format: x2 = (−w1/w2)x1 + (−b/w2)
This line acts as a decision boundary
• Output 0 if the input is on one side of the line
• Output 1 if it is on the other side
• For example, with the AND weights above (w1 = w2 = 1, b = −1), the boundary is the line x2 = −x1 + 1
Decision boundaries
[Figure: decision boundaries in the x1–x2 plane for (a) AND, (b) OR, (c) XOR; each panel plots the four inputs (0,0), (0,1), (1,0), (1,1)]
Filled circles represent perceptron outputs of 1,
white circles perceptron outputs of 0
There is no way to draw a single line that correctly separates
the two categories for XOR
The solution: neural networks
XOR can be computed by a layered network of units
For example, using two layers of ReLU-based units:
• The middle (hidden) layer, called h, has two units
• The output layer, called y, has one unit
The solution: neural networks
Worked example, input x = [0, 0]:
• Hidden layer pre-activations: [0, −1]
• After ReLU, h = [0, 0]
• Final output: 0
Worked example, input x = [0, 1]:
• Hidden layer pre-activations: [1, 0]
• After ReLU, h = [1, 0]
• Final output: 1
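A minimal NumPy sketch of this XOR network. The specific weights (W = [[1, 1], [1, 1]], b = [0, −1], U = [1, −2]) are an assumption, chosen to be consistent with the worked values above:

    import numpy as np

    def relu(z):
        return np.maximum(z, 0)

    # Assumed weights, consistent with the worked examples above:
    # each hidden unit sums both inputs, the second has bias -1,
    # and the output unit computes h1 - 2*h2.
    W = np.array([[1.0, 1.0],
                  [1.0, 1.0]])
    b = np.array([0.0, -1.0])
    U = np.array([1.0, -2.0])

    def xor_net(x):
        h = relu(W @ np.asarray(x, dtype=float) + b)   # hidden layer
        return U @ h                                    # output layer

    for x in ([0, 0], [0, 1], [1, 0], [1, 1]):
        print(x, xor_net(x))   # prints 0, 1, 1, 0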
The hidden representation h
The inputs x = [0, 1] and x = [1, 0] are
both mapped to the same hidden
representation h = [1, 0]
This merger makes it easy
to linearly separate the
positive and negative
cases of XOR
Feedforward Neural Networks
Simplest kind of neural network
Multilayer network
Units are connected with no cycles
Outputs from units in each layer are passed to units in the
next higher layer
No outputs are passed back to lower layers
Sometimes called multi-layer perceptrons
Feedforward Neural Networks
Feedforward networks have three
kinds of nodes
• input units, hidden units, and
output units
Layers are fully-connected
• Each unit takes as input the outputs from
the previous layer
• There is a link between every pair of units
from two adjacent layers
Hidden layer computation
3 steps: multiply the weight matrix W by the input vector x, add the
bias vector b, apply the activation function g (here, the sigmoid σ)
h = σ(Wx + b)
The activation function applies elementwise:
g([z1, z2, z3]) = [g(z1), g(z2), g(z3)]
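A minimal NumPy sketch of the hidden-layer computation, with an assumed 3-unit hidden layer over a 2-dimensional input (the weight and bias values are made up for illustration):

    import numpy as np

    def sigmoid(z):
        return 1 / (1 + np.exp(-z))

    # Assumed shapes: 2 inputs, 3 hidden units.
    W = np.array([[0.1, 0.2],
                  [0.3, 0.4],
                  [0.5, 0.6]])          # weight matrix, shape (3, 2)
    b = np.array([0.1, 0.1, 0.1])       # bias vector, shape (3,)
    x = np.array([1.0, 2.0])            # input vector, shape (2,)

    h = sigmoid(W @ x + b)              # h = sigma(Wx + b), applied elementwise
    print(h)                            # vector of 3 activations in (0, 1)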
Output layer computation
weight matrix U, U ∈ ℝ^(n2×n1)
input vector h
intermediate output z, z ∈ ℝ^n2, where z = Uh
z is a vector of real-valued numbers
It can't directly be the output of the classifier
Softmax
Normalizes a vector of real values
into a vector that encodes a probability
distribution: values between 0 and 1
that sum to 1
softmax(zi) = exp(zi) / Σj exp(zj)
Used for multiclass classification
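A short NumPy sketch of softmax (subtracting the maximum is a standard trick for numerical stability, not part of the definition; the sample z values are made up):

    import numpy as np

    def softmax(z):
        # Exponentiate and normalize so outputs lie in (0, 1) and sum to 1.
        exp_z = np.exp(z - np.max(z))   # shift by max(z) for numerical stability
        return exp_z / exp_z.sum()

    z = np.array([0.6, 1.1, -1.5, 1.2, 3.2, -1.1])
    print(softmax(z), softmax(z).sum())   # probabilities summing to 1.0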
Final Equation
h = σ(Wx+b)
z = Uh
y = softmax(z)
where x ∈ ℝ^n0, h ∈ ℝ^n1, b ∈ ℝ^n1, W ∈ ℝ^(n1×n0), U ∈ ℝ^(n2×n1), and the
output vector y ∈ ℝ^n2
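Putting it together, a minimal sketch of the full forward pass, with assumed dimensions n0 = 2, n1 = 3, n2 = 4 and randomly initialized weights for illustration:

    import numpy as np

    def sigmoid(z):
        return 1 / (1 + np.exp(-z))

    def softmax(z):
        exp_z = np.exp(z - np.max(z))
        return exp_z / exp_z.sum()

    rng = np.random.default_rng(0)
    n0, n1, n2 = 2, 3, 4               # input, hidden, and output sizes (assumed)
    W = rng.normal(size=(n1, n0))      # hidden-layer weights
    b = rng.normal(size=n1)            # hidden-layer bias
    U = rng.normal(size=(n2, n1))      # output-layer weights

    x = np.array([1.0, 0.5])           # input vector, x in R^n0
    h = sigmoid(W @ x + b)             # hidden layer
    z = U @ h                          # intermediate output
    y = softmax(z)                     # probability distribution over n2 classes
    print(y, y.sum())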