
DEEP LEARNING

              ┌──────────────────────┐
              │    Deep Learning     │
              └──────────────────────┘
         ┌────────────────┼────────────────┐
         │                │                │
       CNNs             RNNs         Transformers
         │                │                │
   Vision Tasks     Sequence Data   Language Models
         │                │                │
 Image Recognition   Speech/Text  Chatbots, GPT, BERT

1. What is Deep Learning?

Deep Learning (DL) is a subset of Machine Learning (ML) that uses artificial
neural networks with multiple layers (hence the word deep) to automatically
learn representations from data.

It uses artificial neural networks: computational systems composed of layers of interconnected "neurons."

Each layer in a neural network automatically learns to transform input data (like images, sounds, or text) into increasingly abstract representations.

Difference Between Machine Learning and Deep Learning:

Feature Extraction: Traditional ML is manual (requires domain expertise); Deep Learning is automatic (learns features directly from data).
Data Requirement: Traditional ML works with smaller datasets; Deep Learning requires large datasets.
Performance: Traditional ML is limited on complex data (e.g., images, text); Deep Learning excels on unstructured data.
Computation: Traditional ML needs low to moderate compute; Deep Learning needs high compute (GPUs/TPUs).
Interpretability: Traditional ML models are easier to interpret; Deep Learning models are harder to interpret (often treated as a "black box").

2. Key Benefits of Deep Learning

1. Automatic Feature Extraction

Deep Learning eliminates the need for manual feature engineering by learning
useful features directly from raw data.

2. Handles Unstructured Data

DL can process images, audio, text, and video, which are challenging for traditional
algorithms.

3. High Accuracy

With sufficient data and computational power, deep models often outperform
traditional ML models in real-world tasks.

4. Scalability

DL models improve performance with more data, unlike many classical methods
that plateau.

5. End-to-End Learning

DL systems can take raw input and directly output predictions without manual
preprocessing.

3. Neurons & Perceptrons

A neuron (or perceptron) is the basic computational unit of a neural network.

Each neuron:
Takes multiple inputs.
Multiplies them by weights and adds a bias.
Passes the result through an activation function.
Mathematical Representation:
y = f(w_1x_1 + w_2x_2 + ... + w_nx_n + b)
Where:
x_i: Inputs
w_i: Weights
b: Bias
f: Activation function
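
A minimal NumPy sketch of this computation (the input, weight, and bias values below are made up, and sigmoid is just one possible choice of f):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    # Example inputs, weights, and bias (arbitrary illustrative values)
    x = np.array([0.5, -1.2, 3.0])   # x_1 ... x_n
    w = np.array([0.8, 0.1, -0.4])   # w_1 ... w_n
    b = 0.2                          # bias

    z = np.dot(w, x) + b             # weighted sum: w_1*x_1 + ... + w_n*x_n + b
    y = sigmoid(z)                   # apply the activation function f
    print(y)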

4. Activation Functions

Activation functions introduce non-linearity so networks can learn complex mappings.

1. Sigmoid Function

f(x) = 1 / (1 + e^-x)

Range: (0, 1)
Shape: S-shaped (smooth curve)
Key Points:
Maps any real number into a range between 0 and 1.
Often used for binary classification in the output layer.
Major drawback: causes vanishing gradients (very small gradient values during
training).

Visual Idea:
x: -∞ → ∞, f(x): 0 → 1
S-shaped curve

Example use:
Output neuron predicting probability of “yes/no” or “0/1”.
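
A quick NumPy sketch of the sigmoid (the sample inputs are arbitrary):

    import numpy as np

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    x = np.array([-5.0, 0.0, 5.0])
    print(sigmoid(x))   # approx [0.0067 0.5 0.9933] -- every value squashed into (0, 1)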

2. Tanh (Hyperbolic Tangent) Function

f(x) = tanh(x) = (e^x - e^-x) / (e^x + e^-x)

Range: (-1, 1)
Shape: S-shaped, but centered at zero.

Key Points:
Similar to sigmoid but zero-centered, which helps training stability.
Still suffers from vanishing gradient for large values of x.

Visual Idea:
x: -∞ → ∞, f(x): -1 → 1
S-shaped curve centered at 0

Use Case:
Hidden layers in simple neural networks.
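
A quick NumPy sketch of tanh (the sample inputs are arbitrary):

    import numpy as np

    x = np.array([-5.0, 0.0, 5.0])
    print(np.tanh(x))   # approx [-0.9999 0. 0.9999] -- zero-centered, squashed into (-1, 1)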

3. ReLU (Rectified Linear Unit)

f(x) = max(0, x)

Range: [0, ∞)
Shape: Linear for positive values, flat for negative.
Key Points:
The most commonly used activation in deep learning.
Fast and efficient — introduces non-linearity cheaply.
Simple: outputs 0 for negatives, same value for positives.
Problem: “Dying ReLU” — neurons can become inactive if weights push
inputs below 0 for all data.

Visual Idea:
x<0 → 0, x>0 → x

Use Case:
Hidden layers in CNNs, MLPs, etc.
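
A quick NumPy sketch of ReLU (the sample inputs are arbitrary):

    import numpy as np

    def relu(x):
        return np.maximum(0, x)

    x = np.array([-2.0, -0.5, 0.0, 1.5, 3.0])
    print(relu(x))   # [0. 0. 0. 1.5 3.] -- negatives are zeroed out, positives pass through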

4. Leaky ReLU

f(x) = x if x > 0, else 0.01x

Range: (-∞, ∞)
Shape: Like ReLU but with a small slope for negative inputs.

Key Points:
Solves the dying ReLU problem.
Allows a small gradient even for negative values (e.g., slope = 0.01).
Visual Idea:
x<0 → small slope (0.01x)
x>0 → linear (x)

Use Case:
Modern CNNs, GANs, and deeper models.
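
A quick NumPy sketch of Leaky ReLU (the sample inputs are arbitrary; alpha = 0.01 is the usual default slope):

    import numpy as np

    def leaky_relu(x, alpha=0.01):
        return np.where(x > 0, x, alpha * x)

    x = np.array([-2.0, -0.5, 0.0, 1.5, 3.0])
    print(leaky_relu(x))   # [-0.02 -0.005 0. 1.5 3.] -- negatives keep a small slope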

5. Softmax Function

f(x_i) = e^(x_i) / Σ_j e^(x_j)

Range: (0, 1), sum of all outputs = 1

Key Points:
Converts raw scores (logits) into probabilities.
Used in the output layer of multi-class classification models.
Ensures all class probabilities add up to 1.

Visual Idea:
Input: [2.0, 1.0, 0.1]
Output: [0.66, 0.24, 0.10] (probabilities)
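
A quick NumPy sketch that reproduces the example above:

    import numpy as np

    def softmax(x):
        e = np.exp(x - np.max(x))   # subtract the max for numerical stability
        return e / e.sum()

    logits = np.array([2.0, 1.0, 0.1])
    probs = softmax(logits)
    print(probs)         # approx [0.66 0.24 0.10]
    print(probs.sum())   # 1.0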

5. Neural Network Architectures

1. Artificial Neural Network (ANN)

Structure:
Simplest form of a neural network.
Made of layers of neurons: input layer → hidden layers → output layer.
Each neuron is connected to all neurons in the next layer (fully connected).

How it works:
Each layer learns certain relationships in data by adjusting weights.
Information flows forward only (no loops).

Use cases:
Works well for structured/tabular data — e.g., predicting house prices,
classification tasks, credit scoring.

Visual Idea:

Input → Hidden Layer(s) → Output
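
A minimal sketch of such a fully connected network in PyTorch (assuming PyTorch is installed; the layer sizes below are arbitrary):

    import torch
    import torch.nn as nn

    # Fully connected network: input layer -> hidden layers -> output layer
    model = nn.Sequential(
        nn.Linear(10, 32),   # 10 input features -> 32 hidden units
        nn.ReLU(),
        nn.Linear(32, 16),
        nn.ReLU(),
        nn.Linear(16, 1),    # single output (e.g., a predicted price)
    )

    x = torch.randn(4, 10)   # a batch of 4 samples with 10 features each
    print(model(x).shape)    # torch.Size([4, 1])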

2. Convolutional Neural Network (CNN)

Structure:
Designed for image and spatial data.
Has convolution layers (which detect patterns like edges or shapes) and pooling
layers (which reduce size).

How it works:
The model “scans” over the image using filters (small windows) to find patterns.
Deeper layers detect complex features (like faces, objects, etc.).

Use cases:
Computer vision tasks: image classification, facial recognition, medical imaging,
object detection.

Visual Idea:
Image → Convolution → Pooling → Flatten → Dense → Output
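
A minimal PyTorch sketch following that pipeline (assuming PyTorch; the image size, channel counts, and class count are arbitrary):

    import torch
    import torch.nn as nn

    # Tiny CNN for 28x28 grayscale images and 10 classes
    model = nn.Sequential(
        nn.Conv2d(1, 16, kernel_size=3, padding=1),   # convolution: detect local patterns
        nn.ReLU(),
        nn.MaxPool2d(2),                               # pooling: 28x28 -> 14x14
        nn.Conv2d(16, 32, kernel_size=3, padding=1),
        nn.ReLU(),
        nn.MaxPool2d(2),                               # 14x14 -> 7x7
        nn.Flatten(),
        nn.Linear(32 * 7 * 7, 10),                     # dense layer -> class scores
    )

    x = torch.randn(4, 1, 28, 28)   # a batch of 4 images
    print(model(x).shape)           # torch.Size([4, 10])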

3. Recurrent Neural Network (RNN)

Structure:
Designed for sequential or time-based data.
Has connections that form loops, allowing memory of previous inputs.

How it works:
Each output depends not only on the current input but also on previous steps
(context).
Variants: LSTM (Long Short-Term Memory) and GRU (Gated Recurrent Unit)
handle long-term dependencies better.

Use cases:
Text, speech, time-series data, stock predictions, chatbots, translation.

Visual Idea:
Input_t → Hidden_t → Output_t
              ↑
        Hidden_(t-1)
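
A minimal PyTorch sketch (assuming PyTorch; the sizes are arbitrary):

    import torch
    import torch.nn as nn

    # Simple RNN over a sequence
    rnn = nn.RNN(input_size=8, hidden_size=16, batch_first=True)

    x = torch.randn(4, 20, 8)            # batch of 4 sequences, 20 time steps, 8 features each
    outputs, h_last = rnn(x)             # an output at every step plus the final hidden state
    print(outputs.shape, h_last.shape)   # torch.Size([4, 20, 16]) torch.Size([1, 4, 16])

    # Drop-in variants that handle long-term dependencies better:
    lstm = nn.LSTM(input_size=8, hidden_size=16, batch_first=True)
    gru = nn.GRU(input_size=8, hidden_size=16, batch_first=True)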

4. Autoencoders

Structure:
Consist of two main parts: Encoder (compresses data) and Decoder (reconstructs
data).

How it works:
The network learns to compress data into a smaller representation (latent space)
and then recreate the original input.

Use cases:
Dimensionality reduction, denoising, anomaly detection, image compression.

Visual Idea:
Input → Encoder → Latent Space → Decoder → Output
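
A minimal PyTorch sketch of the encoder/decoder pair (assuming PyTorch; the 784-dimensional input and 32-dimensional latent space are arbitrary choices):

    import torch
    import torch.nn as nn

    # Encoder compresses 784 -> 32, decoder reconstructs 32 -> 784
    encoder = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 32))
    decoder = nn.Sequential(nn.Linear(32, 128), nn.ReLU(), nn.Linear(128, 784))

    x = torch.randn(4, 784)                     # e.g., 4 flattened 28x28 images
    latent = encoder(x)                         # compressed representation (latent space)
    reconstruction = decoder(latent)            # attempt to recreate the original input
    print(latent.shape, reconstruction.shape)   # torch.Size([4, 32]) torch.Size([4, 784])

    # Training would minimize a reconstruction loss, e.g. nn.MSELoss()(reconstruction, x)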

5. Transformers

Structure:
Built using a mechanism called self-attention, which allows the model to
understand the relationship between all elements in a sequence (e.g., words in a
sentence).
Replaces recurrence (no loops like RNNs).

How it works:
The self-attention layer learns which parts of the input are most relevant to each
other.
Processes sequences in parallel, making it faster and more scalable.

Use cases:
Natural Language Processing (NLP): translation, chatbots, summarization,
question answering.
Also used in vision (Vision Transformers / ViT) and audio processing.

Visual Idea:
Input → Multi-Head Attention → Feed Forward → Output
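
A minimal PyTorch sketch of one such block (assuming PyTorch; the embedding size, head count, and sequence length are arbitrary):

    import torch
    import torch.nn as nn

    # One Transformer encoder block: multi-head self-attention + feed-forward network
    layer = nn.TransformerEncoderLayer(d_model=64, nhead=4, batch_first=True)

    x = torch.randn(2, 10, 64)   # batch of 2 sequences, 10 tokens, 64-dim embeddings each
    out = layer(x)               # every token attends to every other token, in parallel
    print(out.shape)             # torch.Size([2, 10, 64])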

6. Applications of Deep Learning


Domain                              Example Applications
Computer Vision                     Image classification, object detection, medical imaging
Natural Language Processing (NLP)   Chatbots, text translation, sentiment analysis
Speech & Audio                      Voice assistants (Alexa, Siri), speech recognition
Healthcare                          Disease prediction, drug discovery, medical imaging
Autonomous Systems                  Self-driving cars, robotics, drones
Finance & Business                  Fraud detection, algorithmic trading, recommendation systems