CS-866 Deep Reinforcement Learning
Introduction
Nazar Khan
Department of Computer Science
University of the Punjab
Introduction Supervised ML Unsupervised ML Reinforcement Learning
What is Deep Reinforcement Learning?
• Deep RL studies how to solve complex problems that require making a sequence of good decisions.
• These problems often live in high-dimensional state spaces:
  – Many variables must be considered simultaneously.
  – Example: In chess, the position of each piece defines the state; there are more possible states than atoms in the universe.
  – Example: In robotics, sensors may produce hundreds or thousands of readings per time step.
Examples of Sequential Decision-Making
• Making Tea: wait until the water is boiling, add tea leaves, adjust milk, control sweetness, simmer for flavor, strain before serving.
• Tic-Tac-Toe: sequences of moves, the opponent's responses, and planning ahead.
• Chess: a much more complex version of tic-tac-toe with an astronomical state space.
• Having a Conversation: listen to the other person, interpret context, choose a relevant response, maintain flow, achieve an agenda.
• Success comes from a sequence of decisions, not a single one. Each decision has an immediate consequence and a long-term consequence.
• An RL agent learns through trial-and-error.
State Spaces
Figure: Example boards for Tic-Tac-Toe and Chess.
• Tic-Tac-Toe: 3^9 = 19,683 possible boards.
• Chess: ≈ 10^47 possible states.
• Go: ≈ 10^170 possible states.
• Conversation: infinite possible states.
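The tic-tac-toe count can be checked directly, since each of the 9 cells holds one of three symbols (X, O, or empty); the chess and Go figures are order-of-magnitude estimates, recorded here only for comparison:

```python
# Each of the 9 tic-tac-toe cells is X, O, or empty: 3^9 boards
# (an upper bound; it includes some unreachable positions).
tic_tac_toe_boards = 3 ** 9
print(tic_tac_toe_boards)  # 19683

# Rough state-space sizes quoted in the slides, for scale.
chess_states = 10 ** 47
go_states = 10 ** 170
print(go_states > chess_states > tic_tac_toe_boards)  # True
```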
What is Deep Reinforcement Learning?
• Combination of deep learning + reinforcement learning
• Goal: learn optimal actions that maximize reward across all states
• Works in high-dimensional, interactive environments
Deep Learning
• Function approximation in high dimensions
• Uses deep neural networks
• Examples: speech recognition, image classification
Reinforcement Learning
• Learns from trial and error, not from fixed datasets
• Feedback comes from the environment (reward / punishment)
• Builds a policy: which action to take in each state
Where DRL Fits
                  Low-Dimensional        High-Dimensional
Static Dataset    Supervised Learning    Deep Supervised Learning
Interaction       Tabular RL             Deep RL
Applications of DRL
• Robotics: locomotion, manipulation, pancake flipping, helicopters
• Games: Chess, Go, Pac-Man, StarCraft
• Real-world: healthcare, finance, recommender systems, energy grids, ChatGPT
Four Related Fields
1. Psychology
• Conditioning: Pavlov's dog
• Operant conditioning (Skinner)
• Learning from reinforcement is a core AI idea
Four Related Fields
1. Psychology
Pavlov's dog: A natural reaction to food is that a dog salivates. By ringing a bell whenever the dog is given food, the dog learns to associate the sound with food. After enough trials, the dog starts salivating as soon as it hears the bell, presumably in anticipation of the food, whether it is there or not.
Four Related Fields
2. Mathematics
• Markov Decision Processes (MDPs)
• Optimization, planning, graph theory
• Symbolic AI: search, reasoning, theorem proving
Figure: Andrei Markov (1856-1922)
Four Related Fields
3. Engineering
• Known as optimal control in engineering.
• Focus on dynamical systems.
• Bellman and Pontryagin's work in optimal control laid the foundation of RL.
Figures: Two space vehicles docking; Richard Bellman (1920-1984); Lev Pontryagin (1908-1988)
Four Related Fields
4. Biology
• Connectionism: swarm intelligence, neural networks
• Nature-inspired algorithms: ant colony, evolutionary algorithms
Figures: Biological neuron; artificial neural network; Hinton, LeCun, Bengio
Three Paradigms of Machine Learning
• Machine Learning studies how to approximate functions f : X → Y from data.
• Often, functions are not known analytically.
• Instead, we learn them from observations.
• Three main paradigms:
  1. Supervised Learning
  2. Unsupervised Learning
  3. Reinforcement Learning
Functions in AI
• A function transforms input x to output y: f(x) → y.
• More generally: f : X → Y, where X, Y can be discrete or continuous.
• Real-world functions may be stochastic: f : X → p(Y).
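A minimal sketch of the deterministic/stochastic distinction. Newton's second law serves as the deterministic example; the noisy-sensor model for the stochastic case is a hypothetical illustration, not from the slides:

```python
import random

def f_deterministic(x):
    """A given, exact function: Newton's second law, F = m * a."""
    m, a = x
    return m * a

def f_stochastic(x):
    """A stochastic function returns a sample from p(Y | x).
    Here (hypothetically): a noisy sensor reading around the true value x."""
    return x + random.gauss(0.0, 0.1)

print(f_deterministic((2.0, 9.8)))  # 19.6
print(f_stochastic(5.0))            # a different value near 5.0 on each call
```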
Given vs. Learned Functions
• Sometimes f is given exactly (laws of physics, explicit algorithms).
  – Example: Newton's 2nd Law F = m · a.
• Often, f is unknown and must be approximated from data.
• This is the domain of machine learning.
Supervised Learning
• Data: example pairs (x, y).
• Goal: learn a function f̂ that predicts y from x.
• Common tasks:
  – Regression: predict a continuous value.
  – Classification: predict a discrete category.
• Loss function measures prediction error, e.g. MSE or cross-entropy.
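The two losses named above can be written down directly; this is a minimal sketch, with illustrative example values:

```python
import math

def mse(y_true, y_pred):
    """Mean squared error: average squared difference, used for regression."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def cross_entropy(y_true, y_prob):
    """Cross-entropy for classification: y_true is a one-hot label,
    y_prob a predicted probability distribution over the classes."""
    return -sum(t * math.log(p) for t, p in zip(y_true, y_prob) if t > 0)

print(mse([1.0, 2.0], [1.5, 2.0]))        # 0.125
print(cross_entropy([0, 1], [0.2, 0.8]))  # -ln(0.8) ≈ 0.223
```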
Example: Regression
Figure: Blue: data points. Red: learned linear function ŷ = ax + b.
Example: Classification
Figure: Images labelled as Cat or Dog (Cat, Cat, Dog, Cat, Dog, Dog).
Unsupervised Learning
• No labels: only input data x.
• Goal: find structure in data (clusters, latent variables).
• Examples:
  – k-means clustering
  – Principal Component Analysis (PCA)
  – Autoencoders
• Learns p(x) instead of p(y|x).
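As an illustration of finding clusters without labels, here is a minimal k-means sketch on 1-D data; the data points, the choice of 1-D values, and the fixed iteration count are illustrative assumptions:

```python
import random

def kmeans(points, k, iters=20, seed=0):
    """Minimal 1-D k-means: alternate assignment and centroid-update steps."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            i = min(range(k), key=lambda i: abs(p - centroids[i]))
            clusters[i].append(p)
        # Update step: move each centroid to the mean of its cluster.
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return sorted(centroids)

data = [1.0, 1.2, 0.8, 9.9, 10.1, 10.0]
print(kmeans(data, k=2))  # two centroids, near 1.0 and 10.0
```

No labels are involved: the algorithm discovers the two groups purely from the structure of the inputs.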
Reinforcement Learning
• Third paradigm of machine learning
• Learns by interaction with the environment
• Data comes sequentially (one state at a time)
• Objective: learn a policy, a function mapping states to the best actions
Agent and Environment
Figure: Agent interacts with Environment to maximize reward.
• Agent: Learner/decision-maker
• Environment: Provides feedback and state transitions
• Goal: maximize long-term accumulated reward
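The agent-environment loop can be sketched with tabular Q-learning on a toy problem. The 5-state corridor environment, the +1 reward at the goal, and all hyperparameters below are illustrative assumptions, not from the slides:

```python
import random

# Hypothetical 5-state corridor: the agent starts in state 0 and
# receives reward +1 on reaching the goal state 4.
N_STATES, GOAL = 5, 4
ACTIONS = [-1, +1]  # move left, move right

def step(state, action):
    """Environment: returns (next_state, reward, done)."""
    nxt = max(0, min(GOAL, state + action))
    return nxt, (1.0 if nxt == GOAL else 0.0), nxt == GOAL

# Tabular Q-learning: estimate the value of each (state, action) pair
# from trial-and-error interaction, then act greedily.
Q = [[0.0, 0.0] for _ in range(N_STATES)]
alpha, gamma = 0.5, 0.9
rng = random.Random(0)
for episode in range(500):
    s, done, steps = 0, False, 0
    while not done and steps < 100:
        a = rng.randrange(2)  # explore with a uniformly random behaviour policy
        s2, r, done = step(s, ACTIONS[a])
        # Move Q(s, a) towards the reward plus the discounted value of s2.
        Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
        s, steps = s2, steps + 1

# The greedy policy learned from accumulated reward: always move right.
policy = [ACTIONS[max(range(2), key=lambda a: Q[s][a])] for s in range(GOAL)]
print(policy)  # [1, 1, 1, 1]
```

Note the feedback signal: the agent is never told the correct action, only a numeric reward, yet the policy it extracts maximizes long-term accumulated reward.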
Key Differences from Supervised/Unsupervised Learning
1. Interaction-based: No pre-collected dataset; data is generated dynamically via interaction between agent and environment
2. Reward signal: Partial numeric feedback, not full labels; UL has no labels, RL has reward, SL has complete labels
3. Sequential decision-making: Learns policies across multiple steps
• In RL there is no teacher or supervisor, and there is no static dataset.
• RL learns a policy for the environment by interacting with it and receiving rewards and punishments.
• SL can classify a set of images for you; UL can tell you which items belong together; RL can tell you the winning sequence of moves in a game of chess, or the action-sequence that robot-legs need to take in order to walk.
Supervised vs Reinforcement Learning
Concept     Supervised Learning       Reinforcement Learning
Inputs x    Full dataset              One state at a time
Labels y    Full (correct action)     Partial (numeric reward)
Table: Comparison of paradigms
Implications of RL Paradigm
• Data is generated step-by-step ⇒ suited for sequential problems
• Risk of circular feedback (the policy both selects actions and learns from them)
• RL can continue to learn indefinitely if the environment is challenging
• Examples: Chess, Go, robotics, conversational agents
Deep Reinforcement Learning
• Traditional RL: works on small, low-dimensional state spaces
• Many real-world problems: large, high-dimensional state spaces
• Deep RL = RL + Deep Learning
  – Handles large state spaces
  – Scales to complex tasks
• Key driver of recent breakthroughs in AI
Summary
• Deep RL = deep learning + reinforcement learning
• Solves sequential decision problems in high dimensions
• Rooted in psychology, mathematics, engineering, and biology
• Applications: robotics, games, healthcare, finance, any interactive setting