Open navigation menu

Scribd

0% found this document useful (0 votes)

54 views26 pages

Lecture 34 - Model Based Reinforcement Learning

The document covers Lecture #34 of AI-832 on Model Based Reinforcement Learning, led by Dr. Zuhair Zafar. It discusses the differences between model-based and model-free reinforcement learning, the advantages of model-based approaches, and the concept of model learning, including model-based Monte Carlo methods for estimating transition probabilities and rewards. The lecture also addresses the challenges of planning with inaccurate models.

Uploaded by

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

54 views26 pages

Lecture 34 - Model Based Reinforcement Learning

The document covers Lecture #34 of AI-832 on Model Based Reinforcement Learning, led by Dr. Zuhair Zafar. It discusses the differences between model-based and model-free reinforcement learning, the advantages of model-based approaches, and the concept of model learning, including model-based Monte Carlo methods for estimating transition probabilities and rewards. The lecture also addresses the challenges of planning with inaccurate models.

Uploaded by

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

AI-832 Reinforcement Learning

Instructor: Dr. Zuhair Zafar

Lecture # 34: Model Based Reinforcement Learning

Recap

• Actor Critic Methods

• What is Actor?

• What is Critic?

• Can we reduce variance in Actor Critic Methods? How?

Today’s Agenda

• Model Based Reinforcement Learning

Model-Based Reinforcement Learning
Model-Based and Model-Free RL
Model-Based and Model-Free RL
Model-Based RL
Advantages of Model-Based RL
What is a Model?
Model Learning
Table Lookup Model
AB Example
Planning with a Model
Sample-Based Planning
Back to the AB Example
Planning with an Inaccurate Model
Model Based Monte Carlo

• In model-based Monte Carlo, the idea is to estimate transition

probabilities and rewards from the data.

• By looking thousand of samples, the model-based Monte Carlo can fairly

accurately estimate the average transition probabilities and rewards.

• The estimated values might not be 100% accurate as they are estimated
from the data.

• By using policy evaluation and value iteration, the optimal utility can be
calculated.
Problem: Model Based Monte Carlo

You might also like

20CM1111
No ratings yet
20CM1111
3 pages
RL Unit 2
No ratings yet
RL Unit 2
10 pages
5SC28 Machine Learning For Systems and Control
No ratings yet
5SC28 Machine Learning For Systems and Control
68 pages
L13 Reinforcement Learning
No ratings yet
L13 Reinforcement Learning
57 pages
Model-Based Reinforcement Learning
No ratings yet
Model-Based Reinforcement Learning
41 pages
ReinforcementLearningAssign2 1)
No ratings yet
ReinforcementLearningAssign2 1)
7 pages
Unit-5 Mla
No ratings yet
Unit-5 Mla
22 pages
Unit-3 Unit-3 RL Problems, Prediction and Control P 241111 181426
No ratings yet
Unit-3 Unit-3 RL Problems, Prediction and Control P 241111 181426
15 pages
IntroductiontoRL BR
No ratings yet
IntroductiontoRL BR
22 pages
Introduction to Reinforcement Learning
No ratings yet
Introduction to Reinforcement Learning
7 pages
RL Unit 1
100% (1)
RL Unit 1
26 pages
Model-Based Reinforcement Learning Overview
No ratings yet
Model-Based Reinforcement Learning Overview
56 pages
CMPE257 - W10C13 - Reinforcement Learning
No ratings yet
CMPE257 - W10C13 - Reinforcement Learning
161 pages
ML 10
No ratings yet
ML 10
9 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
5 pages
Unit-5 (AI)
No ratings yet
Unit-5 (AI)
21 pages
PowerPoint Presentation
No ratings yet
PowerPoint Presentation
35 pages
L-14 - Reinforcement-L-d-07062024-111949am
No ratings yet
L-14 - Reinforcement-L-d-07062024-111949am
22 pages
What Is Reinforcement Learning
No ratings yet
What Is Reinforcement Learning
15 pages
Serge Levine Course Introduction To Reinforcement Learning 3: RL Introduction
No ratings yet
Serge Levine Course Introduction To Reinforcement Learning 3: RL Introduction
46 pages
ML - Unit-3 - Reinforcement Learning
No ratings yet
ML - Unit-3 - Reinforcement Learning
47 pages
Artificial Intelligence: Computer Science & Engineering, Khulna University
No ratings yet
Artificial Intelligence: Computer Science & Engineering, Khulna University
30 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
5 pages
Understanding Reinforcement Learning Basics
No ratings yet
Understanding Reinforcement Learning Basics
26 pages
RL Unit - Iii
No ratings yet
RL Unit - Iii
20 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
30 pages
Algorithm For RL
No ratings yet
Algorithm For RL
99 pages
20ai903 - RL - Unit 4
No ratings yet
20ai903 - RL - Unit 4
49 pages
Reinforcement Learning Basics
No ratings yet
Reinforcement Learning Basics
51 pages
Artificial Intelligence: Lecture 10 - Reinforcement Learning Prof. Shivanjali Khare
No ratings yet
Artificial Intelligence: Lecture 10 - Reinforcement Learning Prof. Shivanjali Khare
45 pages
Intro to Reinforcement Learning
No ratings yet
Intro to Reinforcement Learning
9 pages
Unit 3
No ratings yet
Unit 3
29 pages
Unit-6 Reinforcement Learning
No ratings yet
Unit-6 Reinforcement Learning
75 pages
Introduction To Reinforcement Learning: Instructor: Sergey Levine UC Berkeley
No ratings yet
Introduction To Reinforcement Learning: Instructor: Sergey Levine UC Berkeley
46 pages
Lecture 10
No ratings yet
Lecture 10
25 pages
Reinforcement Learning - Introduction
No ratings yet
Reinforcement Learning - Introduction
12 pages
Fundamentals of Reinforcement Learning
No ratings yet
Fundamentals of Reinforcement Learning
33 pages
Unit-8 - Reinforcement Learning
No ratings yet
Unit-8 - Reinforcement Learning
52 pages
SP14 CS188 Lecture 10 - Reinforcement Learning I
No ratings yet
SP14 CS188 Lecture 10 - Reinforcement Learning I
35 pages
Unit 4
No ratings yet
Unit 4
56 pages
L11 Reinforcement Learning 1
No ratings yet
L11 Reinforcement Learning 1
18 pages
RL Presentation2
No ratings yet
RL Presentation2
19 pages
Reinforcemnet Learning
No ratings yet
Reinforcemnet Learning
8 pages
Reinforcement
No ratings yet
Reinforcement
9 pages
Reinforcement Learning Basics and Beyond
No ratings yet
Reinforcement Learning Basics and Beyond
1 page
Reinforcement Learning: Monte Carlo in Reinforcement Learning
No ratings yet
Reinforcement Learning: Monte Carlo in Reinforcement Learning
15 pages
Unit 3
No ratings yet
Unit 3
12 pages
Alg RLearning Ejemplo
No ratings yet
Alg RLearning Ejemplo
99 pages
Monte Carlo Methods in AI & Data Science
No ratings yet
Monte Carlo Methods in AI & Data Science
40 pages
RL Algorithms in Gymnasium
No ratings yet
RL Algorithms in Gymnasium
59 pages
Lec 10
No ratings yet
Lec 10
50 pages
Unit4 (AI) 2024 Docx-1
No ratings yet
Unit4 (AI) 2024 Docx-1
22 pages
2025 Reinforcement Learning Basics
No ratings yet
2025 Reinforcement Learning Basics
6 pages
Reinforcement Learning With Python
No ratings yet
Reinforcement Learning With Python
24 pages
A (Long) Peek Into Reinforcement Learning - Lil'Log
No ratings yet
A (Long) Peek Into Reinforcement Learning - Lil'Log
23 pages
Lecture 30 Reinforcement-Learning
No ratings yet
Lecture 30 Reinforcement-Learning
50 pages
Module 1
No ratings yet
Module 1
72 pages
Lec10 - Interaction
No ratings yet
Lec10 - Interaction
40 pages
CS-878 Lecture-02 Logistic Regression
No ratings yet
CS-878 Lecture-02 Logistic Regression
55 pages
Lecture W7ab
No ratings yet
Lecture W7ab
21 pages
Self Reading - KNN - Notes
No ratings yet
Self Reading - KNN - Notes
7 pages
Lecture W3
No ratings yet
Lecture W3
28 pages
Lecture W5ab
No ratings yet
Lecture W5ab
56 pages
Lecture W6b
No ratings yet
Lecture W6b
33 pages
Lesson 8-Image Segmentation - Traditional Approaches
No ratings yet
Lesson 8-Image Segmentation - Traditional Approaches
35 pages
Lecture 14 15 - Temporal Difference Learning, Lambda-Return, Backward View of TD (Lambda)
No ratings yet
Lecture 14 15 - Temporal Difference Learning, Lambda-Return, Backward View of TD (Lambda)
26 pages
Eigen Values and Eigen Vectors
No ratings yet
Eigen Values and Eigen Vectors
53 pages
Lecture 11 12 - Model Free Prediction, Monte-Carlo Learning, Temporal Difference Learning
No ratings yet
Lecture 11 12 - Model Free Prediction, Monte-Carlo Learning, Temporal Difference Learning
24 pages
Lecture 19 - Model-Free Control, Off-Policy Learning
No ratings yet
Lecture 19 - Model-Free Control, Off-Policy Learning
9 pages
Lecture 35 36 - Exploration vs. Exploitation
No ratings yet
Lecture 35 36 - Exploration vs. Exploitation
18 pages
Lecture 22 - Value Function Approximation
No ratings yet
Lecture 22 - Value Function Approximation
17 pages