Reinforcement Learning Syllabus

Uploaded by

Husein Yusuf

Course Number

Course Title                Reinforcement Learning

ECTS Credits                5 ECTS (3 Cr.)

Contact Hours (per week)    Lectures: 2, Tutorial: 0, Practice or Laboratory: 3

Course Objectives & Competencies to be Acquired

The objectives of this course include:
● Understanding Reinforcement Learning Concepts: The
course aims to provide a solid understanding of the
fundamental concepts and principles of reinforcement
learning, including Markov Decision Processes, value
functions, policies, exploration, exploitation, and the
trade-off between exploration and exploitation.
● Mastering Reinforcement Learning Algorithms: The
course focuses on teaching various reinforcement learning
algorithms and techniques, such as Q-learning, policy
gradients, Monte Carlo methods, temporal difference
learning, and model-based approaches. Students learn how
these algorithms work, their strengths, limitations, and
how to apply them to solve different types of problems.
● Solving Real-World Problems: The course aims to equip
students with the skills to apply reinforcement learning to
real-world problems. It covers topics such as function
approximation, handling continuous state and action
spaces, dealing with high-dimensional inputs, and
incorporating deep neural networks into reinforcement
learning algorithms.
● Understanding Exploration and Exploitation: The course
explores the challenges of balancing exploration (gaining
new knowledge) and exploitation (using existing
knowledge) in reinforcement learning. Students learn
about exploration strategies, such as epsilon-greedy,
softmax, and UCB (Upper Confidence Bound), and how
to design effective exploration policies.
● Deep Reinforcement Learning: The course covers
advanced topics in deep reinforcement learning, which
involve combining deep neural networks with
reinforcement learning algorithms. Students learn about
Deep Q-Networks (DQN), actor-critic methods, policy
gradients with neural networks, and other state-of-the-art
techniques used in deep reinforcement learning.
● Evaluating and Analyzing Reinforcement Learning
Agents: The course teaches students how to evaluate and
analyze the performance of reinforcement learning agents.
This includes understanding performance metrics,
conducting experiments, analyzing learning curves, and
assessing the robustness and generalization capabilities of
learned policies.
● Applications and Case Studies: The course explores
various applications of reinforcement learning across
different domains, such as robotics, game playing,
recommendation systems, autonomous systems, and
resource management. Students learn about successful
case studies and gain insights into how reinforcement
learning can be used to solve complex problems.
● Ethical Considerations: The course addresses ethical
considerations and challenges in reinforcement learning,
such as fairness, bias, safety, and interpretability. Students
learn about the societal impact of reinforcement learning
algorithms and discuss ethical guidelines and responsible
use of these techniques.
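
The exploration strategies named in the objectives above (epsilon-greedy and UCB) can be illustrated on a multi-armed bandit. The following is a minimal sketch, not course material: the function names, the Gaussian reward model, and the constant `c` are assumptions chosen for illustration.

```python
import math
import random

def epsilon_greedy(q_values, epsilon, rng):
    """With probability epsilon pick a random arm, otherwise the greedy arm."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])

def ucb1(q_values, counts, t, c=2.0):
    """Pick the arm with the largest estimate plus exploration bonus."""
    # Any arm that has never been tried is selected first,
    # so the bonus term below is always well defined.
    for a, n in enumerate(counts):
        if n == 0:
            return a
    return max(
        range(len(q_values)),
        key=lambda a: q_values[a] + c * math.sqrt(math.log(t) / counts[a]),
    )

def run_bandit(select, true_means, steps, seed=0):
    """Run one bandit experiment (hypothetical Gaussian rewards, unit variance).

    `select(q, n, t, rng)` returns the arm to pull at time step t.
    Returns the value estimates and pull counts per arm.
    """
    rng = random.Random(seed)
    k = len(true_means)
    q = [0.0] * k          # running mean reward per arm
    n = [0] * k            # number of pulls per arm
    for t in range(1, steps + 1):
        a = select(q, n, t, rng)
        reward = rng.gauss(true_means[a], 1.0)
        n[a] += 1
        q[a] += (reward - q[a]) / n[a]   # incremental mean update
    return q, n
```

A possible usage is `run_bandit(lambda q, n, t, rng: epsilon_greedy(q, 0.1, rng), [0.1, 0.5, 0.9], 2000)`, and the same call with `ucb1(q, n, t)` in the lambda. Comparing the pull counts of the two runs shows how each strategy divides its effort between exploring and exploiting.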

Course Outcomes

The outcomes of this course include:

● Knowledge Acquisition: Students will acquire a
comprehensive understanding of the theoretical
foundations, concepts, and principles of reinforcement
learning. They will develop a solid knowledge base of the
key components, such as Markov Decision Processes,
value functions, policies, exploration-exploitation
trade-offs, and various reinforcement learning algorithms.
● Algorithm Implementation: Students will gain practical
experience in implementing and applying reinforcement
learning algorithms. They will be able to write code and
develop software systems to simulate environments, train
agents, and evaluate their performance. Students will
become proficient in implementing algorithms like
Q-learning, policy gradients, and value iteration.
● Problem Solving: Students will develop problem-solving
skills specific to reinforcement learning. They will learn
how to analyze real-world problems and formulate them
as reinforcement learning tasks. They will be able to apply
appropriate algorithms, tune hyperparameters, and
evaluate the effectiveness of different approaches in
solving the given problems.
● Experimental Design and Evaluation: Students will learn
how to design experiments to evaluate the performance of
reinforcement learning agents. They will acquire skills in
collecting and analyzing data, interpreting experimental
results, and drawing meaningful conclusions. Students
will be able to assess the strengths and weaknesses of
different algorithms based on empirical evaluation.
● Application to Real-World Scenarios: Students will gain
the ability to apply reinforcement learning techniques to
real-world domains and scenarios. They will understand
how to adapt and extend reinforcement learning
algorithms to handle complex and practical challenges.
Students will be able to identify appropriate applications
for reinforcement learning and propose effective solutions.
● Critical Thinking and Analysis: Students will develop
critical thinking skills by analyzing and evaluating the
theoretical and practical aspects of reinforcement learning.
They will be able to assess the advantages, limitations,
and trade-offs associated with different algorithms and
approaches. Students will be encouraged to think
creatively and propose innovative solutions to
reinforcement learning problems.
● Communication and Collaboration: Students will enhance
their communication and collaboration skills through
group projects, presentations, and discussions. They will
be able to effectively convey their ideas, present their
findings, and engage in constructive discussions related to
reinforcement learning concepts, algorithms, and
applications.
● Ethical Considerations: Students will develop an
understanding of the ethical implications and societal
impact of reinforcement learning. They will be aware of
issues such as fairness, transparency, and accountability in
the deployment of reinforcement learning systems.
Students will be encouraged to think critically about the
ethical use of reinforcement learning techniques and
consider the broader implications.
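
As an illustration of the kind of implementation the outcomes above refer to, value iteration can be sketched in a few lines. This is a sketch under stated assumptions: the `transitions` data layout and the three-state toy MDP in the usage note are hypothetical and not taken from the course.

```python
def value_iteration(transitions, gamma=0.9, tol=1e-8):
    """Value iteration for a small finite MDP.

    transitions[s][a] is a list of (probability, next_state, reward)
    tuples; a state with no actions is treated as terminal.
    Returns (values, greedy_policy).
    """
    def backup(outcomes, values):
        # Expected one-step return of an action under the current values.
        return sum(p * (r + gamma * values[s2]) for p, s2, r in outcomes)

    values = {s: 0.0 for s in transitions}
    while True:
        delta = 0.0
        for s, actions in transitions.items():
            if not actions:          # terminal state keeps value 0
                continue
            best = max(backup(outcomes, values) for outcomes in actions.values())
            delta = max(delta, abs(best - values[s]))
            values[s] = best         # in-place (Gauss-Seidel style) update
        if delta < tol:
            break

    # Extract the policy that is greedy with respect to the final values.
    policy = {
        s: max(actions, key=lambda a: backup(actions[a], values))
        for s, actions in transitions.items()
        if actions
    }
    return values, policy
```

For example, with the toy MDP `{"A": {"stay": [(1.0, "A", 0.0)], "go": [(1.0, "B", 0.0)]}, "B": {"finish": [(1.0, "T", 1.0)], "back": [(1.0, "A", 0.0)]}, "T": {}}`, the greedy policy moves from A to B and then finishes.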
Course Contents

Lecture 1: Introduction to Reinforcement Learning
Lecture 2: Exploration & Control
    Exploration
        Epsilon-Greedy
        Upper Confidence Bound (UCB)
        Thompson Sampling
        Optimistic Initialization
        Boltzmann Exploration
        Upper Confidence Trees (UCT)
    Exploitation
        Greedy Algorithm
        Q-Learning
        Policy Gradient Methods
        Actor-Critic Methods
        Deterministic Policy Optimization (DPO)
Lecture 3: MDPs & Dynamic Programming
    Policy Evaluation
    Policy Improvement
    Value Iteration
    Policy Iteration
Lecture 4: Theoretical Fundamentals of Dynamic Programming Algorithms [reading]
    Principle of Optimality
    Bellman Equations
    Value Function Iteration
    Policy Iteration
    Optimal Bellman Operator
Lecture 5: Model-Free Prediction
    Monte Carlo Methods
    Temporal Difference (TD) Learning
    TD(0) or One-Step TD
    TD($\lambda$) or Multi-Step TD
Lecture 6: Model-Free Control
    Q-Learning
    SARSA (State-Action-Reward-State-Action)
    Deep Q-Networks (DQN)
    Policy Gradient Methods
    Actor-Critic Methods
Lecture 7: Function Approximation
    Parametric Function
    Training Data
    Loss Function and Optimization
    Generalization
Lecture 8: Planning & Models
    Model-Based RL
    Model-Based Planning
    Model-Free RL
    Exploration-Exploitation Tradeoff
Lecture 9: Policy-Gradient & Actor-Critic Methods
    Policy
    Policy + Value
    Advantage Actor-Critic (A2C), Asynchronous Advantage Actor-Critic (A3C), and Proximal Policy Optimization (PPO)
Lecture 10: Approximate Dynamic Programming
    Complex Sequential Decision Problems
    Value Iteration, Policy Iteration, Q-Learning, SARSA, Approximate Policy Iteration (API), and Dual Heuristic Programming (DHP)
Lecture 11: Multi-Step & Off-Policy
    n-step SARSA
    n-step Q-learning
    Expected SARSA
    Q-learning with Experience Replay
    TD($\lambda$)
Lecture 12: Deep Reinforcement Learning #1
Lecture 13: Deep Reinforcement Learning #2
    Deep Q-Network (DQN)
    Proximal Policy Optimization (PPO)
    Deep Deterministic Policy Gradient (DDPG)
    Trust Region Policy Optimization (TRPO)
    Twin Delayed Deep Deterministic Policy Gradient (TD3)
    Soft Actor-Critic (SAC)
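
Several of the lectures listed above build on the tabular Q-learning update, Q(s, a) ← Q(s, a) + α (r + γ max_a' Q(s', a') − Q(s, a)). The sketch below shows that update on a hypothetical five-state chain. The environment, the hyperparameters, and all function names are assumptions made for illustration only.

```python
import random
from collections import defaultdict

ACTIONS = (-1, +1)   # move left / move right
GOAL = 4             # states are 0..4; reaching state 4 ends the episode

def step(state, action):
    """Deterministic chain: reward 1 only when the goal is reached."""
    nxt = min(max(state + action, 0), GOAL)
    done = nxt == GOAL
    return nxt, (1.0 if done else 0.0), done

def greedy(q, state, rng):
    """Greedy action with random tie-breaking."""
    best = max(q[(state, a)] for a in ACTIONS)
    return rng.choice([a for a in ACTIONS if q[(state, a)] == best])

def q_learning_update(q, s, a, r, s_next, done, alpha, gamma):
    """One off-policy TD update towards r + gamma * max_a' Q(s', a')."""
    target = r if done else r + gamma * max(q[(s_next, b)] for b in ACTIONS)
    q[(s, a)] += alpha * (target - q[(s, a)])

def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0, max_steps=100):
    """Epsilon-greedy Q-learning; returns the learned Q-table."""
    rng = random.Random(seed)
    q = defaultdict(float)           # unseen (state, action) pairs start at 0
    for _ in range(episodes):
        s = 0
        for _ in range(max_steps):   # cap episode length as a safeguard
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)
            else:
                a = greedy(q, s, rng)
            s_next, r, done = step(s, a)
            q_learning_update(q, s, a, r, s_next, done, alpha, gamma)
            s = s_next
            if done:
                break
    return q
```

Replacing the `max` in the target with the value of the action actually taken in `s_next` would turn this into SARSA, the on-policy counterpart listed under Lecture 6.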

Pre-requisites    Linear Algebra, Probability and Statistics, Fundamentals of Machine Learning

Teaching & Learning Methods    Lectures, assignments, projects, and exercises

Assessment/Evaluation & Grading System

● Mid Exam - 15
● Seminar - 15
    ○ Three Seminars
● Lab Work and Quizzes - 15
● Project - 30
● Final Exam - 25

Attendance Requirements 85% attendance is required.

References    Textbook:
Richard S. Sutton and Andrew G. Barto, "Reinforcement Learning: An Introduction"

Target Institute: Deep Mind
