Q-Learning Algorithm

The document presents a group project on the Q-learning algorithm, a reinforcement learning technique that allows machines to learn from interactions with their environment. It covers key concepts such as the Q-function, Q-table, and the steps involved in the Q-learning algorithm, along with examples and applications. The presentation also discusses the advantages and disadvantages of Q-learning.

Uploaded by

anum.ashraf237

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

126 views13 pages

Q-Learning Algorithm

Uploaded by

anum.ashraf237

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 13

GOVT.

RABIA BASRI GRADUATE COLLEGE (W)

WALTON ROAD LAHORE

Presentation Topic: Q-learning Algorithm

Group no 6:
Samia Anwar(116)
Fatima Liaqat(105)
Mahnoor(122)
Nimra Mehboob(123)

Content:
 Reinforcement Learning Technique: Q-learning
 Some imp terms in Q-Learning
 Factors and Algorithm of Q-learning
 Steps with examples
 Advantages and disadvantages Applications

What is reinforcement learning?

 Reinforcement Learning (RL) is a branch of
machine learning
 RL allows machines to learn by interacting with an
environment and receiving feedback based on their
actions. This feedback comes is in the form
of rewards or penalties.
Q-LEARNING:
 Q-Learning means quality learning.
 It is off-policy, model-free and value-based
reinforcement learning algorithm.
 Agent has to actively learn through the experience of
interactions with the environment.
 off-policy RLA(according to situation which action is
performed on which state).
 model-free RLA(learn the consequences of their
actions through experience without transition and
reward function).
 value-based RLA(train the value function to learn
which state is more valuable and take action).
 Agent uses trail and error to determine which actions
result in rewards(good outcome) and penalties(bad
outcome or negative reward).
 The decision making of q learning is improved day
by day due to updation in q table .

Some important terms in Q-learning:

Factors of Q-learning:
 There are 2 factors of q learning i.e., Q-
function(Bellman equation) and the other one is Q-
table.
1. Q-function(Bellman Equation):
 It is a recursive formula used to calculate value of
given state and determine the optimal action.

 Q(s,a)=R(s,a)+ *max[Q(s’,a’)].
Whereas:

 Q(s,a) is the Q value for given state and action pair.

 R(s,a) is the immediate reward for taking action in
state s.
 (Gamma) is the discount factor
representing importance of future rewards.
 Max Q(s’,a’) is the maximum q value for the next
state s’ and all possible actions a’.
Q-table:
 Q table is a data structure of sets of actions and states
and we use q learning algorithm to update q values in
q table.
 Combinations of actions and states.
 State no=no. of rows
 Action no = no. of columns
 Initially q table is initialized with value=0.
 The agent will use a q table to take the best possible
action based on the expected reward for each state in
the environment
 In simple words a q table is a data structure of step
of actions ans states and we use the q learning
algorithm to update the values in the table.
Q-Learning algorithm:
Steps to follow in q learning algorithm:
 Step1:Create an initial Q-Table with all values
initialized to 0
 Step 2:Choose an action and perform it.Update value
in table.
 Step 3:Get the value of the reward and calculate the
Q-value using bellman equation(Q-function).
Step 4:Continue the same process until the table is
filled or an episode ends.
Example:
 Here Rooms: States(s) and Doors: Actions(a).

 Suppose that we have 5 rooms in a building.We will number the rooms from 0 to 4 and the
outside of building can be thought of as one big room(5).

 We can represent each room as a node (states) and each door as a link(action).

 We have to get into the room 5 that’s why Our goal state is room 5 .

 Imp points: Goal room:5

 The doors that leads immediately to room 5 have reward 100.

 Others that have been not directly connected to room5 have 0 reward.

 Where there is no link between node(states:room) then reward is -1 (invalid link).

 Discount factor gamma:0.8

Application:
References:
https://www.datacamp.com/tutorial/introduction-q-learning-beginner-tutorial

https://www.geeksforgeeks.org/q-learning-in-python/

https://youtu.be/QRMNPCsnSHk

https://youtu.be/3Rx2x2traxw

https://youtu.be/ibBEEZNQZtk

https://youtu.be/5MC8Wdo-hS8

Unit 5
No ratings yet
Unit 5
65 pages
Reinforcement Learning
No ratings yet
Reinforcement Learning
12 pages
Intro To Reinforcement Learning - DQ Q AC A3C
No ratings yet
Intro To Reinforcement Learning - DQ Q AC A3C
36 pages
Q Learning
No ratings yet
Q Learning
9 pages
Unit 5
No ratings yet
Unit 5
54 pages
AI Seminar RL
No ratings yet
AI Seminar RL
27 pages
Q Learning SARSA Deep Q Learning
No ratings yet
Q Learning SARSA Deep Q Learning
4 pages
Unit 5
No ratings yet
Unit 5
70 pages
Exp1 D16AD 60
No ratings yet
Exp1 D16AD 60
11 pages
Understanding Reinforcement Learning Basics
No ratings yet
Understanding Reinforcement Learning Basics
11 pages
Reinforcement Learning - Ipynb - Colaboratory
No ratings yet
Reinforcement Learning - Ipynb - Colaboratory
7 pages
Q Learning
No ratings yet
Q Learning
38 pages
Lec 09
No ratings yet
Lec 09
26 pages
39-Q Learning Numerical
No ratings yet
39-Q Learning Numerical
13 pages
RL Class Mtech
No ratings yet
RL Class Mtech
67 pages
Hota ML ReinforcementLearning
No ratings yet
Hota ML ReinforcementLearning
12 pages
Unit-5 MLT
No ratings yet
Unit-5 MLT
13 pages
Q Learning
No ratings yet
Q Learning
6 pages
Deep Learning Binoy-19-3-RL Q Learning
No ratings yet
Deep Learning Binoy-19-3-RL Q Learning
26 pages
Unit-5 Part C 1) Explain The Q Function and Q Learning Algorithm Assuming Deterministic Rewards and Actions With Example. Ans)
No ratings yet
Unit-5 Part C 1) Explain The Q Function and Q Learning Algorithm Assuming Deterministic Rewards and Actions With Example. Ans)
11 pages
Q Learning Ejemplo
100% (1)
Q Learning Ejemplo
11 pages
7 - Reinforcement Learning
No ratings yet
7 - Reinforcement Learning
23 pages
Q Learning
No ratings yet
Q Learning
38 pages
Simulation of The Navigation of A Mobile Robot by The Q-Learning Using Artificial Neuron Networks
No ratings yet
Simulation of The Navigation of A Mobile Robot by The Q-Learning Using Artificial Neuron Networks
12 pages
Q-Learning Implementation in OpenAI Gym
No ratings yet
Q-Learning Implementation in OpenAI Gym
34 pages
p1 Piotr
No ratings yet
p1 Piotr
7 pages
Adobe Scan Nov 18, 2024
No ratings yet
Adobe Scan Nov 18, 2024
13 pages
Reinforcement Learning with Latent Confounding
No ratings yet
Reinforcement Learning with Latent Confounding
7 pages
Q-Learning for Room Navigation Simulation
100% (1)
Q-Learning for Room Navigation Simulation
15 pages
Q Learing
No ratings yet
Q Learing
30 pages
ML - Unit 3 - Part II
No ratings yet
ML - Unit 3 - Part II
51 pages
Speeding Up Q-learning with Signal Injection
No ratings yet
Speeding Up Q-learning with Signal Injection
4 pages
Reinforedu
No ratings yet
Reinforedu
46 pages
Q-Learning: Reinforcement Learning Basic Q-Learning Algorithm Common Modifications
No ratings yet
Q-Learning: Reinforcement Learning Basic Q-Learning Algorithm Common Modifications
22 pages
Artificial Intelligence: Lecture 11 - Reinforcement Learning II Dr. Shivanjali Khare
No ratings yet
Artificial Intelligence: Lecture 11 - Reinforcement Learning II Dr. Shivanjali Khare
52 pages
Exam
No ratings yet
Exam
7 pages
Ex No4rl
No ratings yet
Ex No4rl
3 pages
Q Learning
No ratings yet
Q Learning
18 pages
ml4r 2025 05
No ratings yet
ml4r 2025 05
22 pages
Lecture Notes On Reinforcement Learning Basics
No ratings yet
Lecture Notes On Reinforcement Learning Basics
6 pages
Q Learning
No ratings yet
Q Learning
12 pages
Q-Learning for Optimal Pathfinding
No ratings yet
Q-Learning for Optimal Pathfinding
2 pages
Reinforcement Learning Basics
No ratings yet
Reinforcement Learning Basics
14 pages
Reinforcement Learning Overview
No ratings yet
Reinforcement Learning Overview
14 pages
Reinforcement Learning II
No ratings yet
Reinforcement Learning II
28 pages
Nidhish RLAI-Lab1
No ratings yet
Nidhish RLAI-Lab1
18 pages
MDPs Solving
No ratings yet
MDPs Solving
19 pages
A Painless Q-Learning Tutorial
No ratings yet
A Painless Q-Learning Tutorial
6 pages
Unit 1
No ratings yet
Unit 1
18 pages
Unit5 MLT
No ratings yet
Unit5 MLT
26 pages
Filippov Theory in ϵ-Greedy Q-Learning
No ratings yet
Filippov Theory in ϵ-Greedy Q-Learning
66 pages
Q Learning
No ratings yet
Q Learning
187 pages
4.3 Reinforcement Learning
No ratings yet
4.3 Reinforcement Learning
27 pages
Unit - 5
No ratings yet
Unit - 5
43 pages
Reinforcement Learning II
No ratings yet
Reinforcement Learning II
28 pages
Ai (It) Unit-5
No ratings yet
Ai (It) Unit-5
43 pages
3964 Double Q Learning
No ratings yet
3964 Double Q Learning
9 pages
10 Deep Reinforcement
No ratings yet
10 Deep Reinforcement
40 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
12 pages
Assid
No ratings yet
Assid
9 pages
Course Title Artificial Intelligence Lab Course Code DC-324L Credit Hours Category Prerequisite Co-Requisite Follow-Up
No ratings yet
Course Title Artificial Intelligence Lab Course Code DC-324L Credit Hours Category Prerequisite Co-Requisite Follow-Up
1 page
Artificial Intelligence Recent Trends and Applicat
No ratings yet
Artificial Intelligence Recent Trends and Applicat
13 pages
Untitled Presentation
No ratings yet
Untitled Presentation
12 pages
Miss Erum Mahood Topic: KNN Algorthim: Presentator BY: Zobia Malaika Maryam Minahil
No ratings yet
Miss Erum Mahood Topic: KNN Algorthim: Presentator BY: Zobia Malaika Maryam Minahil
10 pages
Reinforcement Learning 1
No ratings yet
Reinforcement Learning 1
14 pages
Advances in Engineering Materials: R. K. Tyagi Pallav Gupta Prosenjit Das Rajiv Prakash
No ratings yet
Advances in Engineering Materials: R. K. Tyagi Pallav Gupta Prosenjit Das Rajiv Prakash
377 pages
Transport Query - Variants &amp Layouts
No ratings yet
Transport Query - Variants &amp Layouts
15 pages
Project Stage I Report Format
No ratings yet
Project Stage I Report Format
9 pages
Lovecraft, H.P. - Selected Writings PDF
100% (1)
Lovecraft, H.P. - Selected Writings PDF
823 pages
CIPW Norm Calculation Guide
No ratings yet
CIPW Norm Calculation Guide
5 pages
A200 Contactor
No ratings yet
A200 Contactor
3 pages
Yeast Respiration Rates with Sugars
0% (1)
Yeast Respiration Rates with Sugars
2 pages
Longines - Cal. L700.2 Repair Manual - en
No ratings yet
Longines - Cal. L700.2 Repair Manual - en
17 pages
Subiecte Bilingv 2014
No ratings yet
Subiecte Bilingv 2014
2 pages
CT-01 Solutions
No ratings yet
CT-01 Solutions
44 pages
Teleprotection Systems in Power Grids
100% (2)
Teleprotection Systems in Power Grids
36 pages
Engineering Graphics Lab Manual 2021-22
No ratings yet
Engineering Graphics Lab Manual 2021-22
56 pages
Metal Proc. Mid Exam
No ratings yet
Metal Proc. Mid Exam
3 pages
Epson TM-L90 Liner-Free Compatible Label Printer Brochure
No ratings yet
Epson TM-L90 Liner-Free Compatible Label Printer Brochure
2 pages
Silex
No ratings yet
Silex
112 pages
Grade VIII ICSE Coursework Tasks
No ratings yet
Grade VIII ICSE Coursework Tasks
5 pages
RF Microcontroller Device Control
No ratings yet
RF Microcontroller Device Control
7 pages
Cube Ultrasonic Sensor Manual
No ratings yet
Cube Ultrasonic Sensor Manual
3 pages
ML06
No ratings yet
ML06
16 pages
Sheet Pile Design
100% (1)
Sheet Pile Design
51 pages
Mobile App Development Approaches
No ratings yet
Mobile App Development Approaches
15 pages
Lec#02 PDC - Design Aspects of A Process Control System
No ratings yet
Lec#02 PDC - Design Aspects of A Process Control System
19 pages
Chrysler PARTE1
No ratings yet
Chrysler PARTE1
35 pages
MSC-IT Part I Regular Sem 1 Nov 2022
No ratings yet
MSC-IT Part I Regular Sem 1 Nov 2022
7 pages
Bender Gestalt Test
86% (21)
Bender Gestalt Test
40 pages
Thin-Walled Structures: Daniel C.T. Cardoso, Barbara S. Togashi
No ratings yet
Thin-Walled Structures: Daniel C.T. Cardoso, Barbara S. Togashi
12 pages
Flex Abis
No ratings yet
Flex Abis
36 pages
SA227 Airframe Manual Update
No ratings yet
SA227 Airframe Manual Update
16 pages
Ultrasonic Sensors for Automation Solutions
No ratings yet
Ultrasonic Sensors for Automation Solutions
12 pages
Car Brochure Hyundai Ioniq PX 929 R
No ratings yet
Car Brochure Hyundai Ioniq PX 929 R
13 pages