Reinforcement Learning: An Overview
Reinforcement Learning (RL) is a branch of machine learning focused on making
sequences of decisions that maximize cumulative reward. Unlike supervised
learning, which relies on a training dataset with predefined answers, RL involves
learning through experience. In RL, an agent learns to achieve a goal in an
uncertain, potentially complex environment by performing actions and receiving
feedback in the form of rewards or penalties.
Key Concepts of Reinforcement Learning
Agent: The learner or decision-maker (e.g., a robot or a game character).
Environment: Everything the agent interacts with (e.g., a maze, a game, or a simulated world).
State: The specific situation the agent currently finds itself in (e.g., the robot's position in the maze).
Action: Any of the possible moves the agent can make (e.g., move up, down, left, or right).
Reward: Feedback from the environment based on the action taken; it can be positive or negative.
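To see how these pieces fit together, here is a minimal sketch of the agent-environment loop in Python. The SimpleMaze class, its reset/step methods, and the reward numbers are invented for illustration and are not a specific library's API.

    import random

    class SimpleMaze:
        # Hypothetical environment: a short corridor; the goal is cell 4.
        def reset(self):
            self.position = 0                      # the starting state
            return self.position

        def step(self, action):
            # action is -1 (move left) or +1 (move right)
            self.position = max(0, min(4, self.position + action))
            done = self.position == 4              # episode ends at the goal
            reward = 10 if done else -1            # +10 at the goal, -1 per step
            return self.position, reward, done

    env = SimpleMaze()
    state = env.reset()
    done, total_reward = False, 0
    while not done:
        action = random.choice([-1, +1])           # a not-yet-trained, random policy
        state, reward, done = env.step(action)     # feedback from the environment
        total_reward += reward
    print("episode reward:", total_reward)

A learning agent would replace the random choice with a policy that improves from the rewards it observes, as described below.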
What Are Rewards and Penalties in RL?
Rewards: Positive feedback given to the agent when it performs an action
that helps achieve its goal.
Example: A robot gets a reward of +10 points for successfully navigating to the end of a maze.
Penalties: Negative feedback given to the agent when it makes a mistake or
takes an action that hinders its progress.
Example: The same robot gets a penalty of -5 points if it crashes into a wall.
Example to Understand Rewards and Penalties:
Imagine you are training a robot dog to fetch a ball:
If the robot moves toward the ball, it gets a reward (e.g., +1 point).
If the robot moves away from the ball, it gets a penalty (e.g., -1 point).
If it picks up the ball and returns it to you, it gets a big reward (e.g., +50
points).
The robot learns through trial and error by trying different actions, receiving
feedback (rewards or penalties), and gradually improving its decisions to maximize
its total score.
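One simple way to encode this feedback is a hand-written reward function. The sketch below just mirrors the numbers in the example; the function name and its arguments are hypothetical.

    def fetch_reward(previous_distance, new_distance, returned_ball):
        # Reward signal for the robot-dog example: +50 for completing the fetch,
        # +1 for moving closer to the ball, -1 for moving away.
        if returned_ball:
            return 50
        if new_distance < previous_distance:
            return 1
        return -1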
How RL Differs from Other Types of Learning:
No predefined answers: Unlike supervised learning, where a dataset
contains labeled examples (e.g., input and correct output), RL doesn’t give
the agent direct instructions on what to do.
Learning through interaction: The agent learns by exploring the
environment, taking actions, and observing their consequences.
Key Idea:
The agent learns a policy (a strategy) that helps it decide the best action to take in
each situation. Over time, it becomes better at selecting actions that maximize
long-term rewards.
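A policy can be as simple as a lookup that scores every action in every state and picks the best one. The states and scores below are made up purely to illustrate the idea.

    # Illustrative action scores the agent might have learned: Q[state][action]
    Q = {
        "near_wall":  {"up": 0.2, "down": 0.1, "left": -0.5, "right": 0.4},
        "open_space": {"up": 0.9, "down": 0.0, "left": 0.3,  "right": 0.1},
    }

    def policy(state):
        # Greedy policy: in each state, choose the highest-scoring action.
        return max(Q[state], key=Q[state].get)

    print(policy("near_wall"))    # -> right
    print(policy("open_space"))   # -> up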
Another Example:
Let’s train a robot to cross a street:
1. State: The robot’s current position and whether the traffic light is green or
red.
2. Actions: Walk, stop, or wait.
3. Reward:
+10 for safely crossing.
-10 for walking when the light is red (penalty).
By repeatedly trying actions and adjusting based on feedback, the robot learns
when to cross safely.
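As a sketch of that trial-and-error process, the snippet below encodes the street-crossing rewards in a small table (simplified to two actions) and keeps a running average of the feedback for each state-action pair. The number of trials and the averaging scheme are arbitrary illustrative choices.

    import random

    # The street-crossing example as a small reward table (two actions for brevity).
    REWARD = {
        ("green", "walk"): 10,    # safely crossing
        ("green", "wait"): 0,
        ("red",   "walk"): -10,   # walking on red is penalized
        ("red",   "wait"): 0,
    }

    value = {key: 0.0 for key in REWARD}   # running average reward per (state, action)
    count = {key: 0 for key in REWARD}

    for _ in range(1000):                           # repeated trials
        state = random.choice(["green", "red"])     # the light the robot happens to see
        action = random.choice(["walk", "wait"])    # try both actions over time
        reward = REWARD[(state, action)]            # feedback from the environment
        count[(state, action)] += 1
        value[(state, action)] += (reward - value[(state, action)]) / count[(state, action)]

    for state in ("green", "red"):
        best = max(("walk", "wait"), key=lambda a: value[(state, a)])
        print(state, "->", best)                    # green -> walk, red -> wait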
Model-Based RL involves creating or learning a model of the environment. This
model predicts how the environment changes when actions are taken (state
transitions). The agent uses the model to simulate outcomes and plan its actions
without having to interact with the real environment all the time.
When to Use Model-Based RL:
1. Well-Defined and Unchanging Environments: Example: Chess or other board
games, where the rules are fixed and well understood.
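To illustrate the model-based idea, the sketch below assumes the transition and reward rules of a tiny corridor world are known in advance, so the agent can plan entirely inside the model (here with a few sweeps of value iteration) before ever acting. All numbers are illustrative.

    # A fully known model of a 5-cell corridor: the agent can plan with it
    # instead of interacting with the real environment.
    GOAL = 4

    def next_state(state, action):           # action: -1 (left) or +1 (right)
        return max(0, min(GOAL, state + action))

    def reward(state, action):
        return 10 if next_state(state, action) == GOAL else -1

    # Value iteration: repeatedly back up values through the model.
    values = [0.0] * (GOAL + 1)
    for _ in range(50):
        for s in range(GOAL):                 # the goal cell is terminal
            values[s] = max(reward(s, a) + 0.9 * values[next_state(s, a)]
                            for a in (-1, +1))

    # The resulting plan: in each state, the action the model says is best.
    plan = [max((-1, +1), key=lambda a: reward(s, a) + 0.9 * values[next_state(s, a)])
            for s in range(GOAL)]
    print(plan)                               # -> [1, 1, 1, 1]: head straight for the goal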
Model-Free RL skips building a model of the environment.
Instead, the agent learns through trial and error by directly
interacting with the environment. It gradually learns the best
actions to take by observing the rewards or penalties it
receives.
When to Use Model-Free RL:
1. Large, Complex, and Unpredictable Environments:
Example: A self-driving car navigating traffic, where
conditions vary widely (weather, road rules, other vehicles).
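By contrast, a model-free agent never consults the environment's rules; it only updates its estimates from the experience it samples. Below is a minimal tabular Q-learning sketch on the same kind of corridor; the learning rate, discount, and exploration rate are arbitrary illustrative values.

    import random

    GOAL = 4
    Q = {(s, a): 0.0 for s in range(GOAL + 1) for a in (-1, +1)}
    alpha, gamma, epsilon = 0.1, 0.9, 0.2     # learning rate, discount, exploration rate

    def env_step(state, action):
        # The environment's rules are hidden from the agent; it only sees the outcome.
        nxt = max(0, min(GOAL, state + action))
        return nxt, (10 if nxt == GOAL else -1)

    for _ in range(500):                      # episodes of trial and error
        state = 0
        while state != GOAL:
            if random.random() < epsilon:     # sometimes explore...
                action = random.choice((-1, +1))
            else:                             # ...otherwise exploit current estimates
                action = max((-1, +1), key=lambda a: Q[(state, a)])
            nxt, r = env_step(state, action)
            # Q-learning update, driven purely by the sampled experience
            best_next = max(Q[(nxt, a)] for a in (-1, +1))
            Q[(state, action)] += alpha * (r + gamma * best_next - Q[(state, action)])
            state = nxt

    # Expected result: the agent learns to move right (+1) in every state.
    print([max((-1, +1), key=lambda a: Q[(s, a)]) for s in range(GOAL)])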
In summary:
Model-Based RL is ideal for environments that are predictable and where
real-world testing is costly or impractical.
Model-Free RL shines in environments that are unpredictable, complex, or
easy to interact with directly for learning.