Reinforcement Learning: Gaussian noise #3515

tareknaser · 2023-07-25T17:14:43Z

This pull request introduces the GaussianNoise class and reorganizes the reinforcement learning test suite.

Implementation Details:

GaussianNoise Class:
- GaussianNoise class provides the necessary noise object required for the DDPG algorithm.
- The flexibility of the GaussianNoise class allows for potential usage in other RL algorithms, where a noise object might be necessary for training and convergence.
Reorganization of Tests:
- The reinforcement learning test suite has been split into two files for improved code organization.
- The new policy_gradient_test.cpp file includes the tests for DDPG, TD3, and SAC.

How Has This Been Tested?:

The DDPG algorithm is tested with the integration of the GaussianNoise class for exploration.
Another test to verify that the returned noise sample has the right mean and standard deviation.

This commit adds GaussianNoise class which is just a wrapper around arma::randi that is built to interact with reinforcement learning agents that expect an object of type Noise. As of now, there is only one reinforcement learning agent that expect a Noise instance which is DDPG Signed-off-by: Tarek <[email protected]>

This commit adds 2 new tests for the GaussianNoise class: 1- GaussianNoiseTest which makes sure that the returned noise sample has the right mean and standard deviation. 2- PendulumWithGaussianDDPG which makes sure that the DDPG agent converges when supplied with a GaussianNoise object Signed-off-by: Tarek <[email protected]>

This commit separates the reinforcement learning policy gradient algorithms tests from other reinforcement learning agents' tests Signed-off-by: Tarek <[email protected]>

This allows for reusability in /tests/q_learning_test.cpp, /tests/policy_gradient_test.cpp and any future reinforcement learning testing file Signed-off-by: Tarek <[email protected]>

Signed-off-by: Tarek <[email protected]>

src/mlpack/methods/reinforcement_learning/noise/gaussian.hpp

Signed-off-by: Tarek <[email protected]>

zoq

Thanks for putting this together, no further comments from my side.

mlpack-bot

Second approval provided automatically after 24 hours. 👍

zoq · 2023-08-01T20:38:36Z

Thanks for another great contribution.

tareknaser added 4 commits July 25, 2023 20:12

feat(rl): split the reinforcement learning tests into 2 files

2ae3c08

This commit separates the reinforcement learning policy gradient algorithms tests from other reinforcement learning agents' tests Signed-off-by: Tarek <[email protected]>

feat(tests): move testAgent function to a separate file

19a8700

This allows for reusability in /tests/q_learning_test.cpp, /tests/policy_gradient_test.cpp and any future reinforcement learning testing file Signed-off-by: Tarek <[email protected]>

mlpack-bot bot added s: needs review s: unanswered s: unlabeled labels Jul 25, 2023

zoq added c: methods t: added feature and removed s: unanswered s: unlabeled labels Jul 25, 2023

add pr 3515 to history file

d1cb169

Signed-off-by: Tarek <[email protected]>

zoq reviewed Jul 28, 2023

View reviewed changes

src/mlpack/methods/reinforcement_learning/noise/gaussian.hpp Outdated Show resolved Hide resolved

fix(rl): make the variables constants for GaussianNoise

be237b6

Signed-off-by: Tarek <[email protected]>

zoq approved these changes Jul 31, 2023

View reviewed changes

mlpack-bot bot approved these changes Aug 1, 2023

View reviewed changes

mlpack-bot bot removed the s: needs review label Aug 1, 2023

zoq merged commit fea1700 into mlpack:master Aug 1, 2023

rcurtin mentioned this pull request Sep 5, 2023

Release version 4.2.1 #3533

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Reinforcement Learning: Gaussian noise #3515

Reinforcement Learning: Gaussian noise #3515

Uh oh!

tareknaser commented Jul 25, 2023

Uh oh!

Uh oh!

zoq left a comment

Uh oh!

mlpack-bot bot left a comment

Uh oh!

zoq commented Aug 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Reinforcement Learning: Gaussian noise #3515

Reinforcement Learning: Gaussian noise #3515

Uh oh!

Conversation

tareknaser commented Jul 25, 2023

Implementation Details:

How Has This Been Tested?:

Uh oh!

Uh oh!

zoq left a comment

Choose a reason for hiding this comment

Uh oh!

mlpack-bot bot left a comment

Choose a reason for hiding this comment

Uh oh!

zoq commented Aug 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants