Update Input Template of RL Policy to Improve Module Flexisiblity by Jinyu-W · Pull Request #589 · microsoft/maro

Jinyu-W · 2023-03-28T08:25:12Z

Description

RL Workflow:

add customized_callbacks to RLComponentBundle
add env.tick to replace the default None in AbsEnvSampler._get_global_and_agent_state()

RL Algorithms:

fix RL algorithms to_device issue
add **kwargs to support more problem setting (e.g., Graph based ones)
- add **kwargs to RL models' forward funcs and _shape_check()
- add **kwargs to RL policies' get_action related funcs and _post_check()
- add **kwargs to choose_actions of AbsEnvSampler; remain it None in current sample() and eval()
add detached loss to the return value of update_critic() and update_actor() of current TrainOps; add default False early_stop to update_actor() of current TrainOps

Linked issue(s)/Pull request(s)

issue_number

Type of Change

Related Component

Simulation toolkit
RL toolkit
Distributed toolkit

Has Been Tested

OS:
- Windows
- Mac OS
- Linux
Python version:
- 3.7
- 3.8
- 3.9
Key information snapshot(s):

Needs Follow Up Actions

New release package
New docker image

Checklist

Add/update the related comments
Add/update the related tests
Add/update the related documentations
Update the dependent downstream modules usage

…_and_agent_state()

…ctor() of current TrainOps; add default False early_stop to update_actor() of current TrainOps

…ent sample() and eval()

lihuoran · 2023-03-29T10:52:18Z

maro/rl/policy/abs_policy.py

        self,
        states: torch.Tensor,
        actions: torch.Tensor = None,
+        **kwargs,


Is **kwargs used in _shape_check()? Or you just want to keep it for future flexibility?

Not used yet. Only for future flexibility.

Jinyu Wang added 7 commits March 28, 2023 06:00

add customized_callbacks to RLComponentBundle

46e23a9

add env.tick to replace the default None in AbsEnvSampler._get_global…

c2f5ab4

…_and_agent_state()

fix rl algorithms to_device issue

cd8d830

add kwargs to RL models' forward funcs and _shape_check()

cc26924

add kwargs to RL policies' get_action related funcs and _post_check()

fafec64

add detached loss to the return value of update_critic() and update_a…

1e1c4ef

…ctor() of current TrainOps; add default False early_stop to update_actor() of current TrainOps

add kwargs to choose_actions of AbsEnvSampler; remain it None in curr…

3697861

…ent sample() and eval()

Jinyu-W requested a review from lihuoran March 28, 2023 08:25

Jinyu Wang added 2 commits March 28, 2023 08:50

ufix line length issue

7dde9a9

fix line break issue

5661e87

Jinyu-W mentioned this pull request Mar 28, 2023

V0.3: Upgrade RL Workflow; Add RL Benchmarks; Update Package Version #588

Merged

21 tasks

lihuoran reviewed Mar 29, 2023

View reviewed changes

lihuoran approved these changes Mar 29, 2023

View reviewed changes

Jinyu-W changed the title ~~V0.3 update rl input~~ Update Input Template of RL Policy to Improve Module Flexisiblity Mar 29, 2023

Jinyu-W merged commit 71157f8 into v0.3 Mar 29, 2023

Jinyu-W deleted the v0.3_update_rl_input branch March 29, 2023 13:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update Input Template of RL Policy to Improve Module Flexisiblity#589

Update Input Template of RL Policy to Improve Module Flexisiblity#589
Jinyu-W merged 9 commits intov0.3from
v0.3_update_rl_input

Jinyu-W commented Mar 28, 2023

Uh oh!

lihuoran Mar 29, 2023

Uh oh!

Jinyu-W Mar 29, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Jinyu-W commented Mar 28, 2023

Description

Linked issue(s)/Pull request(s)

Type of Change

Related Component

Has Been Tested

Needs Follow Up Actions

Checklist

Uh oh!

lihuoran Mar 29, 2023

Choose a reason for hiding this comment

Uh oh!

Jinyu-W Mar 29, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants