OpenRLHF / OpenRLHF Public

Notifications You must be signed in to change notification settings
Fork 906
Star 9.3k

Code
Issues 294
Pull requests 24
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: OpenRLHF/OpenRLHF

Labels 13 Milestones 0

New pull request New

24 Open 433 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

adding CFPO to OpenRLHF

#1184 opened Feb 9, 2026 by asparius

Loading…

feat: add AgentTokenHandler with defensive token concatenation for agentic training (#1128)

#1181 opened Jan 30, 2026 by ichbinlucaskim

Loading…

feat: Switch vLLM rollout sampling to oversampling.

#1179 opened Jan 20, 2026 by Freder-chen

Loading…

feat: add the support of fsdp2 and remove deepspeed (new version of PR 1115)

#1176 opened Jan 16, 2026 by LYMDLUT

Loading…

Default overlap_comm on for ZeRO-2+ RLHF runs

#1154 opened Nov 26, 2025 by MagellaX

Loading…

Enhance PPO logging with entropy, reward stats, and grad norm insights

#1148 opened Nov 8, 2025 by MagellaX

Loading…

Star Attention topology support with model integration and --attn_topology flag

#1122 opened Aug 26, 2025 by MagellaX

Loading…

Add FSDP backend and --dist_backend flag across CLIs; introduce FSDPStrategy

#1115 opened Aug 23, 2025 by MagellaX

Loading…

CLI support for top_k

#1104 opened Aug 13, 2025 by JoNeedsSleep

Loading…

Update utils.py

#1101 opened Aug 6, 2025 by LiyuanLucasLiu

Loading…

Enable logits extraction from vLLM for training

#1075 opened Jul 9, 2025 by MooMoo-Yang

Loading…

Reward model outputting one reward per rollout.

#1037 opened May 28, 2025 by NotTheStallion

Loading…

Merge lmm-r1 for Multimodal PPO

#989 opened Apr 23, 2025 by TideDra

Loading…

Fix Tokenizer Behavior for Special Placeholder Token

#894 opened Mar 20, 2025 by YuchenFan48

Loading…

Support unbiased off-policy GRPO

#840 opened Mar 7, 2025 by LYMDLUT

Loading…

added LoRA adapter disabling for computing KL divergence in single no…

#836 opened Mar 6, 2025 by wilkincr

Loading…

Add support for max time per run

#711 opened Feb 5, 2025 by titu1994

Loading…

Support SFT and DPO training for Qwen2VL

#665 opened Jan 10, 2025 by LiuXTao

Loading…

Integrate SGLang into OpenRLHF. Non-Hybrid Engine Only

#661 opened Jan 9, 2025 by zhaochenyang20

Loading…

Support rl logging board

#658 opened Jan 9, 2025 by HarderThenHarder

Loading…

Ensure train datasets do not contain eval datasets

#594 opened Dec 17, 2024 by dingyuan-shi

Loading…

Support broadcast vllm params by chunks

#593 opened Dec 17, 2024 by zhuzilin

Loading…

Make sure there is always _some_ eval data

#582 opened Dec 13, 2024 by frrad

Loading…

Support TRL's RLOO

#553 opened Dec 4, 2024 by songxxzp

Loading…

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!