-
Notifications
You must be signed in to change notification settings - Fork 906
Pull requests: OpenRLHF/OpenRLHF
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: add AgentTokenHandler with defensive token concatenation for agentic training (#1128)
#1181
opened Jan 30, 2026 by
ichbinlucaskim
Loading…
feat: Switch vLLM rollout sampling to oversampling.
#1179
opened Jan 20, 2026 by
Freder-chen
Loading…
feat: add the support of fsdp2 and remove deepspeed (new version of PR 1115)
#1176
opened Jan 16, 2026 by
LYMDLUT
Loading…
Enhance PPO logging with entropy, reward stats, and grad norm insights
#1148
opened Nov 8, 2025 by
MagellaX
Loading…
Star Attention topology support with model integration and --attn_topology flag
#1122
opened Aug 26, 2025 by
MagellaX
Loading…
Add FSDP backend and --dist_backend flag across CLIs; introduce FSDPStrategy
#1115
opened Aug 23, 2025 by
MagellaX
Loading…
Reward model outputting one reward per rollout.
#1037
opened May 28, 2025 by
NotTheStallion
Loading…
Fix Tokenizer Behavior for Special Placeholder Token
#894
opened Mar 20, 2025 by
YuchenFan48
Loading…
added LoRA adapter disabling for computing KL divergence in single no…
#836
opened Mar 6, 2025 by
wilkincr
Loading…
Integrate SGLang into OpenRLHF. Non-Hybrid Engine Only
#661
opened Jan 9, 2025 by
zhaochenyang20
Loading…
Ensure train datasets do not contain eval datasets
#594
opened Dec 17, 2024 by
dingyuan-shi
Loading…
ProTip!
Mix and match filters to narrow down what you’re looking for.