- 🔭 RL+Reasoning, RL+Agent, PEFT
- 🌱 PhD-ing at University of Amsterdam, Interning at Microsoft Research
- 😄 Welcome to my ⭐Personal Homepage⭐
🎯 Focusing
PhD candidate @ltl-uva for NLP
- University of Amsterdam
- Netherlands
- https://sites.google.com/view/baohaoliao
Pinned
- RLHFlow/Reinforce-Ada (Public): An adaptive sampling framework for Reinforce-style LLM post-training.




