BaohaoLiao

Follow

🎯

Focusing

Baohao Liao BaohaoLiao

🎯

Focusing

Follow

PhD candidate @ltl-uva for NLP

29 followers · 13 following

University of Amsterdam
Netherlands
https://sites.google.com/view/baohaoliao

Achievements

Achievements

baohaoliao/README.md

Hello world 👋

🔭 RL+Reasoning, RL+Agent, PEFT
🌱 PhD-ing at University of Amsterdam, Interning at Microsoft Research
😄 welcome to my ⭐Personal Homepage⭐

Pinned Loading

SAGE SAGE Public

Self-Hinting Language Models Enhance Reinforcement Learning

Python 24 3
RLHFlow/Reinforce-Ada RLHFlow/Reinforce-Ada Public

An adaptive sampling framework for Reinforce-style LLM post training.

Python 93 17
frac-cot frac-cot Public

An efficient sampling method for long-CoT LLM with fractured CoT.

Python 16
RSD RSD Public

[ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.

Python 56 6
ApiQ ApiQ Public

[EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs

Python 15 2
mefts mefts Public

[NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning

Python 33 1