Prefix-RFT

This repository contains the code for the paper "Blending Supervised and Reinforcement Fine-Tuning with Prefix Sampling". In the paper, we propose a hybrid approach to LLM post-training.

Structure

The codebase is based on veRL. All Prefix-RFT-related code is in recipe/prefix_rft.

Cite us

@article{huang2025blending,
    title={Blending Supervised and Reinforcement Fine-Tuning with Prefix Sampling},
    author={Huang, Zeyu and Cheng, Tianhao and Qiu, Zihan and Wang, Zili and Xu, Yinghui and Ponti, Edoardo M and Titov, Ivan},
    journal={arXiv preprint arXiv:2507.01679},
    year={2025}
}
