Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond The Base Model?
Published in Adavances on Neural Information Processing Systems (Neurips) 2025, 2025
Yang Yue, Zhiqi Chen, Rui Lu, Andrew Zhao, Zhaokai Wang, Shiji Song, Gao Huang
Download here
