Skip to content

[ACL 2025] Official repository for "Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization""

Notifications You must be signed in to change notification settings

kimsh0507/rethinking_rm_eval

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

5 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization

Official repo for Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization (ACL 2025 Main Conference)

๐Ÿ› ๏ธ Setup

conda create -n rm_eval python=3.10 -y
conda activate rm_eval
pip install -r requirements.txt

To evaluate results, MARIO EVAL needs to be installed.

Install MARIO EVAL as Python package

git clone https://github.com/MARIO-Math-Reasoning/MARIO_EVAL.git
cd MARIO_EVAL
cd latex2sympy && pip install . && cd ..
pip install -e .

๐Ÿš€ Quick Start

Inference Classifier-based Reward Models

bash scripts/run_classifier_rm.sh

Inference Process Reward Mmodels

bash scripts/run_prm.sh

๐Ÿ‘ Acknowledgements

The underlying codebase for evaluating reward model from RewardBench.

Citation

@article{kim2025rethinking,
  title={Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization},
  author={Kim, Sunghwan and Kang, Dongjin and Kwon, Taeyoon and Chae, Hyungjoo and Lee, Dongha and Yeo, Jinyoung},
  journal={arXiv preprint arXiv:2505.12763},
  year={2025}
}

About

[ACL 2025] Official repository for "Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization""

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published