Code-for-HAF-RM

Code for HAF-RM (anonymous)

There are some absolute paths in the codes, which only work with the same code structure as below:

/root/
|--- exp-modeling/
|--- |--- train_rm.py
|--- |--- data/
|--- |--- |--- harmless-train.json
|--- |--- |--- ......
|--- |--- |--- sampled_data/
|--- |--- |--- sampled_data_mistral/
|--- |--- output/
|--- |--- |--- checkpoint/
|--- |--- |--- eval_result/
|--- |--- |--- gpt_judge/
|--- |--- |--- infer_result/
|--- |--- generation/
|--- |--- tensorboard/
|--- |--- src/
|--- |--- |--- train_dpo_lora.py
|--- |--- |--- ......
|--- |--- script/
|--- |--- |--- ......
|--- model/
|--- |--- Mistral-7B-v02
|--- |--- phi-2
|--- trl/

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
script		script
src		src
README.md		README.md
requirements.txt		requirements.txt
train_rm.py		train_rm.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Code-for-HAF-RM

About

Uh oh!

Releases

Packages

Uh oh!

Languages

haf-rm-anonymized/Code-for-HAF-RM

Folders and files

Latest commit

History

Repository files navigation

Code-for-HAF-RM

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages