Some improvement on [Reasoning] #53

YanSong97 · 2024-11-12T02:46:19Z

Add LLM-as-Judge tools
Configurate data loading, model resources;
add huggingface model worker;

2. Configurate LM/RM address; 3. Add tensor parallel example; 4. Black format;

2. refactor prompt resources, data loading; 3. add genrm infer fn and separate Q&A before value inference 4. batch limit for vllm request

2. add lm stop str to cfg file; 3. llm-as-judge also extract clean answer for evaluation; 4. add huggingface model worker; 5. update scripts

2. add offline RM evaluation script

reason/llm_service/utils/skywork_o1_prm_inference/prm_model.py

scripts/eval/beam_search.sh

2. change file type to yaml; 3. fix rstar data loading

YanSong97 added 3 commits December 1, 2024 23:11

1. Add LLM-as-Judge toolkits;

3bf4180

2. Configurate LM/RM address; 3. Add tensor parallel example; 4. Black format;

1. add LM lora config;

a475caa

2. refactor prompt resources, data loading; 3. add genrm infer fn and separate Q&A before value inference 4. batch limit for vllm request

1. refactor data loading and task;

345cd1d

2. add lm stop str to cfg file; 3. llm-as-judge also extract clean answer for evaluation; 4. add huggingface model worker; 5. update scripts

YanSong97 force-pushed the reason_llm_as_judge branch from 2b06d58 to 345cd1d Compare December 8, 2024 21:03

YanSong97 requested a review from ziyuwan December 8, 2024 21:19

1. add gsm8k

2e9cd35

2. add offline RM evaluation script

ziyuwan reviewed Dec 9, 2024

View reviewed changes

reason/llm_service/utils/skywork_o1_prm_inference/prm_model.py Show resolved Hide resolved

scripts/eval/beam_search.sh Show resolved Hide resolved

1. relocate LM, RM name to config;

e6af10f

2. change file type to yaml; 3. fix rstar data loading

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Some improvement on [Reasoning] #53

Some improvement on [Reasoning] #53

Uh oh!

YanSong97 commented Nov 12, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Some improvement on [Reasoning] #53

Are you sure you want to change the base?

Some improvement on [Reasoning] #53

Uh oh!

Conversation

YanSong97 commented Nov 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

YanSong97 commented Nov 12, 2024 •

edited

Loading