Skip to content

Conversation

@YanSong97
Copy link
Collaborator

@YanSong97 YanSong97 commented Nov 12, 2024

  1. Add LLM-as-Judge tools
  2. Configurate data loading, model resources;
  3. add huggingface model worker;

2. Configurate LM/RM address;
3. Add tensor parallel example;
4. Black format;
2. refactor prompt resources, data loading;
3. add genrm infer fn and separate Q&A before value inference
4. batch limit for vllm request
2. add lm stop str to cfg file;
3. llm-as-judge also extract clean answer for evaluation;
4. add huggingface model worker;
5. update scripts
@YanSong97 YanSong97 force-pushed the reason_llm_as_judge branch from 2b06d58 to 345cd1d Compare December 8, 2024 21:03
@YanSong97 YanSong97 requested a review from ziyuwan December 8, 2024 21:19
2. add offline RM evaluation script
2. change file type to yaml;
3. fix rstar data loading
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants