Guangxuan-Xiao/GSM8K-eval

Evaluate LLMs on GSM8K

This repository contains a minimal implementation of code for evaluating LLMs on the GSM8K benchmark.

Requirements

PyTorch, transformers, numpy, pandas, scikit-learn (imported as sklearn), tqdm

Example Usage

MODEL=meta-llama/Llama-2-7b-hf
device=0
CUDA_VISIBLE_DEVICES=$device python main.py \
    --model_name_or_path $MODEL \
    --output_dir outputs/$MODEL
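
For readers curious how a run like the one above is typically scored, here is a minimal sketch. It is not taken from main.py: the function names are hypothetical, and it assumes the common GSM8K convention that each reference solution ends with a line of the form "#### <number>" and that the model's answer is the last number in its generation.

```python
import re

# Assumption: GSM8K reference solutions end with "#### <number>".
REFERENCE = re.compile(r"####\s*(-?\d[\d,]*(?:\.\d+)?)")
# Matches integers and decimals, allowing thousands separators such as 1,000.
NUMBER = re.compile(r"-?\d[\d,]*(?:\.\d+)?")


def _to_float(text):
    """Convert a matched number string to float, dropping thousands separators."""
    return float(text.replace(",", ""))


def extract_reference(solution):
    """Return the number after the '####' marker, or None if the marker is absent."""
    match = REFERENCE.search(solution)
    return _to_float(match.group(1)) if match else None


def extract_prediction(generation):
    """Heuristic (an assumption, not the repository's method): use the last number."""
    numbers = NUMBER.findall(generation)
    return _to_float(numbers[-1]) if numbers else None


def accuracy(generations, solutions):
    """Fraction of generations whose extracted number equals the reference."""
    correct = 0
    for generation, solution in zip(generations, solutions):
        pred = extract_prediction(generation)
        ref = extract_reference(solution)
        if pred is not None and ref is not None and abs(pred - ref) < 1e-6:
            correct += 1
    return correct / len(solutions) if solutions else 0.0


if __name__ == "__main__":
    generations = [
        "She sells 9 eggs at $2 each, so she makes 9 * 2 = 18 dollars.",
        "I do not know.",
    ]
    solutions = ["16 - 3 - 4 = 9\n9 * 2 = 18\n#### 18", "#### 5"]
    print(accuracy(generations, solutions))
```

Taking the last number is a simple heuristic that can misfire when a generation keeps talking after stating its answer, so the actual script may extract answers differently.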

References

https://github.com/alibaba/FederatedScope/blob/dev/llm/federatedscope/llm/eval/eval_for_gsm8k/eval.py

https://github.com/kojima-takeshi188/zero_shot_cot
