GitHub - google-research-datasets/GSM-IC: Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant sentences in problem descriptions. GSM-IC is constructed to evaluate the distractibility of language models.

Grade-School Math with Irrelevant Context (GSM-IC)

This repository contains the dataset Grade-School Math with Irrelevant Context (GSM-IC) used in this paper: Large Language Models Can Be Easily Distracted by Irrelevant Context.

Data Format

GSM8K_validation.jsonl: the development split of GSM8K dataset used in the experiments.

Field name	Value
question	Input question.
answer	The ground truth answer.
n_steps	The number of intermediate steps to calculate the answer.

GSM-IC_2step.json: GSM-IC split with problems that require 2 intermediate steps.

Field name	Value
original_question	Original question from the GSM8K development set.
new_question	The new question with irrelevant context added to the original question.
answer	The ground truth answer.
n_steps	The number of intermediate steps to calculate the answer.
role_label, number_label, sentence_label	Categories of the added irrelevant context. Needed for result analysis, not needed for model prediction.
role, number, sentence_template	Added irrelevant context. Not needed for experiments.

GSM-IC_mstep.json: GSM-IC split with problems that require more than 2 intermediate steps. Same format as GSM-IC_2step.json.

Citation

If you use the data released through this repository, please cite the following paper:

@article{shi2023large,
  title={Large Language Models Can Be Easily Distracted by Irrelevant Context},
  author={Shi, Freda and Chen, Xinyun and Misra, Kanishka and Scales, Nathan and Dohan, David and Chi, Ed and Schärli, Nathanael and Zhou, Denny},
  journal={arXiv preprint arXiv:2302.00093},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
GSM-IC_2step.json		GSM-IC_2step.json
GSM-IC_mstep.json		GSM-IC_mstep.json
README.md		README.md
gsm8k_validation.jsonl		gsm8k_validation.jsonl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Grade-School Math with Irrelevant Context (GSM-IC)

Data Format

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors 1

Folders and files

Latest commit

History

Repository files navigation

Grade-School Math with Irrelevant Context (GSM-IC)

Data Format

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors 1

Packages