This is the dataset for the paper "Hi-ToM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models".
Hi-ToM_data/Hi-ToM_data.json contains 1.2k higher-order ToM question-answer pairs.
To generate new stories along with their questions and answers, run the script generate_tomh.sh.
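
To take a first look at the data file, something like the following may help. It is a minimal sketch that only assumes Hi-ToM_data/Hi-ToM_data.json is valid JSON; it makes no assumption about field names, since the schema is not described here. The helper names `load_json` and `describe` are hypothetical and not part of this repository.

```python
import json
from pathlib import Path

# Path as given in this README; adjust it if your checkout differs.
DATA_PATH = Path("Hi-ToM_data/Hi-ToM_data.json")


def load_json(path):
    """Parse a JSON file and return the resulting Python object."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def describe(obj):
    """Return (type name, length) for the top-level JSON container.

    The length is None when the top-level value is not a list or dict.
    """
    size = len(obj) if isinstance(obj, (list, dict)) else None
    return type(obj).__name__, size


if __name__ == "__main__":
    if DATA_PATH.exists():
        data = load_json(DATA_PATH)
        # Prints the top-level container type and its number of entries.
        print(describe(data))
    else:
        print(f"{DATA_PATH} not found; run this from the repository root.")
```

Once the top-level structure is known, individual entries can be inspected the same way before writing any code that relies on specific keys.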
