GVIL: Visual Illusion Dataset

Code and data for the EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"

Setup

  1. Unzip the data.
unzip dataset.zip

PS: The dataset is also available on Hugging Face: link.

  2. Install the required utilities.
pip install matplotlib tqdm

Inference

  1. Add your model in model.py.
  2. Add the corresponding build function in inference.py (a minimal sketch of both steps follows the command below).
  3. Run inference.
python inference.py --task vqa --model YOUR_MODEL_NAME
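
This README does not spell out the wrapper interface, so the following is only a rough sketch of what steps 1 and 2 might look like; the class name, method signature, and build-function pattern are illustrative assumptions, not the repository's actual API.

# model.py -- a minimal sketch of a model wrapper. The interface that
# inference.py actually expects may differ; every name here is an
# illustrative assumption.
class MyModel:
    def __init__(self, checkpoint_path):
        # Load your vision-language model and any preprocessing here.
        self.checkpoint_path = checkpoint_path

    def predict(self, image, question):
        # Return the model's answer string for one (image, question) pair.
        raise NotImplementedError

# inference.py -- a hypothetical build function that the --model flag
# could be mapped to.
def build_my_model():
    return MyModel(checkpoint_path="checkpoints/my_model.pt")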

Evaluation

We provide the model predictions from OFA and Unified-IO for reference. For example, after unzipping the archive (unzip predictions.zip), you can run evaluation on the predictions of the OFA-Large model:

python eval.py \
    --vqa_predictions predictions/vqa__ofa_large.json \
    --vg_predictions predictions/vg__ofa_large.json
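
To sanity-check a prediction file before evaluating, a quick look with the standard json module works. The file's internal schema is not documented in this README, so the snippet below only inspects the top-level structure; any assumptions about its contents should be verified against the unzipped files.

import json

# Peek at one of the bundled prediction files before running eval.py.
with open("predictions/vqa__ofa_large.json") as f:
    predictions = json.load(f)

# Report the container type and the number of entries.
print(type(predictions), len(predictions))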

Citation

@inproceedings{zhang2023grounding,
    title={Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?},
    author={Zhang, Yichi and Pan, Jiayi and Zhou, Yuchen and Pan, Rui and Chai, Joyce},
    booktitle={Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing},
    year={2023}
}
