Welcome to our GitHub repository! This repository is based on the ideas introduced in our paper [LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content](https://arxiv.org/abs/2410.10783).
To use the code in this repository, clone the repo and create a conda environment using:

```bash
conda env create --file=environment.yaml
conda activate sloth
```
Our data can be found on HuggingFace.
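As a quick-start sketch, the data can be loaded with the `datasets` library; note that the repository id below is an assumption for illustration, so check our HuggingFace page for the exact name:

```python
# Minimal sketch for loading the benchmark from the HuggingFace Hub.
# NOTE: "LiveXiv/LiveXiv-VQA" is a hypothetical repo id used for illustration;
# substitute the actual dataset id from our HuggingFace page.
from datasets import load_dataset

ds = load_dataset("LiveXiv/LiveXiv-VQA", split="test")
print(ds)    # dataset schema and size
print(ds[0]) # one VQA sample
```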
To generate VQA data, go to the `vqa_generation` directory and run:

```bash
cd vqa_generation
python main.py artifacts_dir=<artifacts_dir>
```
Comments:
- There are many configuration options in `vqa_generation/config/conf.yaml`, so be sure to check it before you start (see the sketch after this list).
- API keys for GPT and/or Claude are needed to run the code.
- Additional installation is required: `pip install -r vqa_generation/requirements.txt`
- You will need to clone and install the LLaVA repo to work with the stand-alone blind filtering (you can switch to any other model with minimal changes).
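The `artifacts_dir=<artifacts_dir>` override syntax suggests a Hydra-style entry point. As a rough, non-authoritative sketch of how such an entry point reads `conf.yaml` and applies command-line overrides (the config path and field names here are illustrative assumptions, not the repository's actual code):

```python
# Sketch of a Hydra entry point in the style of vqa_generation/main.py.
# The config_path/config_name and the fields accessed are illustrative
# assumptions; consult vqa_generation/config/conf.yaml for the real options.
import hydra
from omegaconf import DictConfig, OmegaConf

@hydra.main(config_path="config", config_name="conf", version_base=None)
def main(cfg: DictConfig) -> None:
    # A CLI override such as `artifacts_dir=/path/to/artifacts` replaces
    # the corresponding key from conf.yaml before main() runs.
    print(OmegaConf.to_yaml(cfg))
    print("artifacts dir:", cfg.artifacts_dir)  # hypothetical config field

if __name__ == "__main__":
    main()
```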
If you are interested in how our efficient evaluation method works in practice, please check this notebook. For additional examples, please check our notebooks.
If you find LiveXiv useful for your research and applications, please cite it using this BibTeX:

```bibtex
@misc{shabtay2024livexivmultimodallive,
      title={LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content},
      author={Nimrod Shabtay and Felipe Maia Polo and Sivan Doveh and Wei Lin and M. Jehanzeb Mirza and Leshem Choshen and Mikhail Yurochkin and Yuekai Sun and Assaf Arbelle and Leonid Karlinsky and Raja Giryes},
      year={2024},
      eprint={2410.10783},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2410.10783},
}
```