Skip to content

Conversation

@Dousia
Copy link
Contributor

@Dousia Dousia commented Jul 7, 2024

@Luodian
Copy link
Contributor

Luodian commented Jul 8, 2024

Thanks for your contribution! Could you please add a screenshot of a model result with this PR?

@Luodian
Copy link
Contributor

Luodian commented Jul 8, 2024

img_v3_02cj_5fd1e286-1e2a-416a-8a82-d21da42f909g

The author has provided the screenshot, thanks!

@Luodian Luodian merged commit 2ebec77 into EvolvingLMMs-Lab:main Jul 8, 2024
Luodian added a commit that referenced this pull request Jul 9, 2024
* feat: Update LMMS evaluation configuration and models

- Update `activitynetqa_generation.yaml` to remove `dataset_name` field and update `task` field to "activitynetqa"
- Update `utils.py` to add default values for `API_URL` and `API_KEY` when `API_TYPE` is not "openai" or "azure"
- Update `batch_gpt4.py` and `gpt4v.py` to rename `max_frames_for_video` parameter to `max_frames_num`
- Update `reka.py` to rename `max_frames_for_video` parameter to `max_frames_num` and add support for `continual_mode` with a persistent response cache

This commit updates the LMMS evaluation configuration and models to improve compatibility and add new features.

* Update LMMS evaluation configuration and models

* Update LMMS evaluation configuration and models

* feat: Update LMMS evaluation configuration and models

- Update `activitynetqa_generation.yaml` to remove `dataset_name` field and update `task` field to "activitynetqa"
- Update `utils.py` to add default values for `API_URL` and `API_KEY` when `API_TYPE` is not "openai" or "azure"
- Update `batch_gpt4.py` and `gpt4v.py` to rename `max_frames_for_video` parameter to `max_frames_num`
- Update `reka.py` to rename `max_frames_for_video` parameter to `max_frames_num` and add support for `continual_mode` with a persistent response cache

This commit updates the LMMS evaluation configuration and models to improve compatibility and add new features.

* Refactor error handling in GPT4V model evaluation

* Refactor error handling in GPT4V model evaluation

* Refactor video decoding backend to use "decord" instead of "pyav"

* Refactor image aspect ratio handling in Llava_OneVision model

* Refactor GPT4V model to fix bug in visuals encoding

* add exception for azure gpt

* feat: fix bugs

* feat: update

* Refactor image aspect ratio handling in Llava_OneVision model

* Refactor image aspect ratio handling in Llava_OneVision model

* update interleave bench
Luodian added a commit that referenced this pull request Sep 1, 2024
* chore: Update gpt_eval_model_name to "gpt-3.5-turbo" in mathvista.yaml

* Squashed commit of the following:

commit 994c9f97a2f8db3e9b7d7933d1e1680acde5b70b
Author: Yan Shu <[email protected]>
Date:   Mon Jul 8 17:21:23 2024 +0800

    Add files via upload

* Squashed commit of the following:

commit e31cd78
Author: Bo Li <[email protected]>
Date:   Wed Jul 10 12:08:08 2024 +1000

    chore: Update lmms_eval/models/vila.py and lmms_eval/tasks/__init__.py

commit 1d8c980
Author: kcz358 <[email protected]>
Date:   Tue Jul 9 02:08:52 2024 +0000

    Rename xcomposer 4KHD

commit 6da76f3
Author: Bo Li <[email protected]>
Date:   Tue Jul 9 11:55:56 2024 +1000

    Upgrade lmms-eval to version 0.2.1

commit cd18585
Author: Bo Li <[email protected]>
Date:   Tue Jul 9 11:52:23 2024 +1000

    Upgrade lmms-eval to support more models and evaluation tasks

commit 672d7e5
Author: Bo Li <[email protected]>
Date:   Tue Jul 9 11:43:41 2024 +1000

    feat: Add tie_weights parameter to Llava model initialization

commit 2037a86
Merge: e6844db a5c1869
Author: Bo Li <[email protected]>
Date:   Tue Jul 9 11:37:12 2024 +1000

    Fix gen kwargs image aspect ratio in internvl2

commit a5c1869
Merge: 2ebec77 557083a
Author: Li Bo <[email protected]>
Date:   Tue Jul 9 09:15:56 2024 +0800

    Merge pull request #137 from shuyansy/main

    add MLVU task

commit 557083a
Author: Yan Shu <[email protected]>
Date:   Mon Jul 8 16:56:50 2024 +0800

    Add files via upload

commit 2ebec77
Merge: 211bfed b23d349
Author: Li Bo <[email protected]>
Date:   Mon Jul 8 11:53:06 2024 +0800

    Merge pull request #136 from Dousia/main

    Add detailcaps

commit b23d349
Author: ByteDance <[email protected]>
Date:   Sun Jul 7 23:24:19 2024 +0800

    Add install capture_metric in env

commit c6e211d
Author: ByteDance <[email protected]>
Date:   Sun Jul 7 23:04:13 2024 +0800

    Add detailcaps

commit 211bfed
Merge: 7c208b7 79514ee
Author: Li Bo <[email protected]>
Date:   Tue Jul 2 23:05:12 2024 +0800

    Merge pull request #133 from EvolvingLMMs-Lab/dev/wild_vision

    Add wild vision bench

commit 79514ee
Author: kcz358 <[email protected]>
Date:   Mon Jul 1 15:10:02 2024 +0000

    Fixing handling None filtered score

commit 725fac2
Author: kcz358 <[email protected]>
Date:   Mon Jul 1 08:25:42 2024 +0000

    Fixing dataset name

commit 8d963e1
Author: kcz358 <[email protected]>
Date:   Mon Jul 1 08:24:51 2024 +0000

    Fixing scoring logic

commit e2990d0
Author: kcz358 <[email protected]>
Date:   Mon Jul 1 06:06:57 2024 +0000

    Hardcode to keep image for wild vision

commit ed38173
Author: kcz358 <[email protected]>
Date:   Mon Jul 1 06:06:38 2024 +0000

    Add wild vision 0617

commit 7c208b7
Author: Li Bo <[email protected]>
Date:   Mon Jul 1 11:53:31 2024 +0800

    Update README.md

commit 39d40de
Merge: e19b43a ba7081c
Author: Li Bo <[email protected]>
Date:   Mon Jul 1 11:47:09 2024 +0800

    Merge pull request #129 from Dannoopsy/mmbench_ru

    add task MMBench-ru

commit e19b43a
Merge: 11fd7e3 a0de897
Author: Li Bo <[email protected]>
Date:   Mon Jul 1 11:46:58 2024 +0800

    Merge pull request #128 from Dannoopsy/gqa-ru

    add task gqa-ru

commit 11fd7e3
Merge: 383e7fe a752259
Author: Li Bo <[email protected]>
Date:   Mon Jul 1 11:46:16 2024 +0800

    Merge pull request #130 from lscpku/vitatecs

    Add task VITATECS

commit a752259
Author: lscpku <[email protected]>
Date:   Fri Jun 28 20:37:06 2024 +0800

    create new task vitatecs

commit ba7081c
Author: Dannoopsy <[email protected]>
Date:   Fri Jun 28 12:21:05 2024 +0300

    change prompt to ru

commit 27ea9c0
Author: Dannoopsy <[email protected]>
Date:   Thu Jun 27 17:17:29 2024 +0000

    add mmbench_ru_dev

commit 383e7fe
Merge: 06fa000 ed2e7f7
Author: Li Bo <[email protected]>
Date:   Fri Jun 28 00:14:10 2024 +0800

    Merge pull request #126 from lorenzomammana/feature/external-package-integration

    External package integration using plugins

commit ed2e7f7
Merge: 03947e1 06fa000
Author: Lorenzo Mammana <[email protected]>
Date:   Thu Jun 27 15:38:10 2024 +0000

    Merge branch 'main' into feature/external-package-integration

commit a0de897
Author: Dannoopsy <[email protected]>
Date:   Tue Jun 25 11:11:37 2024 +0000

    new task gqa-ru

commit 06fa000
Author: kcz358 <[email protected]>
Date:   Tue Jun 25 06:41:13 2024 +0000

    Fix vid mme post prompt issue

commit b388d79
Author: Li Bo <[email protected]>
Date:   Sun Jun 23 22:31:16 2024 +0800

    Update activitynetqa_generation.yaml

commit 8f9d620
Author: Li Bo <[email protected]>
Date:   Sun Jun 23 14:02:25 2024 +0800

    Update pyproject.toml

commit 6341b7c
Merge: fce85f1 903b042
Author: Li Bo <[email protected]>
Date:   Sun Jun 23 14:02:02 2024 +0800

    Merge pull request #125 from EvolvingLMMs-Lab/dev/interleave

    [Model] aligned llava-interleave model results on video tasks

commit 903b042
Author: kcz358 <[email protected]>
Date:   Sat Jun 22 12:07:13 2024 +0000

    Remove unnecessary lines for video llava

commit d78ec86
Merge: ebe7217 fce85f1
Author: Li Bo <[email protected]>
Date:   Sat Jun 22 13:57:31 2024 +0800

    Merge branch 'main' into dev/interleave

commit ebe7217
Author: kcz358 <[email protected]>
Date:   Sat Jun 22 02:57:08 2024 +0000

    Delete unnecessary lines

commit 120c474
Author: kcz358 <[email protected]>
Date:   Fri Jun 21 08:38:41 2024 +0000

    Revise model registry for llava_hf and longva

commit 7d6201f
Author: kcz358 <[email protected]>
Date:   Fri Jun 21 08:38:24 2024 +0000

    Add longva

commit 12f4806
Author: kcz358 <[email protected]>
Date:   Fri Jun 21 08:35:39 2024 +0000

    Remove unnecessary lines since use batched visuals now in llava

commit 12cea76
Author: Bo Li <[email protected]>
Date:   Thu Jun 20 18:15:32 2024 +0000

    chore: Add loguru for logging in lmms_eval package

commit 03947e1
Author: Lorenzo Mammana <[email protected]>
Date:   Wed Jun 5 13:40:41 2024 +0000

    feat: Allow including external tasks from plugins

commit b80a91f
Author: Lorenzo Mammana <[email protected]>
Date:   Wed Jun 5 13:04:55 2024 +0000

    feat: Allow loading model configurations from other packages

commit 8ef2474
Author: Bo Li <[email protected]>
Date:   Thu Jun 20 12:11:03 2024 +0000

    chore: Remove unused models from lmms_eval package

commit af38885
Author: Bo Li <[email protected]>
Date:   Thu Jun 20 12:07:09 2024 +0000

    chore: Handle ImportError when importing models

    Handle the ImportError exception when importing models in the lmms_eval package. This change adds a try-except block to catch the ImportError and print an error message indicating the failed import. This will help with troubleshooting and identifying any issues with the model imports.

commit fce85f1
Merge: dbe6329 d94f83c
Author: Li Bo <[email protected]>
Date:   Thu Jun 20 20:02:12 2024 +0800

    Merge pull request #120 from EvolvingLMMs-Lab/pufanyi/hf_dataset_docs

    Add docs for datasets upload to HF

commit dbe6329
Author: choiszt <[email protected]>
Date:   Thu Jun 20 15:14:21 2024 +0800

    update ablation for videomme datasets

commit d94f83c
Author: Li Bo <[email protected]>
Date:   Thu Jun 20 13:30:59 2024 +0800

    Update README.md

commit cab8159
Author: Li Bo <[email protected]>
Date:   Thu Jun 20 13:30:29 2024 +0800

    Update README.md

commit 4587665
Author: kcz358 <[email protected]>
Date:   Thu Jun 20 03:55:30 2024 +0000

    Add llava_hf back to registry

commit 3463651
Author: kcz358 <[email protected]>
Date:   Thu Jun 20 03:54:33 2024 +0000

    Remove handling non-visual loop in llava

commit cb0d3f4
Author: Fanyi Pu <[email protected]>
Date:   Thu Jun 20 02:11:18 2024 +0800

    update readme

commit 813877b
Author: Fanyi Pu <[email protected]>
Date:   Wed Jun 19 15:37:52 2024 +0800

    to sh script

commit a14684b
Author: Fanyi Pu <[email protected]>
Date:   Wed Jun 19 15:37:04 2024 +0800

    lint

commit d0f8851
Author: Fanyi Pu <[email protected]>
Date:   Wed Jun 19 15:36:48 2024 +0800

    small fix

commit 63748e9
Author: Fanyi Pu <[email protected]>
Date:   Wed Jun 19 15:36:43 2024 +0800

    small fix

commit 7f1159a
Author: Fanyi Pu <[email protected]>
Date:   Wed Jun 19 15:35:05 2024 +0800

    update preparation

commit 19f9bd6
Author: Fanyi Pu <[email protected]>
Date:   Wed Jun 19 15:23:24 2024 +0800

    docs

commit ce6f889
Author: Fanyi Pu <[email protected]>
Date:   Wed Jun 19 15:04:16 2024 +0800

    tutorial

commit f513c52
Author: Bo Li <[email protected]>
Date:   Wed Jun 19 06:51:19 2024 +0000

    chore: Update dependencies to fix potential risks and improve compatibility

commit efb5295
Author: kcz358 <[email protected]>
Date:   Wed Jun 19 10:25:58 2024 +0800

    Release llava-wilder

commit 742651f
Author: Fanyi Pu <[email protected]>
Date:   Wed Jun 19 07:44:26 2024 +0800

    feat: Add support for auto downloading tar format videos

commit 511b625
Merge: 22a4958 050b2c3
Author: Bo Li <[email protected]>
Date:   Tue Jun 18 17:01:03 2024 +0000

    Merge branch 'main' of https://github.com/EvolvingLMMs-Lab/lmms-eval

commit 050b2c3
Merge: 74facb4 ef30651
Author: Li Bo <[email protected]>
Date:   Tue Jun 18 13:13:38 2024 +0800

    Merge pull request #114 from zjysteven/add-tinyllava

    add tinyllava

commit ef30651
Author: Jingyang Zhang <[email protected]>
Date:   Mon Jun 17 17:57:02 2024 -0400

    fix typo

commit 9bab677
Merge: dbfb238 74facb4
Author: Jingyang Zhang <[email protected]>
Date:   Sun Jun 16 10:56:05 2024 -0400

    Merge branch 'EvolvingLMMs-Lab:main' into add-tinyllava

commit 74facb4
Merge: 8ba192f d5df72d
Author: Li Bo <[email protected]>
Date:   Sun Jun 16 17:59:19 2024 +0800

    Merge pull request #118 from teowu/main

    Fix the potential risk by PR #117

commit d5df72d
Merge: 5bf59ed 8ba192f
Author: Teo (Timothy) Wu Haoning <[email protected]>
Date:   Sun Jun 16 15:32:13 2024 +0800

    Merge branch 'EvolvingLMMs-Lab:main' into main

commit 5bf59ed
Author: teowu <[email protected]>
Date:   Sun Jun 16 07:27:28 2024 +0000

    fix #117, allow auto download with tar format videos

commit 98b3955
Merge: a056f11 be9dada
Author: teowu <[email protected]>
Date:   Sun Jun 16 07:25:07 2024 +0000

    Merge branch 'main' of https://github.com/teowu/lmms-eval into main

commit a056f11
Author: teowu <[email protected]>
Date:   Sun Jun 16 07:23:54 2024 +0000

    fix #117, allow auto download with tar format videos

commit 8ba192f
Merge: 7cc2890 be9dada
Author: Li Bo <[email protected]>
Date:   Sat Jun 15 17:30:59 2024 +0800

    Merge pull request #117 from teowu/main

    LongVideoBench for LMMs-Eval

commit be9dada
Merge: 62ea8ce 7cc2890
Author: Teo (Timothy) Wu Haoning <[email protected]>
Date:   Sat Jun 15 16:39:20 2024 +0800

    Merge pull request #1 from EvolvingLMMs-Lab/main

    Merge pull request #113 from teowu/main

commit