Add detailcaps #136
Merged
Contributor
Thanks for your contribution! Could you please add a screenshot of a model result with this PR?
Contributor
Luodian added a commit that referenced this pull request on Jul 9, 2024
* feat: Update LMMS evaluation configuration and models
  - Update `activitynetqa_generation.yaml` to remove `dataset_name` field and update `task` field to "activitynetqa"
  - Update `utils.py` to add default values for `API_URL` and `API_KEY` when `API_TYPE` is not "openai" or "azure"
  - Update `batch_gpt4.py` and `gpt4v.py` to rename `max_frames_for_video` parameter to `max_frames_num`
  - Update `reka.py` to rename `max_frames_for_video` parameter to `max_frames_num` and add support for `continual_mode` with a persistent response cache
  This commit updates the LMMS evaluation configuration and models to improve compatibility and add new features.
* Update LMMS evaluation configuration and models
* Update LMMS evaluation configuration and models
* feat: Update LMMS evaluation configuration and models
  - Update `activitynetqa_generation.yaml` to remove `dataset_name` field and update `task` field to "activitynetqa"
  - Update `utils.py` to add default values for `API_URL` and `API_KEY` when `API_TYPE` is not "openai" or "azure"
  - Update `batch_gpt4.py` and `gpt4v.py` to rename `max_frames_for_video` parameter to `max_frames_num`
  - Update `reka.py` to rename `max_frames_for_video` parameter to `max_frames_num` and add support for `continual_mode` with a persistent response cache
  This commit updates the LMMS evaluation configuration and models to improve compatibility and add new features.
* Refactor error handling in GPT4V model evaluation
* Refactor error handling in GPT4V model evaluation
* Refactor video decoding backend to use "decord" instead of "pyav"
* Refactor image aspect ratio handling in Llava_OneVision model
* Refactor GPT4V model to fix bug in visuals encoding
* add exception for azure gpt
* feat: fix bugs
* feat: update
* Refactor image aspect ratio handling in Llava_OneVision model
* Refactor image aspect ratio handling in Llava_OneVision model
* update interleave bench
Luodian added a commit that referenced this pull request on Sep 1, 2024
* chore: Update gpt_eval_model_name to "gpt-3.5-turbo" in mathvista.yaml
* Squashed commit of the following:
  commit 994c9f97a2f8db3e9b7d7933d1e1680acde5b70b Author: Yan Shu <[email protected]> Date: Mon Jul 8 17:21:23 2024 +0800 Add files via upload
* Squashed commit of the following:
  commit e31cd78 Author: Bo Li <[email protected]> Date: Wed Jul 10 12:08:08 2024 +1000 chore: Update lmms_eval/models/vila.py and lmms_eval/tasks/__init__.py
  commit 1d8c980 Author: kcz358 <[email protected]> Date: Tue Jul 9 02:08:52 2024 +0000 Rename xcomposer 4KHD
  commit 6da76f3 Author: Bo Li <[email protected]> Date: Tue Jul 9 11:55:56 2024 +1000 Upgrade lmms-eval to version 0.2.1
  commit cd18585 Author: Bo Li <[email protected]> Date: Tue Jul 9 11:52:23 2024 +1000 Upgrade lmms-eval to support more models and evaluation tasks
  commit 672d7e5 Author: Bo Li <[email protected]> Date: Tue Jul 9 11:43:41 2024 +1000 feat: Add tie_weights parameter to Llava model initialization
  commit 2037a86 Merge: e6844db a5c1869 Author: Bo Li <[email protected]> Date: Tue Jul 9 11:37:12 2024 +1000 Fix gen kwargs image aspect ratio in internvl2
  commit a5c1869 Merge: 2ebec77 557083a Author: Li Bo <[email protected]> Date: Tue Jul 9 09:15:56 2024 +0800 Merge pull request #137 from shuyansy/main add MLVU task
  commit 557083a Author: Yan Shu <[email protected]> Date: Mon Jul 8 16:56:50 2024 +0800 Add files via upload
  commit 2ebec77 Merge: 211bfed b23d349 Author: Li Bo <[email protected]> Date: Mon Jul 8 11:53:06 2024 +0800 Merge pull request #136 from Dousia/main Add detailcaps
  commit b23d349 Author: ByteDance <[email protected]> Date: Sun Jul 7 23:24:19 2024 +0800 Add install capture_metric in env
  commit c6e211d Author: ByteDance <[email protected]> Date: Sun Jul 7 23:04:13 2024 +0800 Add detailcaps
  commit 211bfed Merge: 7c208b7 79514ee Author: Li Bo <[email protected]> Date: Tue Jul 2 23:05:12 2024 +0800 Merge pull request #133 from EvolvingLMMs-Lab/dev/wild_vision Add wild vision bench
  commit 79514ee Author: kcz358 <[email protected]> Date: Mon Jul 1 15:10:02 2024 +0000 Fixing handling None filtered score
  commit 725fac2 Author: kcz358 <[email protected]> Date: Mon Jul 1 08:25:42 2024 +0000 Fixing dataset name
  commit 8d963e1 Author: kcz358 <[email protected]> Date: Mon Jul 1 08:24:51 2024 +0000 Fixing scoring logic
  commit e2990d0 Author: kcz358 <[email protected]> Date: Mon Jul 1 06:06:57 2024 +0000 Hardcode to keep image for wild vision
  commit ed38173 Author: kcz358 <[email protected]> Date: Mon Jul 1 06:06:38 2024 +0000 Add wild vision 0617
  commit 7c208b7 Author: Li Bo <[email protected]> Date: Mon Jul 1 11:53:31 2024 +0800 Update README.md
  commit 39d40de Merge: e19b43a ba7081c Author: Li Bo <[email protected]> Date: Mon Jul 1 11:47:09 2024 +0800 Merge pull request #129 from Dannoopsy/mmbench_ru add task MMBench-ru
  commit e19b43a Merge: 11fd7e3 a0de897 Author: Li Bo <[email protected]> Date: Mon Jul 1 11:46:58 2024 +0800 Merge pull request #128 from Dannoopsy/gqa-ru add task gqa-ru
  commit 11fd7e3 Merge: 383e7fe a752259 Author: Li Bo <[email protected]> Date: Mon Jul 1 11:46:16 2024 +0800 Merge pull request #130 from lscpku/vitatecs Add task VITATECS
  commit a752259 Author: lscpku <[email protected]> Date: Fri Jun 28 20:37:06 2024 +0800 create new task vitatecs
  commit ba7081c Author: Dannoopsy <[email protected]> Date: Fri Jun 28 12:21:05 2024 +0300 change prompt to ru
  commit 27ea9c0 Author: Dannoopsy <[email protected]> Date: Thu Jun 27 17:17:29 2024 +0000 add mmbench_ru_dev
  commit 383e7fe Merge: 06fa000 ed2e7f7 Author: Li Bo <[email protected]> Date: Fri Jun 28 00:14:10 2024 +0800 Merge pull request #126 from lorenzomammana/feature/external-package-integration External package integration using plugins
  commit ed2e7f7 Merge: 03947e1 06fa000 Author: Lorenzo Mammana <[email protected]> Date: Thu Jun 27 15:38:10 2024 +0000 Merge branch 'main' into feature/external-package-integration
  commit a0de897 Author: Dannoopsy <[email protected]> Date: Tue Jun 25 11:11:37 2024 +0000 new task gqa-ru
  commit 06fa000 Author: kcz358 <[email protected]> Date: Tue Jun 25 06:41:13 2024 +0000 Fix vid mme post prompt issue
  commit b388d79 Author: Li Bo <[email protected]> Date: Sun Jun 23 22:31:16 2024 +0800 Update activitynetqa_generation.yaml
  commit 8f9d620 Author: Li Bo <[email protected]> Date: Sun Jun 23 14:02:25 2024 +0800 Update pyproject.toml
  commit 6341b7c Merge: fce85f1 903b042 Author: Li Bo <[email protected]> Date: Sun Jun 23 14:02:02 2024 +0800 Merge pull request #125 from EvolvingLMMs-Lab/dev/interleave [Model] aligned llava-interleave model results on video tasks
  commit 903b042 Author: kcz358 <[email protected]> Date: Sat Jun 22 12:07:13 2024 +0000 Remove unnecessary lines for video llava
  commit d78ec86 Merge: ebe7217 fce85f1 Author: Li Bo <[email protected]> Date: Sat Jun 22 13:57:31 2024 +0800 Merge branch 'main' into dev/interleave
  commit ebe7217 Author: kcz358 <[email protected]> Date: Sat Jun 22 02:57:08 2024 +0000 Delete unnecessary lines
  commit 120c474 Author: kcz358 <[email protected]> Date: Fri Jun 21 08:38:41 2024 +0000 Revise model registry for llava_hf and longva
  commit 7d6201f Author: kcz358 <[email protected]> Date: Fri Jun 21 08:38:24 2024 +0000 Add longva
  commit 12f4806 Author: kcz358 <[email protected]> Date: Fri Jun 21 08:35:39 2024 +0000 Remove unnecessary lines since use batched visuals now in llava
  commit 12cea76 Author: Bo Li <[email protected]> Date: Thu Jun 20 18:15:32 2024 +0000 chore: Add loguru for logging in lmms_eval package
  commit 03947e1 Author: Lorenzo Mammana <[email protected]> Date: Wed Jun 5 13:40:41 2024 +0000 feat: Allow including external tasks from plugins
  commit b80a91f Author: Lorenzo Mammana <[email protected]> Date: Wed Jun 5 13:04:55 2024 +0000 feat: Allow loading model configurations from other packages
  commit 8ef2474 Author: Bo Li <[email protected]> Date: Thu Jun 20 12:11:03 2024 +0000 chore: Remove unused models from lmms_eval package
  commit af38885 Author: Bo Li <[email protected]> Date: Thu Jun 20 12:07:09 2024 +0000 chore: Handle ImportError when importing models Handle the ImportError exception when importing models in the lmms_eval package. This change adds a try-except block to catch the ImportError and print an error message indicating the failed import. This will help with troubleshooting and identifying any issues with the model imports.
  commit fce85f1 Merge: dbe6329 d94f83c Author: Li Bo <[email protected]> Date: Thu Jun 20 20:02:12 2024 +0800 Merge pull request #120 from EvolvingLMMs-Lab/pufanyi/hf_dataset_docs Add docs for datasets upload to HF
  commit dbe6329 Author: choiszt <[email protected]> Date: Thu Jun 20 15:14:21 2024 +0800 update ablation for videomme datasets
  commit d94f83c Author: Li Bo <[email protected]> Date: Thu Jun 20 13:30:59 2024 +0800 Update README.md
  commit cab8159 Author: Li Bo <[email protected]> Date: Thu Jun 20 13:30:29 2024 +0800 Update README.md
  commit 4587665 Author: kcz358 <[email protected]> Date: Thu Jun 20 03:55:30 2024 +0000 Add llava_hf back to registry
  commit 3463651 Author: kcz358 <[email protected]> Date: Thu Jun 20 03:54:33 2024 +0000 Remove handling non-visual loop in llava
  commit cb0d3f4 Author: Fanyi Pu <[email protected]> Date: Thu Jun 20 02:11:18 2024 +0800 update readme
  commit 813877b Author: Fanyi Pu <[email protected]> Date: Wed Jun 19 15:37:52 2024 +0800 to sh script
  commit a14684b Author: Fanyi Pu <[email protected]> Date: Wed Jun 19 15:37:04 2024 +0800 lint
  commit d0f8851 Author: Fanyi Pu <[email protected]> Date: Wed Jun 19 15:36:48 2024 +0800 small fix
  commit 63748e9 Author: Fanyi Pu <[email protected]> Date: Wed Jun 19 15:36:43 2024 +0800 small fix
  commit 7f1159a Author: Fanyi Pu <[email protected]> Date: Wed Jun 19 15:35:05 2024 +0800 update preparation
  commit 19f9bd6 Author: Fanyi Pu <[email protected]> Date: Wed Jun 19 15:23:24 2024 +0800 docs
  commit ce6f889 Author: Fanyi Pu <[email protected]> Date: Wed Jun 19 15:04:16 2024 +0800 tutorial
  commit f513c52 Author: Bo Li <[email protected]> Date: Wed Jun 19 06:51:19 2024 +0000 chore: Update dependencies to fix potential risks and improve compatibility
  commit efb5295 Author: kcz358 <[email protected]> Date: Wed Jun 19 10:25:58 2024 +0800 Release llava-wilder
  commit 742651f Author: Fanyi Pu <[email protected]> Date: Wed Jun 19 07:44:26 2024 +0800 feat: Add support for auto downloading tar format videos
  commit 511b625 Merge: 22a4958 050b2c3 Author: Bo Li <[email protected]> Date: Tue Jun 18 17:01:03 2024 +0000 Merge branch 'main' of https://github.com/EvolvingLMMs-Lab/lmms-eval
  commit 050b2c3 Merge: 74facb4 ef30651 Author: Li Bo <[email protected]> Date: Tue Jun 18 13:13:38 2024 +0800 Merge pull request #114 from zjysteven/add-tinyllava add tinyllava
  commit ef30651 Author: Jingyang Zhang <[email protected]> Date: Mon Jun 17 17:57:02 2024 -0400 fix typo
  commit 9bab677 Merge: dbfb238 74facb4 Author: Jingyang Zhang <[email protected]> Date: Sun Jun 16 10:56:05 2024 -0400 Merge branch 'EvolvingLMMs-Lab:main' into add-tinyllava
  commit 74facb4 Merge: 8ba192f d5df72d Author: Li Bo <[email protected]> Date: Sun Jun 16 17:59:19 2024 +0800 Merge pull request #118 from teowu/main Fix the potential risk by PR #117
  commit d5df72d Merge: 5bf59ed 8ba192f Author: Teo (Timothy) Wu Haoning <[email protected]> Date: Sun Jun 16 15:32:13 2024 +0800 Merge branch 'EvolvingLMMs-Lab:main' into main
  commit 5bf59ed Author: teowu <[email protected]> Date: Sun Jun 16 07:27:28 2024 +0000 fix #117, allow auto download with tar format videos
  commit 98b3955 Merge: a056f11 be9dada Author: teowu <[email protected]> Date: Sun Jun 16 07:25:07 2024 +0000 Merge branch 'main' of https://github.com/teowu/lmms-eval into main
  commit a056f11 Author: teowu <[email protected]> Date: Sun Jun 16 07:23:54 2024 +0000 fix #117, allow auto download with tar format videos
  commit 8ba192f Merge: 7cc2890 be9dada Author: Li Bo <[email protected]> Date: Sat Jun 15 17:30:59 2024 +0800 Merge pull request #117 from teowu/main LongVideoBench for LMMs-Eval
  commit be9dada Merge: 62ea8ce 7cc2890 Author: Teo (Timothy) Wu Haoning <[email protected]> Date: Sat Jun 15 16:39:20 2024 +0800 Merge pull request #1 from EvolvingLMMs-Lab/main Merge pull request #113 from teowu/main
  commit

This PR adds detail image caption evaluation using the DetailCaps-4870 benchmark and the CAPTURE metric proposed in "Benchmarking and Improving Detail Image Caption".
Dataset: https://huggingface.co/datasets/foundation-multimodal-models/DetailCaps-4870
Code: https://github.com/foundation-multimodal-models/CAPTURE
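
For reviewers unfamiliar with element-level caption scoring, here is a rough sketch of the idea. As far as I understand it, CAPTURE compares visual elements extracted from the candidate and reference captions rather than raw n-grams. The snippet below is NOT the CAPTURE implementation linked above: the `set_f1` helper and the hand-written element lists are hypothetical, and it covers exact matching only, while the real metric extracts elements with a parser and applies further matching stages.

```python
from typing import Iterable


def set_f1(candidate: Iterable[str], reference: Iterable[str]) -> float:
    """Exact-match F1 between two collections of caption elements.

    Hypothetical helper for illustration; not part of the CAPTURE package.
    """
    cand = {c.strip().lower() for c in candidate}
    ref = {r.strip().lower() for r in reference}
    if not cand or not ref:
        return 0.0
    overlap = len(cand & ref)
    if overlap == 0:
        return 0.0
    precision = overlap / len(cand)  # share of predicted elements that are correct
    recall = overlap / len(ref)      # share of reference elements that were found
    return 2 * precision * recall / (precision + recall)


# Hand-written element lists (assumed); a real pipeline would extract
# these from the candidate and reference captions automatically.
candidate_objects = ["dog", "frisbee", "grass"]
reference_objects = ["dog", "frisbee", "park", "tree"]

score = set_f1(candidate_objects, reference_objects)
print(f"object F1 = {score:.3f}")
```

Two of the three predicted objects appear in the reference, so precision is 2/3 and recall is 2/4. The same helper could be applied separately to attribute and relation lists.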