Skip to content

Commit 59cce73

Browse files
committed
Squashed commit of the following:
commit 050b2c3 Merge: 74facb4 ef30651 Author: Li Bo <[email protected]> Date: Tue Jun 18 13:13:38 2024 +0800 Merge pull request #114 from zjysteven/add-tinyllava add tinyllava commit ef30651 Author: Jingyang Zhang <[email protected]> Date: Mon Jun 17 17:57:02 2024 -0400 fix typo commit 9bab677 Merge: dbfb238 74facb4 Author: Jingyang Zhang <[email protected]> Date: Sun Jun 16 10:56:05 2024 -0400 Merge branch 'EvolvingLMMs-Lab:main' into add-tinyllava commit 74facb4 Merge: 8ba192f d5df72d Author: Li Bo <[email protected]> Date: Sun Jun 16 17:59:19 2024 +0800 Merge pull request #118 from teowu/main Fix the potential risk by PR #117 commit d5df72d Merge: 5bf59ed 8ba192f Author: Teo (Timothy) Wu Haoning <[email protected]> Date: Sun Jun 16 15:32:13 2024 +0800 Merge branch 'EvolvingLMMs-Lab:main' into main commit 5bf59ed Author: teowu <[email protected]> Date: Sun Jun 16 07:27:28 2024 +0000 fix #117, allow auto download with tar format videos commit 98b3955 Merge: a056f11 be9dada Author: teowu <[email protected]> Date: Sun Jun 16 07:25:07 2024 +0000 Merge branch 'main' of https://github.com/teowu/lmms-eval into main commit a056f11 Author: teowu <[email protected]> Date: Sun Jun 16 07:23:54 2024 +0000 fix #117, allow auto download with tar format videos commit 8ba192f Merge: 7cc2890 be9dada Author: Li Bo <[email protected]> Date: Sat Jun 15 17:30:59 2024 +0800 Merge pull request #117 from teowu/main LongVideoBench for LMMs-Eval commit be9dada Merge: 62ea8ce 7cc2890 Author: Teo (Timothy) Wu Haoning <[email protected]> Date: Sat Jun 15 16:39:20 2024 +0800 Merge pull request #1 from EvolvingLMMs-Lab/main Merge pull request #113 from teowu/main commit 62ea8ce Author: teowu <[email protected]> Date: Sat Jun 15 08:30:11 2024 +0000 LongVideoBench support: image LMMs (idefics2, phi3) and video LMMs (LLaVA-Next-Video-34B) commit 7cc2890 Merge: 4bc7224 ea14cd4 Author: Li Bo <[email protected]> Date: Sat Jun 15 14:10:22 2024 +0800 Merge pull request #113 from teowu/main Q-Bench, Q-Bench2, A-Bench commit dbfb238 Author: Jingyang <[email protected]> Date: Fri Jun 14 16:20:42 2024 -0400 add tinyllava commit ea14cd4 Author: teowu <[email protected]> Date: Fri Jun 14 15:01:52 2024 +0000 Add qbench, qbench2, abench; fix phi3v as its current implementation does not support multi-image commit 4bc7224 Merge: 2797987 bf14cb8 Author: Li Bo <[email protected]> Date: Fri Jun 14 02:14:43 2024 +0800 Merge pull request #111 from XinrunDu/main add II-Bench commit bf14cb8 Author: XinrunDu <[email protected]> Date: Thu Jun 13 09:37:02 2024 +0000 fix dataset_path commit 6248113 Author: XinrunDu <[email protected]> Date: Thu Jun 13 09:32:06 2024 +0000 add II-Bench commit 2797987 Merge: 63d82f1 66d4bb2 Author: Li Bo <[email protected]> Date: Thu Jun 13 11:14:47 2024 +0800 Merge pull request #109 from EvolvingLMMs-Lab/pufanyi/update_version [Small Update] Update the version of LMMs-Eval commit 66d4bb2 Author: Fanyi Pu <[email protected]> Date: Thu Jun 13 11:13:00 2024 +0800 update version commit 63d82f1 Author: Li Bo <[email protected]> Date: Thu Jun 13 11:04:32 2024 +0800 Update README.md commit 44a3379 Merge: 5ed0035 0ce46d0 Author: Li Bo <[email protected]> Date: Thu Jun 13 04:00:12 2024 +0800 Merge pull request #105 from tianyu-z/main Include VCR commit 0ce46d0 Author: Suyuchen <[email protected]> Date: Wed Jun 12 15:56:34 2024 -0400 update README.md commit 46a88d8 Merge: 47b13b9 5ed0035 Author: Suyuchen <[email protected]> Date: Wed Jun 12 15:50:26 2024 -0400 merged readme.md commit 47b13b9 Author: Suyuchen <[email protected]> Date: Wed Jun 12 15:30:52 2024 -0400 update aggregation function for vcr_wiki commit 5ed0035 Author: Li Bo <[email protected]> Date: Thu Jun 13 03:21:42 2024 +0800 Update README.md commit ed88068 Author: Li Bo <[email protected]> Date: Thu Jun 13 03:13:59 2024 +0800 Update README.md commit fea3806 Merge: d99a24a 05dc8e8 Author: Li Bo <[email protected]> Date: Thu Jun 13 03:11:49 2024 +0800 Merge pull request #108 from EvolvingLMMs-Lab/internal_main_dev [Upgrade to v0.2] Embracing Video Evaluations with LMMs-Eval commit 05dc8e8 Author: Bo Li <[email protected]> Date: Wed Jun 12 15:56:04 2024 +0000 chore: Update lmms-eval to support video evaluations for LLaVA models commit cbeee20 Author: Bo Li <[email protected]> Date: Wed Jun 12 15:50:30 2024 +0000 chore: Update lmms-eval to support video evaluations for LLaVA models commit f00d549 Author: Bo Li <[email protected]> Date: Wed Jun 12 15:46:33 2024 +0000 Update image alignment in README.md commit 3415633 Author: Bo Li <[email protected]> Date: Wed Jun 12 15:43:16 2024 +0000 Update llava conv_template in lmms_eval/models/llava.py commit 50575a9 Author: Bo Li <[email protected]> Date: Wed Jun 12 15:39:03 2024 +0000 chore: Update lmms-eval to support video evaluations for LLaVA models commit c9b2252 Author: Bo Li <[email protected]> Date: Wed Jun 12 15:33:48 2024 +0000 Bump version to 0.2.0.dev0 commit 465bd42 Merge: e43bd84 d99a24a Author: Bo Li <[email protected]> Date: Wed Jun 12 15:04:25 2024 +0000 Merge branch 'main' of https://github.com/EvolvingLMMs-Lab/lmms-eval into internal_main_dev commit e43bd84 Author: Bo Li <[email protected]> Date: Wed Jun 12 14:54:06 2024 +0000 chore: Remove unnecessary files and code related to live_bench and sft_eval tasks commit d99a24a Merge: 374590b a66003b Author: Li Bo <[email protected]> Date: Wed Jun 12 19:45:57 2024 +0800 Merge pull request #107 from AtsuMiyai/new_task/upd_update update gpt-3.5-turbo version commit a66003b Author: AtsuMiyai <[email protected]> Date: Wed Jun 12 17:05:17 2024 +0900 update gpt-3.5-turbo version commit ee91f27 Author: AtsuMiyai <[email protected]> Date: Wed Jun 12 16:50:53 2024 +0900 update gpt-3.5-turbo version commit 326b969 Author: tianyu-z <[email protected]> Date: Mon Jun 10 20:07:40 2024 -0400 include std and confidence interval commit cd050d4 Author: Suyuchen <[email protected]> Date: Mon Jun 10 18:49:47 2024 -0400 update vcr_wiki tasks in README.md commit 205721e Author: Suyuchen <[email protected]> Date: Mon Jun 10 18:43:15 2024 -0400 update vcr_wiki tasks commit db8e718 Author: tianyu-z <[email protected]> Date: Mon Jun 10 16:13:58 2024 -0400 include the try-except logic for spacy commit 427dabb Author: Suyuchen <[email protected]> Date: Mon Jun 10 15:51:05 2024 -0400 add crossed_text to vcr_wiki output commit 043b483 Author: tianyu-z <[email protected]> Date: Mon Jun 10 15:47:00 2024 -0400 switch logic commit e1f04db Author: tianyu-z <[email protected]> Date: Mon Jun 10 02:38:21 2024 -0400 modify the form of VCR commit 96e8d98 Author: tianyu-z <[email protected]> Date: Mon Jun 10 00:10:30 2024 -0400 init include vcr commit 374590b Merge: 504685e cb3b9ce Author: Kaichen Zhang - NTU <[email protected]> Date: Fri Jun 7 20:25:48 2024 +0800 Merge pull request #101 from Gumpest/main Update conbench in README commit 504685e Author: Li Bo <[email protected]> Date: Thu Jun 6 15:42:15 2024 +0800 Update README.md commit cb3b9ce Merge: c9793b3 67b64ea Author: Yuan Zhang <[email protected]> Date: Thu Jun 6 11:22:24 2024 +0800 Merge branch 'EvolvingLMMs-Lab:main' into main commit c9793b3 Author: Yuan Zhang <[email protected]> Date: Thu Jun 6 11:21:05 2024 +0800 update README commit 67b64ea Merge: 8ee7848 5fd6845 Author: Li Bo <[email protected]> Date: Wed Jun 5 23:12:58 2024 +0800 Merge pull request #100 from Gumpest/main add Conbench commit 5fd6845 Author: Yuan Zhang <[email protected]> Date: Wed Jun 5 21:52:31 2024 +0800 add conbench commit 8ee7848 Merge: 747e197 6fefaf7 Author: Li Bo <[email protected]> Date: Tue Jun 4 17:09:33 2024 +0800 Merge pull request #95 from AtsuMiyai/new_task/upd add MM-UPD commit 747e197 Merge: 4854a34 0584307 Author: Li Bo <[email protected]> Date: Tue Jun 4 17:09:04 2024 +0800 Merge pull request #97 from CaraJ7/update Add MathVerse in README.md commit 6fefaf7 Author: AtsuMiyai <[email protected]> Date: Tue Jun 4 17:36:39 2024 +0900 update utils.py for leaderboard submission commit 5f4fe36 Author: AtsuMiyai <[email protected]> Date: Sun Jun 2 23:28:27 2024 +0900 slightly change query_prompt for the reproduction commit 0584307 Author: CaraJ7 <[email protected]> Date: Sun Jun 2 17:05:28 2024 +0800 Add MathVerse in README.md commit 0581ab3 Author: AtsuMiyai <[email protected]> Date: Fri May 31 16:09:45 2024 +0900 merge model_specific_prompt_kwargs and dataset_name into each task yaml commit 4854a34 Author: Pu Fanyi <[email protected]> Date: Sat May 4 19:23:39 2024 +0800 Group MMMU images into one image (#83) * update * update font * Add matplotlib.font_manager import in utils.py * Refactor font handling in add_order_label function in utils.py * group mmmu --------- Co-authored-by: Li Bo <[email protected]> commit d224794 Author: AtsuMiyai <[email protected]> Date: Wed May 29 15:15:59 2024 +0900 add upd commit 453e793 Author: AtsuMiyai <[email protected]> Date: Wed May 29 15:03:30 2024 +0900 add upd commit 909edd6 Author: AtsuMiyai <[email protected]> Date: Wed May 29 12:52:21 2024 +0900 add upd commit 7c1ac97 Author: AtsuMiyai <[email protected]> Date: Wed May 29 12:50:32 2024 +0900 add upd commit 811301c Author: AtsuMiyai <[email protected]> Date: Wed May 29 12:46:58 2024 +0900 add upd commit 71401ba Author: AtsuMiyai <[email protected]> Date: Wed May 29 12:41:21 2024 +0900 add upd commit 24dc435 Author: Bo Li <[email protected]> Date: Mon May 27 10:17:32 2024 +0000 fix compatibility issue of older version llava commit 616edf4 Author: Bo Li <[email protected]> Date: Mon May 27 09:32:26 2024 +0000 [Fix] import issues of multilingual llava and olympiadbench commit 4c5a99e Merge: 45c05b2 b05c3e2 Author: Li Bo <[email protected]> Date: Mon May 27 14:19:53 2024 +0800 Merge pull request #87 from vfragoso/vifragos/phi3v Adding microsoft/Phi-3-vision-128k-instruct model. commit b05c3e2 Author: Victor Fragoso <[email protected]> Date: Fri May 24 16:36:37 2024 +0000 Adding documentation of Phi3v class. commit c200897 Author: Victor Fragoso <[email protected]> Date: Fri May 24 16:25:02 2024 +0000 Adding prompt arguments for Phi3v on MathVista-TestMini commit 7f9fb6b Author: Victor Fragoso <[email protected]> Date: Fri May 24 13:24:16 2024 +0000 Adding Phi3v model. commit 45c05b2 Author: kcz358 <[email protected]> Date: Thu May 23 03:47:36 2024 +0000 Set printing info for llava_hf to debug level commit 53f013e Author: kcz358 <[email protected]> Date: Thu May 23 03:41:39 2024 +0000 Fix pope random name in pope full commit 22520a9 Author: kcz358 <[email protected]> Date: Thu May 23 03:41:14 2024 +0000 Add separated pope tasks by category commit d1eefb1 Author: kcz358 <[email protected]> Date: Thu May 9 08:36:02 2024 +0000 Update gitignore commit b2b4dbd Author: kcz358 <[email protected]> Date: Mon May 20 07:45:11 2024 +0000 Comment out Spice in caption task so that don't need to download stanford nlp model commit 662f05c Author: kcz358 <[email protected]> Date: Mon May 20 03:13:13 2024 +0000 Comment out parse result in xcomposer commit 0932932 Author: kcz358 <[email protected]> Date: Thu May 16 03:55:39 2024 +0000 Fix instructblip qformer size mismatch and multi-images problem commit 557a6a3 Author: kcz358 <[email protected]> Date: Thu May 16 03:11:41 2024 +0000 Remove redundant code in fuyu commit 6aeb550 Author: kcz358 <[email protected]> Date: Thu May 16 01:45:24 2024 +0000 Fix idefics2 llava in the wild bugs commit aea80e6 Author: kcz358 <[email protected]> Date: Wed May 15 11:07:35 2024 +0000 Better task list_with_num commit 3c12a08 Author: Li Bo <[email protected]> Date: Sat May 18 02:35:52 2024 +0800 Update LICENSE commit 82317a6 Author: Li Bo <[email protected]> Date: Sat May 18 02:29:09 2024 +0800 Update LICENSE commit a8bba1c Author: Li Bo <[email protected]> Date: Sat May 18 02:28:03 2024 +0800 Create LICENSE commit caa5893 Merge: c094448 423b006 Author: Li Bo <[email protected]> Date: Mon May 13 11:45:26 2024 +0800 Merge pull request #73 from EvolvingLMMs-Lab/kc/qwen_vl_api [Feat] Add qwen vl api commit c094448 Author: kcz358 <[email protected]> Date: Sat May 11 06:11:19 2024 +0000 Fix llava_hf image tokens number issue commit 64f07e4 Author: kcz358 <[email protected]> Date: Thu May 9 02:04:10 2024 +0000 Fix endless warning for llava_hf generation commit 8aaa828 Author: Bo Li <[email protected]> Date: Thu May 2 06:13:56 2024 +0000 Add model_name parameter to Llava constructor commit 7847dc4 Author: kcz358 <[email protected]> Date: Tue May 7 03:15:59 2024 +0000 Parse result for llava_hf 1.6 commit 3e56b4f Author: kcz358 <[email protected]> Date: Tue May 7 03:09:56 2024 +0000 Fix llava_hf generation for 1.6 commit fa3ff92 Author: kcz358 <[email protected]> Date: Mon May 6 08:32:57 2024 +0000 Fix llava conv template for llama3 commit 423b006 Author: kcz358 <[email protected]> Date: Sun May 5 07:54:52 2024 +0000 Add qwen vl api commit b7fd7a9 Merge: 986139a c5a130b Author: Li Bo <[email protected]> Date: Sun May 5 13:19:48 2024 +0800 Merge pull request #59 from EvolvingLMMs-Lab/add_idefics2 add idefics2 commit 986139a Merge: b46239c 8d3526c Author: Li Bo <[email protected]> Date: Fri May 3 01:18:18 2024 +0800 Merge pull request #36 from cocoshe/main [Fix] repr llava doc commit b46239c Merge: bc69a74 373265f Author: Li Bo <[email protected]> Date: Fri May 3 01:17:34 2024 +0800 Merge pull request #56 from gagan3012/main Multilingual LLava bench commit bc69a74 Merge: eef3aeb 626e8a9 Author: Li Bo <[email protected]> Date: Fri May 3 01:12:14 2024 +0800 Merge pull request #70 from hunterheiden/hsh/new_task/WebSRC Bugfix: WebSRC should be token-level F1 NOT character-level commit 626e8a9 Author: Hunter Heidenreich <[email protected]> Date: Thu May 2 09:31:03 2024 -0400 Bugfix: WebSRC should be token-level F1 NOT character-level commit eef3aeb Merge: c4e9dd9 9bca441 Author: Li Bo <[email protected]> Date: Thu May 2 14:38:17 2024 +0800 Merge pull request #69 from hunterheiden/hsh/new_task/WebSRC [New Task] WebSRC (multimodal Q&A on web screenshots) commit 9bca441 Author: Hunter Heidenreich <[email protected]> Date: Wed May 1 11:07:29 2024 -0400 Add code to enable compilation of submission for WebSRC test split commit 7687495 Author: Hunter Heidenreich <[email protected]> Date: Wed May 1 10:47:32 2024 -0400 Draft and validate websrc eval on dev split commit 4eebd3e Author: Hunter Heidenreich <[email protected]> Date: Wed May 1 10:46:54 2024 -0400 Update main README with new task names commit 35fe80b Author: Hunter Heidenreich <[email protected]> Date: Wed May 1 10:46:20 2024 -0400 Draft README for WebSRC commit 955bd06 Author: Hunter Heidenreich <[email protected]> Date: Tue Apr 30 10:16:21 2024 -0400 Init webSRC commit c4e9dd9 Merge: d8a3a99 319afcc Author: Li Bo <[email protected]> Date: Fri Apr 26 14:37:22 2024 +0800 Merge pull request #63 from hunterheiden/hsh/new_task/screenspot New Task: ScreenSpot - Grounding (REC) and instruction generation (REG) on screens commit 319afcc Author: Hunter Heidenreich <[email protected]> Date: Thu Apr 25 11:44:34 2024 -0400 slight update commit 2f3811c Author: Hunter Heidenreich <[email protected]> Date: Thu Apr 25 11:41:04 2024 -0400 Add README file specific to ScreenSpot commit 28962cb Author: Hunter Heidenreich <[email protected]> Date: Wed Apr 24 11:52:33 2024 -0400 Update README to reflect new tasks commit e457cfb Author: Hunter Heidenreich <[email protected]> Date: Tue Apr 23 18:33:16 2024 -0400 Create ScreenSpot on clean branch commit d8a3a99 Merge: 3dcd015 ed17129 Author: Li Bo <[email protected]> Date: Tue Apr 23 10:34:03 2024 +0800 Merge pull request #61 from tupini07/patch-1 Fix typo in Qwen-VL that was causing "reference before assignment" commit ed17129 Author: Andrea Tupini <[email protected]> Date: Mon Apr 22 14:56:41 2024 -0600 refactor query construction for clarity commit cd87420 Author: Andrea Tupini <[email protected]> Date: Mon Apr 22 14:54:29 2024 -0600 convert contexts to list if necessary and remove unnecessary construction of `questions` commit 8557367 Author: Andrea Tupini <[email protected]> Date: Mon Apr 22 14:47:33 2024 -0600 Fix typo in qwen_vl that was causing "reference before assignment" commit 3dcd015 Merge: 95df9fe 743673a Author: Li Bo <[email protected]> Date: Sat Apr 20 22:03:16 2024 +0800 Merge pull request #60 from CaraJ7/main Add MathVerse commit 743673a Merge: c1a5472 95df9fe Author: CaraJ7 <[email protected]> Date: Sat Apr 20 21:49:02 2024 +0800 Merge branch 'main' of https://github.com/EvolvingLMMs-Lab/lmms-eval commit c1a5472 Author: CaraJ7 <[email protected]> Date: Sat Apr 20 21:45:34 2024 +0800 Add MathVerse commit 373265f Author: Gagan Bhatia <[email protected]> Date: Fri Apr 12 17:21:39 2024 -0700 Add files via upload commit d853051 Author: Gagan Bhatia <[email protected]> Date: Fri Apr 12 17:19:49 2024 -0700 Create README.md commit 8d3526c Author: cocoshe <[email protected]> Date: Thu Mar 28 13:38:36 2024 +0800 fix doc
1 parent 791e087 commit 59cce73

File tree

187 files changed

+5339
-2986
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

187 files changed

+5339
-2986
lines changed

LICENSE

Lines changed: 56 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,56 @@
1+
# For the main pipeline structure-related code, we maintain the original license provided with lm-evaluation-harness, which is the MIT License.
2+
3+
MIT License
4+
5+
Copyright (c) 2024 LMMs-Lab
6+
7+
Permission is hereby granted, free of charge, to any person obtaining a copy
8+
of this software and associated documentation files (the "Software"), to deal
9+
in the Software without restriction, including without limitation the rights
10+
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
11+
copies of the Software, and to permit persons to whom the Software is
12+
furnished to do so, subject to the following conditions:
13+
14+
The above copyright notice and this permission notice shall be included in all
15+
copies or substantial portions of the Software.
16+
17+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
18+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
19+
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
20+
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
21+
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
22+
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
23+
SOFTWARE.
24+
25+
# For the multimodal models and datasets that we have added (defined as code in the lmms_eval/tasks and lmms_eval/models folders), we apply the Apache License.
26+
27+
Apache 2.0 License
28+
29+
Copyright (c) 2024 LMMs-Lab
30+
31+
Licensed under the Apache License, Version 2.0 (the "License");
32+
you may not use this file except in compliance with the License.
33+
You may obtain a copy of the License at
34+
35+
http://www.apache.org/licenses/LICENSE-2.0
36+
37+
Unless required by applicable law or agreed to in writing, software
38+
distributed under the License is distributed on an "AS IS" BASIS,
39+
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
40+
See the License for the specific language governing permissions and
41+
limitations under the License.
42+
43+
When modifying the code, please include the following information about the original lmms-eval source:
44+
# Adopted from lmms-eval from https://github.com/EvolvingLMMs-Lab/lmms-eval. Below is the original copyright:
45+
#
46+
# Licensed under the Apache License, Version 2.0 (the "License");
47+
# you may not use this file except in compliance with the License.
48+
# You may obtain a copy of the License at
49+
#
50+
# http://www.apache.org/licenses/LICENSE-2.0
51+
#
52+
# Unless required by applicable law or agreed to in writing, software
53+
# distributed under the License is distributed on an "AS IS" BASIS,
54+
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
55+
# See the License for the specific language governing permissions and
56+
# limitations under the License.

README.md

Lines changed: 171 additions & 195 deletions
Large diffs are not rendered by default.

docs/current_tasks.md

Lines changed: 122 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,122 @@
1+
# Current Tasks
2+
3+
> () indicates the task name in the lmms_eval. The task name is also used to specify the dataset in the configuration file.
4+
> The following is manually updated documentation. You could use `lmms_eval task --list` to list all supported tasks and their task names.
5+
6+
- AI2D (ai2d)
7+
- ChartQA (chartqa)
8+
- CMMMU (cmmmu)
9+
- CMMMU Validation (cmmmu_val)
10+
- CMMMU Test (cmmmu_test)
11+
- COCO Caption (coco_cap)
12+
- COCO 2014 Caption (coco2014_cap)
13+
- COCO 2014 Caption Validation (coco2014_cap_val)
14+
- COCO 2014 Caption Test (coco2014_cap_test)
15+
- COCO 2017 Caption (coco2017_cap)
16+
- COCO 2017 Caption MiniVal (coco2017_cap_val)
17+
- COCO 2017 Caption MiniTest (coco2017_cap_test)
18+
- [ConBench](https://github.com/foundation-multimodal-models/ConBench) (conbench)
19+
- DOCVQA (docvqa)
20+
- DOCVQA Validation (docvqa_val)
21+
- DOCVQA Test (docvqa_test)
22+
- Ferret (ferret)
23+
- Flickr30K (flickr30k)
24+
- Ferret Test (ferret_test)
25+
- GQA (gqa)
26+
- HallusionBenchmark (hallusion_bench_image)
27+
- Infographic VQA (info_vqa)
28+
- Infographic VQA Validation (info_vqa_val)
29+
- Infographic VQA Test (info_vqa_test)
30+
- LLaVA-Bench (llava_in_the_wild)
31+
- LLaVA-Bench-COCO (llava_bench_coco)
32+
- MathVerse (mathverse)
33+
- MathVerse Text Dominant (mathverse_testmini_text_dominant)
34+
- MathVerse Text Only (mathverse_testmini_text_only)
35+
- MathVerse Text Lite (mathverse_testmini_text_lite)
36+
- MathVerse Vision Dominant (mathverse_testmini_vision_dominant)
37+
- MathVerse Vision Intensive (mathverse_testmini_vision_intensive)
38+
- MathVerse Vision Only (mathverse_testmini_vision_only)
39+
- MathVista (mathvista)
40+
- MathVista Validation (mathvista_testmini)
41+
- MathVista Test (mathvista_test)
42+
- MMBench (mmbench)
43+
- MMBench English (mmbench_en)
44+
- MMBench English Dev (mmbench_en_dev)
45+
- MMBench English Test (mmbench_en_test)
46+
- MMBench Chinese (mmbench_cn)
47+
- MMBench Chinese Dev (mmbench_cn_dev)
48+
- MMBench Chinese Test (mmbench_cn_test)
49+
- MME (mme)
50+
- MMMU (mmmu)
51+
- MMMU Validation (mmmu_val)
52+
- MMMU Test (mmmu_test)
53+
- MMUPD (mmupd)
54+
- MMUPD Base (mmupd_base)
55+
- MMAAD Base (mmaad_base)
56+
- MMIASD Base (mmiasd_base)
57+
- MMIVQD Base (mmivqd_base)
58+
- MMUPD Option (mmupd_option)
59+
- MMAAD Option (mmaad_option)
60+
- MMIASD Option (mmiasd_option)
61+
- MMIVQD Option (mmivqd_option)
62+
- MMUPD Instruction (mmupd_instruction)
63+
- MMAAD Instruction (mmaad_instruction)
64+
- MMIASD Instruction (mmiasd_instruction)
65+
- MMIVQD Instruction (mmivqd_instruction)
66+
- MMVet (mmvet)
67+
- Multi-DocVQA (multidocvqa)
68+
- Multi-DocVQA Validation (multidocvqa_val)
69+
- Multi-DocVQA Test (multidocvqa_test)
70+
- NoCaps (nocaps)
71+
- NoCaps Validation (nocaps_val)
72+
- NoCaps Test (nocaps_test)
73+
- OKVQA (ok_vqa)
74+
- OKVQA Validation 2014 (ok_vqa_val2014)
75+
- POPE (pope)
76+
- RefCOCO (refcoco)
77+
- refcoco_seg_test
78+
- refcoco_seg_val
79+
- refcoco_seg_testA
80+
- refcoco_seg_testB
81+
- refcoco_bbox_test
82+
- refcoco_bbox_val
83+
- refcoco_bbox_testA
84+
- refcoco_bbox_testB
85+
- RefCOCO+ (refcoco+)
86+
- refcoco+_seg
87+
- refcoco+_seg_val
88+
- refcoco+_seg_testA
89+
- refcoco+_seg_testB
90+
- refcoco+_bbox
91+
- refcoco+_bbox_val
92+
- refcoco+_bbox_testA
93+
- refcoco+_bbox_testB
94+
- RefCOCOg (refcocog)
95+
- refcocog_seg_test
96+
- refcocog_seg_val
97+
- refcocog_bbox_test
98+
- refcocog_bbox_val
99+
- ScienceQA (scienceqa_full)
100+
- ScienceQA Full (scienceqa)
101+
- ScienceQA IMG (scienceqa_img)
102+
- ScreenSpot (screenspot)
103+
- ScreenSpot REC / Grounding (screenspot_rec)
104+
- ScreenSpot REG / Instruction Generation (screenspot_reg)
105+
- SeedBench (seedbench)
106+
- SeedBench 2 (seedbench_2)
107+
- ST-VQA (stvqa)
108+
- TextCaps (textcaps)
109+
- TextCaps Validation (textcaps_val)
110+
- TextCaps Test (textcaps_test)
111+
- TextVQA (textvqa)
112+
- TextVQA Validation (textvqa_val)
113+
- TextVQA Test (textvqa_test)
114+
- VizWizVQA (vizwiz_vqa)
115+
- VizWizVQA Validation (vizwiz_vqa_val)
116+
- VizWizVQA Test (vizwiz_vqa_test)
117+
- VQAv2 (vqav2)
118+
- VQAv2 Validation (vqav2_val)
119+
- VQAv2 Test (vqav2_test)
120+
- WebSRC (websrc)
121+
- WebSRC Validation (websrc_val)
122+
- WebSRC Test (websrc_test)

lmms_eval/api/task.py

Lines changed: 42 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -778,6 +778,7 @@ def _download_from_youtube(path):
778778
force_unzip = dataset_kwargs.get("force_unzip", False)
779779
cache_path = snapshot_download(repo_id=self.DATASET_PATH, repo_type="dataset", force_download=force_download, etag_timeout=60)
780780
zip_files = glob(os.path.join(cache_path, "**/*.zip"), recursive=True)
781+
tar_files = glob(os.path.join(cache_path, "**/*.tar*"), recursive=True)
781782

782783
def unzip_video_data(zip_file):
783784
import zipfile
@@ -786,10 +787,51 @@ def unzip_video_data(zip_file):
786787
zip_ref.extractall(cache_dir)
787788
eval_logger.info(f"Extracted all files from {zip_file} to {cache_dir}")
788789

790+
def untar_video_data(tar_file):
791+
import tarfile
792+
793+
with tarfile.open(tar_file, "r") as tar_ref:
794+
tar_ref.extractall(cache_dir)
795+
eval_logger.info(f"Extracted all files from {tar_file} to {cache_dir}")
796+
797+
def concat_tar_parts(tar_parts, output_tar):
798+
with open(output_tar, "wb") as out_tar:
799+
from tqdm import tqdm
800+
801+
for part in tqdm(sorted(tar_parts)):
802+
with open(part, "rb") as part_file:
803+
out_tar.write(part_file.read())
804+
eval_logger.info(f"Concatenated parts {tar_parts} into {output_tar}")
805+
806+
# Unzip zip files if needed
789807
if force_unzip or (not os.path.exists(cache_dir) and len(zip_files) > 0):
790808
for zip_file in zip_files:
791809
unzip_video_data(zip_file)
792810

811+
# Concatenate and extract tar files if needed
812+
if force_unzip or (not os.path.exists(cache_dir) and len(tar_files) > 0):
813+
tar_parts_dict = {}
814+
815+
# Group tar parts together
816+
for tar_file in tar_files:
817+
base_name = tar_file.split(".tar")[0]
818+
if base_name not in tar_parts_dict:
819+
tar_parts_dict[base_name] = []
820+
tar_parts_dict[base_name].append(tar_file)
821+
822+
# Concatenate and untar split parts
823+
for base_name, parts in tar_parts_dict.items():
824+
eval_logger.info(f"Extracting following tar files: {parts}")
825+
output_tar = base_name + ".tar"
826+
if not os.path.exists(output_tar):
827+
eval_logger.info(f"Start concatenating tar files")
828+
829+
concat_tar_parts(parts, output_tar)
830+
eval_logger.info(f"Finish concatenating tar files")
831+
832+
if not os.path.exists(os.path.join(cache_dir, os.path.basename(base_name))):
833+
untar_video_data(output_tar)
834+
793835
accelerator.wait_for_everyone()
794836
dataset_kwargs.pop("cache_dir")
795837
dataset_kwargs.pop("video")

lmms_eval/models/__init__.py

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,9 @@
2626
"reka": "Reka",
2727
"llava_onevision": "Llava_OneVision",
2828
"from_log": "FromLog",
29-
"mplugOwlVideo": "mplug_Owl",
29+
"mplug_owl_video": "mplug_Owl",
30+
"phi3v": "Phi3v",
31+
"tinyllava": "TinyLlava",
3032
}
3133

3234
for model_name, model_class in AVAILABLE_MODELS.items():

0 commit comments

Comments
 (0)