Commit 4854a34

pufanyi authored and Luodian committed
Group MMMU images into one image (#83)
* update
* update font
* Add matplotlib.font_manager import in utils.py
* Refactor font handling in add_order_label function in utils.py
* group mmmu

---------

Co-authored-by: Li Bo <[email protected]>
1 parent 24dc435 commit 4854a34
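
The commit title describes grouping a question's images into one image, but the helper that does this (in the changed file not shown in this view) is not reproduced here. As a minimal sketch of what such grouping could involve, the following computes where each image would be pasted on a shared canvas. The left-to-right arrangement, the `gap` parameter, and the name `grouped_layout` are all assumptions for illustration, not the commit's actual code.

```python
from typing import List, Tuple

Size = Tuple[int, int]  # (width, height) in pixels


def grouped_layout(sizes: List[Size], gap: int = 10) -> Tuple[Size, List[Tuple[int, int]]]:
    """Hypothetical helper: canvas size and top-left paste offset per image
    when images are placed left to right on a single canvas."""
    if not sizes:
        raise ValueError("at least one image size is required")
    offsets = []
    x = 0
    for width, _height in sizes:
        offsets.append((x, 0))
        x += width + gap  # leave a gap before the next image
    canvas_width = x - gap  # no trailing gap after the last image
    canvas_height = max(height for _width, height in sizes)
    return (canvas_width, canvas_height), offsets


canvas, offsets = grouped_layout([(200, 100), (50, 300), (120, 80)])
print(canvas)   # (390, 300)
print(offsets)  # [(0, 0), (210, 0), (270, 0)]
```

An image library would then create a canvas of the returned size and paste each image at its offset; an order label, as hinted at by `add_order_label` in the commit message, could be drawn at the same positions.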

4 files changed, +633 -0 lines changed

Lines changed: 4 additions & 0 deletions
@@ -0,0 +1,4 @@
+group: mmmu_group_img
+task:
+- mmmu_val_group_img
+- mmmu_test_group_img
Lines changed: 19 additions & 0 deletions
@@ -0,0 +1,19 @@
+dataset_path: lmms-lab/MMMU
+task: "mmmu_test_group_img"
+test_split: test
+output_type: generate_until
+doc_to_visual: !function utils_group_img.mmmu_doc_to_visual
+doc_to_text: !function utils_group_img.mmmu_doc_to_text
+doc_to_target: "answer"
+# The return value of process_results will be used by metrics
+process_results: !function utils_group_img.mmmu_process_results
+# Note that the metric name can be either a registed metric function (such as the case for GQA) or a key name returned by process_results
+generation_kwargs:
+  max_new_tokens: 16
+  image_aspect_ratio: original
+metric_list:
+  - metric: submission
+    aggregation: !function utils_group_img.mmmu_test_aggregate_results_for_submission
+    higher_is_better: true
+metadata:
+  - version: 0.0
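
The `!function utils_group_img.mmmu_doc_to_visual` entries above name a Python function as `module.function`. A minimal sketch of how such a dotted reference could be turned into a callable is shown below. `resolve_function` is a hypothetical name, and the evaluation harness's real loader may differ, for example in how it locates `utils_group_img.py` relative to the YAML file.

```python
import importlib
from typing import Callable


def resolve_function(reference: str) -> Callable:
    """Hypothetical resolver: split 'module.function' on the last dot,
    import the module, and return the named attribute."""
    module_name, _, function_name = reference.rpartition(".")
    if not module_name:
        raise ValueError(f"expected 'module.function', got {reference!r}")
    module = importlib.import_module(module_name)
    return getattr(module, function_name)


# A standard-library function stands in for utils_group_img.mmmu_doc_to_visual.
sqrt = resolve_function("math.sqrt")
print(sqrt(9.0))  # 3.0
```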
Lines changed: 21 additions & 0 deletions
@@ -0,0 +1,21 @@
+dataset_path: lmms-lab/MMMU
+task: "mmmu_val_group_img"
+test_split: validation
+output_type: generate_until
+doc_to_visual: !function utils_group_img.mmmu_doc_to_visual
+doc_to_text: !function utils_group_img.mmmu_doc_to_text
+doc_to_target: "answer"
+# The return value of process_results will be used by metrics
+process_results: !function utils_group_img.mmmu_process_results
+# Note that the metric name can be either a registed metric function (such as the case for GQA) or a key name returned by process_results
+generation_kwargs:
+  max_new_tokens: 16
+model_specific_generation_kwargs:
+  llava:
+    image_aspect_ratio: original
+metric_list:
+  - metric: mmmu_acc
+    aggregation: !function utils_group_img.mmmu_aggregate_results
+    higher_is_better: true
+metadata:
+  - version: 0.0
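
The comments in the config say that the return value of `process_results` feeds the metrics and that a metric name can be a key returned by `process_results`. The sketch below illustrates that contract for `mmmu_acc` under simplifying assumptions: the function bodies, the record fields (`id`, `answer`, `parsed_pred`), and the exact-match scoring are hypothetical and are not the code in `utils_group_img.py`.

```python
from typing import Dict, List


def process_results(doc: Dict[str, str], prediction: str) -> Dict[str, Dict[str, str]]:
    """Hypothetical per-document step: the returned key must match the
    `metric` name in metric_list so the harness can route it."""
    record = {"id": doc["id"], "answer": doc["answer"], "parsed_pred": prediction.strip()}
    return {"mmmu_acc": record}


def aggregate_results(results: List[Dict[str, str]]) -> float:
    """Hypothetical aggregation: reduce every record stored under
    'mmmu_acc' to a single accuracy value."""
    if not results:
        return 0.0
    correct = sum(1 for r in results if r["parsed_pred"] == r["answer"])
    return correct / len(results)


docs = [{"id": "q1", "answer": "A"}, {"id": "q2", "answer": "C"}]
predictions = ["A", " B "]
per_doc = [process_results(d, p)["mmmu_acc"] for d, p in zip(docs, predictions)]
print(aggregate_results(per_doc))  # 0.5
```

The test-split config follows the same shape but uses a `submission` metric, whose aggregation function name suggests it collects predictions for submission rather than scoring them locally.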
