Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix MMMU-Pro evaluation #296

Merged
merged 1 commit into from
Oct 5, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion lmms_eval/tasks/mmmu_pro/mmmu_pro.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -2,4 +2,4 @@ group: mmmu_pro
task:
- mmmu_pro_vision
# - mmmu_pro_composite # removing composite task in formal MMMU-Pro evaluation
- mmmu_pro_original
- mmmu_pro_standard
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
task: "mmmu_pro_original"
task: "mmmu_pro_standard"
dataset_path: MMMU/MMMU_Pro
dataset_name: original
dataset_name: standard
test_split: test
output_type: generate_until
doc_to_visual: !function utils.mmmu_pro_doc_to_visual
Expand Down
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
task: "mmmu_pro_original_cot"
task: "mmmu_pro_standard_cot"
dataset_path: MMMU/MMMU_Pro
dataset_name: original
dataset_name: standard
test_split: test
output_type: generate_until
doc_to_visual: !function utils.mmmu_pro_doc_to_visual
Expand Down
2 changes: 1 addition & 1 deletion lmms_eval/tasks/mmmu_pro/mmmu_pro_vision.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ dataset_name: vision
test_split: test
output_type: generate_until
doc_to_visual: !function utils.mmmu_pro_doc_to_visual
doc_to_text: "Answer with the option letter from the given choices directly. The last line of your response should be of the following format: 'Answer: $LETTER' (without quotes) where LETTER is one of options."
doc_to_text: "Answer with the option letter from the given choices directly."
doc_to_target: "{{answer}}"
# The return value of process_results will be used by metrics
process_results: !function utils.mmmu_pro_process_results
Expand Down
Loading