Skip to content

[Feature] Add P-MMEval#1714

Merged
liushz merged 7 commits intoopen-compass:mainfrom
wanyu2018umac:PMMEval
Nov 27, 2024
Merged

[Feature] Add P-MMEval#1714
liushz merged 7 commits intoopen-compass:mainfrom
wanyu2018umac:PMMEval

Conversation

@wanyu2018umac
Copy link
Copy Markdown
Contributor

@wanyu2018umac wanyu2018umac commented Nov 25, 2024

Motivation

This PR introduces the implementation of P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs (see paper link). The P-MMEval benchmark delivers support for evaluating LLMs on multilingual capabilities with examples in 10 languages.

Modification

  • Configs:
    • Add files in configs/datasets/PMMEval for evaluation support. For each subset in P-MMEval (i.e., flores, humaneval-xl, mgsm, mhellaswag, mifeval, mlogiqa, mmmlu, and xnli), each dataset python file is created.
    • Add files in configs/summarizers and configs/summarizers/groups for summarizing the evaluation results on P-MMEval.
  • Datasets
    • Add files in datasets supporting the loading and evaluation for each subset.

Checklist

Before PR:

  • Pre-commit or other linting tools are used to fix the potential lint issues.
  • Bug fixes are fully covered by unit tests, the case that causes the bug should be added in the unit tests.
  • The modification is covered by complete unit tests. If not, please add more unit test to ensure the correctness.
  • The documentation has been modified accordingly, like docstring or example tutorials.

After PR:

  • If the modification has potential influence on downstream or other related projects, this PR should be tested with those projects.
  • CLA has been signed and all committers have signed the CLA in this PR.

@wanyu2018umac wanyu2018umac changed the title [Update] Add P-MMEval [Feature] Add P-MMEval Nov 25, 2024
@liushz liushz merged commit 90efcf2 into open-compass:main Nov 27, 2024
stephen-nju pushed a commit to stephen-nju/opencompass that referenced this pull request May 14, 2025
* Update with PMMEval

* Update

* Update __init__.py

* Fix Bugs

* Delete .pre-commit-config.yaml

* Pull merge

---------

Co-authored-by: liushz <[email protected]>
zyc140345 pushed a commit to zyc140345/opencompass that referenced this pull request Oct 23, 2025
* Update with PMMEval

* Update

* Update __init__.py

* Fix Bugs

* Delete .pre-commit-config.yaml

* Pull merge

---------

Co-authored-by: liushz <[email protected]>
iamkaia pushed a commit to iamkaia/opencompass that referenced this pull request Feb 4, 2026
* Update with PMMEval

* Update

* Update __init__.py

* Fix Bugs

* Delete .pre-commit-config.yaml

* Pull merge

---------

Co-authored-by: liushz <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants