[Feature] Add MultiPL-E & Code Evaluator #1963

Merged
tonysy merged 4 commits into open-compass:main from Zhudongsheng75:multiple_code on Mar 21, 2025

@Zhudongsheng75
Collaborator

  1. Developed a client/server-based code evaluator.
  2. Added the MultiPL-E dataset on top of this code evaluator, so OpenCompass (OC) can now run evaluation tasks in multiple programming languages.
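For readers unfamiliar with execution-based code evaluation, here is a minimal sketch of the kind of work an evaluation service might perform: concatenate the prompt, the model completion, and the test code, run the result in a child process, and report pass, fail, or timeout. The helper `run_candidate`, its return format, and the sample `add` problem are hypothetical and are not the evaluator added in this PR; Python stands in for the multiple target languages.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def run_candidate(program: str, timeout: float = 10.0) -> dict:
    """Run a self-contained program (prompt + completion + tests) in a child process.

    Hypothetical helper for illustration only; not part of OpenCompass.
    """
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "candidate.py"
        path.write_text(program, encoding="utf-8")
        try:
            proc = subprocess.run(
                [sys.executable, str(path)],
                capture_output=True,
                text=True,
                timeout=timeout,
            )
        except subprocess.TimeoutExpired:
            return {"status": "timeout", "stderr": ""}
    # A non-zero exit code means a test assertion (or the program itself) failed.
    status = "pass" if proc.returncode == 0 else "fail"
    return {"status": status, "stderr": proc.stderr}


# Assumed sample problem: the prompt is the given code prefix,
# the completion is what the model generated.
prompt = "def add(a, b):\n"
completion = "    return a + b\n"
tests = "assert add(1, 2) == 3\nassert add(-1, 1) == 0\n"

result = run_candidate(prompt + completion + tests)
print(result["status"])
```

In a client/server setup, the client would send the assembled program to the server and receive a result like the dictionary above, which keeps untrusted code execution out of the evaluation process itself.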

@tonysy
Collaborator

tonysy commented Mar 21, 2025

multiple_infer_cfg = dict(
    prompt_template=dict(type=PromptTemplate, template='Based on the provided {language} code snippet, complete the subsequent content. The initial part of the completed code must match the provided code snippet exactly:\n{prompt}'),
    retriever=dict(type=ZeroRetriever),
    inferencer=dict(type=GenInferencer, max_out_len=2048),

Please consider removing `max_out_len` here to avoid truncating the output in long chain-of-thought (CoT) decoding scenarios.

Collaborator Author

Done
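The concern behind this request can be shown with a toy example. The sketch below assumes a whitespace tokenizer as a stand-in for a real model tokenizer, and the `truncate` helper is hypothetical: when a long reasoning preamble precedes the final code, a fixed output cap can cut the response off before the code ever appears.

```python
def truncate(text: str, max_out_len: int) -> str:
    """Keep at most max_out_len whitespace-separated tokens (toy tokenizer)."""
    return " ".join(text.split()[:max_out_len])


# Assumed shape of a long-CoT response: thousands of reasoning tokens,
# followed by the code that the evaluator actually needs.
reasoning = " ".join(["step"] * 3000)
answer = "def add(a, b): return a + b"
output = f"{reasoning} {answer}"

capped = truncate(output, 2048)  # mirrors a max_out_len=2048 cap
uncapped = output                # no cap applied

print("def add" in capped)    # False: the code was cut off
print("def add" in uncapped)  # True
```

With the cap in place, the evaluator would receive only reasoning text and score the sample as a failure even if the model's final code was correct.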

change num_repeats>1, otherwise the number in
`.cache/dataset_size.json` might be inconsistent.

Args:

The docstring is missing.

Collaborator Author

Done

Collaborator

@tonysy left a comment

LGTM

@tonysy merged commit 8a5029b into open-compass:main on Mar 21, 2025
8 checks passed
@tonysy requested a review from MaiziXiao on April 28, 2025 02:49
stephen-nju pushed a commit to stephen-nju/opencompass that referenced this pull request May 14, 2025
* multiple_code develop

* multiple_code update

* comments upadate

* index upadate
zyc140345 pushed a commit to zyc140345/opencompass that referenced this pull request Oct 23, 2025
iamkaia pushed a commit to iamkaia/opencompass that referenced this pull request Feb 4, 2026