Contributor

@shchur shchur commented Sep 16, 2025

Issue #, if available:

Description of changes:

  • Update results and task definitions in benchmarks/ to v0.6.0.
  • Allow setting n_resamples=None to disable bootstrap confidence intervals (CIs) in leaderboard and pairwise_comparison.
  • Handle deprecated fields in results when loading evaluation summaries.
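
To illustrate the `n_resamples=None` behavior described above, here is a minimal, self-contained sketch of an aggregation helper that skips the bootstrap when resampling is disabled. The function `mean_with_optional_ci`, its parameters, and the percentile method are assumptions made for illustration only; this is not the code changed in this PR, and the library's actual `leaderboard` and `pairwise_comparison` signatures may differ.

```python
import random
import statistics


def mean_with_optional_ci(scores, n_resamples=1000, confidence=0.95, seed=0):
    """Hypothetical helper: return (mean, lower, upper).

    When n_resamples is None, no bootstrap is run and both bounds are None.
    """
    scores = list(scores)
    point = statistics.fmean(scores)

    if n_resamples is None:
        # Bootstrap disabled: report only the point estimate.
        return point, None, None

    rng = random.Random(seed)
    # Percentile bootstrap: resample with replacement, recompute the mean,
    # then sort the resampled means to read off the quantiles.
    boot_means = sorted(
        statistics.fmean(rng.choices(scores, k=len(scores)))
        for _ in range(n_resamples)
    )
    alpha = (1.0 - confidence) / 2.0
    lower = boot_means[int(alpha * (n_resamples - 1))]
    upper = boot_means[int((1.0 - alpha) * (n_resamples - 1))]
    return point, lower, upper


# Point estimate only, no resampling cost.
print(mean_with_optional_ci([1.0, 2.0, 3.0, 4.0], n_resamples=None))
# Point estimate with a bootstrap interval.
print(mean_with_optional_ci([1.0, 2.0, 3.0, 4.0], n_resamples=200))
```

Using `None` as the "off" value, rather than `0`, keeps the disabled case distinct from an invalid resample count and lets callers check for missing bounds explicitly.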

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.


@shchur shchur changed the title from "Update results 0.6.0" to "Update results to v0.6.0" Sep 16, 2025
@shchur shchur merged commit f362bae into pre-v1.0.0 Sep 16, 2025
shchur added a commit that referenced this pull request Sep 16, 2025
@shchur shchur deleted the update-results-0.6.0 branch September 16, 2025 13:09