ENH Add Multiclass Brier Score Loss #22046
Conversation
Co-authored-by: Olivier Grisel <[email protected]>
+1 for merge once https://github.com/scikit-learn/scikit-learn/pull/22046/files#r1943364780 is addressed (and conflicts are resolved).
@lorentzenchr do you agree with the way this PR evolved, in particular the points I raised in #22046 (comment)?
ogrisel
left a comment
Still +1 for merge (I cannot approve the PR on GitHub because I am the creator of the PR).
doc/whats_new/upcoming_changes/sklearn.metrics/22046.feature.rst
lorentzenchr
left a comment
I would have preferred to have all the unrelated input validation and test changes in a separate PR.
```python
if name in METRICS_REQUIRE_POSITIVE_Y:
    y_true, y_pred = _require_positive_targets(y_true, y_pred)
always_symmetric = True
for _ in range(5):
```
A comment would help: why the loop? (to make lucky test passes very unlikely)
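The repeated-draw idea can be sketched as follows. This is a hypothetical helper, not the actual code in `sklearn/metrics/tests/test_common.py`; the helper name, the data generation, and the number of trials are illustrative:

```python
import numpy as np


def check_not_symmetric(metric, n_trials=5, random_state=0):
    """Hypothetical helper: raise if `metric` looks symmetric on every draw."""
    rng = np.random.RandomState(random_state)
    always_symmetric = True
    # Repeat with several independent random draws so that a single
    # "lucky" symmetric draw does not make the test pass by accident.
    for _ in range(n_trials):
        y_true = rng.randint(0, 2, size=20).astype(float)
        y_pred = rng.uniform(size=20)
        if not np.isclose(metric(y_true, y_pred), metric(y_pred, y_true)):
            always_symmetric = False
    if always_symmetric:
        raise ValueError(f"{metric.__name__} seems to be symmetric")
```

A truly symmetric metric agrees on every draw and triggers the `ValueError`, while an asymmetric one is extremely unlikely to agree on all trials.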
sklearn/metrics/tests/test_common.py
```python
if always_symmetric:  # pragma: no cover
    raise ValueError(f"{name} seems to be symmetric")
```

Suggested change:

```diff
-if always_symmetric:  # pragma: no cover
-    raise ValueError(f"{name} seems to be symmetric")
+if not always_symmetric:
+    raise ValueError(f"{name} seems to be asymmetric")
```
There should be a test for this, e.g. applying test_not_symmetric_metric to log_loss and test for a fail.
The meta test test_symmetry_tests in 6de5e13 checks test_symmetric_metric and test_not_symmetric_metric.
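As the review notes, log loss is asymmetric in `(y_true, y_pred)`, so it should fail a symmetry check. A toy illustration with a hand-rolled binary log loss (not the scikit-learn implementation; the clipping constant is arbitrary):

```python
import numpy as np


def toy_log_loss(y_true, p):
    """Binary log loss: -mean(y*log(p) + (1-y)*log(1-p)), with clipping."""
    y_true = np.asarray(y_true, dtype=float)
    p = np.clip(np.asarray(p, dtype=float), 1e-15, 1 - 1e-15)
    return float(-np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p)))


y = np.array([0.0, 1.0, 1.0])
p = np.array([0.2, 0.7, 0.9])
print(toy_log_loss(y, p))  # ≈ 0.2284
print(toy_log_loss(p, y))  # swapping the arguments gives a very different value
```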
sklearn/metrics/_classification.py
For :math:`N` observations labeled from :math:`C` possible classes, the Brier
score is defined as:

.. math::
    \\frac{1}{N}\\sum_{i=1}^{N}\\sum_{c=1}^{C}(y_{ic} - \\hat{p}_{ic})^{2}
If I remember correctly, we try to avoid LaTeX in docstrings and just link to the user guide. If LaTeX, then only in the Notes section (this may be a numpy thing). @glemaitre may know better.
The math is moved to the Notes section in 58e5f18. If you feel that's too redundant with the User Guide, I can remove the Notes section.
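For reference, the formula quoted above can be sketched directly in NumPy. This is a toy implementation for illustration only; the function added by this PR is the authoritative one, and its input validation and handling of label encodings may differ:

```python
import numpy as np


def multiclass_brier(y_true, y_prob):
    """Mean squared difference between one-hot labels and predicted probabilities.

    Implements 1/N * sum_i sum_c (y_ic - p_ic)^2, assuming y_true contains
    integer class indices 0..C-1 and y_prob has shape (N, C).
    """
    y_true = np.asarray(y_true)
    y_prob = np.asarray(y_prob, dtype=float)
    n_samples, n_classes = y_prob.shape
    # One-hot encode the labels: y_onehot[i, c] = 1 iff sample i has class c.
    y_onehot = np.zeros((n_samples, n_classes))
    y_onehot[np.arange(n_samples), y_true] = 1.0
    return float(np.mean(np.sum((y_onehot - y_prob) ** 2, axis=1)))


# Perfect probabilistic predictions give a score of 0.
print(multiclass_brier([0, 1, 2], np.eye(3)))  # 0.0
```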
Co-authored-by: Christian Lorentzen <[email protected]> Co-authored-by: Olivier Grisel <[email protected]>
Thanks for the reviews @thomasjpfan and @lorentzenchr. I think the PR is ready for a final round of reviews.
Yes, sorry that we mix refactoring the
Co-authored-by: Christian Lorentzen <[email protected]>
If you intend to do that, I would very much like to have the classification metrics better structured, i.e. putting log loss and Brier score at the top where they belong. This might also help with not needing to define things twice.
Co-authored-by: Varun Aggarwal <[email protected]> Co-authored-by: Antoine Baker <[email protected]>
Resolves #16055.
This PR updates #18699 by @aggvarun01 after a merge with `main` and resolves merge conflicts. I do not have the permissions to push directly to the original branch, and opening a sub-PR pointing to #18699 would lead to an unreadable diff because of the one-year merge sync. I also added a changelog entry and demonstrated the new function in the multiclass calibration example.
@aggvarun01 if you want, feel free to pull the last commit from this branch to your branch. Alternatively we can finalize the review here.