ENH add `from_cv_results` in `RocCurveDisplay` (single `RocCurveDisplay`) by lucyleeow · Pull Request #30399 · scikit-learn/scikit-learn

lucyleeow · 2024-12-03T11:27:13Z

Reference Issues/PRs

Supercedes #25939

This is part of a group of draft PRs to determine best API for adding plots for cv results to our displays.

Add multi display class (ENH add from_cv_results in RocCurveDisplay (Multi-display) #30359)
Use list of single display classes (ENH add from_cv_results in RocCurveDisplay (list of displays) #30370)
Amend single display class to optionally return list (this PR)

For all 3 options we take the output of cross_validate, and use the fitted estimator and test indicies. No fitting is done in the display.

We do recalculate the predictions (which would have already been done in cross_validate), which could be avoided if we decided to change cross_validate to optionally return the predictions as well (note this would make cross_val_predict redundant).
See more thread: #25939 (comment)). I think should be outside of the scope of this body of work though.

What does this implement/fix? Explain your changes.

Not 100% I've implemented this optimally.

RocCurveDisplay object may contain data (fpr/tpf etc) for single or multi curves
RocCurveDisplay returns single mpl Artist object, or list of objects for multi curves
RocCurveDisplay.plot handles both single and multi-curve plotting, this has meant a lot more checking is required (c.f. the other 2 implementations, as this is the only case where you can use plot directly to plot multi-curves)

More specific concerns detailed in review comments

Plot looks like:

TODO

We should update visualization.rst after this PR is in to add a section about from_cv_results.

github-actions · 2024-12-03T11:28:31Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: ec04011. Link to the linter CI: here}

sklearn/metrics/_plot/roc_curve.py

lucyleeow · 2024-12-03T11:32:51Z

sklearn/utils/_plotting.py

+        if n_multi is None:
+            name = self.estimator_name if name is None else name
+        else:
+            name = [f"{curve_type} fold {curve_idx}:" for curve_idx in range(n_multi)]


If we go with this implementation, I thought this change could be used for other multi cv displays. Not 100% sure on this change though.

jeremiedbb · 2024-12-11T15:59:18Z

As discussed in today's meeting, this is my favorite solution because it's the simplest and least surprising one from a user point of view, even though it adds a bit more internal complexity than the others. And I think we can mitigate some of it by extracting parts of the plot code into dedicated _plot_single and _plot_multiple methods. Or just into small helpers, that would already help readability.

It also looks like a good portion of the added complexity will be exactly the same for other displays like PRCurveDisplay, so there might be a chance that we'll be able to factorize some parts to be used by several displays.

lucyleeow · 2024-12-31T04:15:29Z

The changes in f0908e1 and 7e77d4c factorizes out common code (compared to #30508), adding helper function to either _BinaryClassifierCurveDisplayMixin (if function relevant to other binary displays) or sklearn/utils/_plotting.py (if function more generally applicable to more diplays - these potentially could be a parent class method?)

sklearn/metrics/_plot/tests/test_roc_curve_display.py

glemaitre

Only a couple of nitpicks. It looks great on my side and almost ready to be merged.

lucyleeow · 2025-04-24T12:07:45Z

I think I have addressed everything, thanks @glemaitre !

lucyleeow · 2025-05-19T03:17:09Z

@jeremiedbb gentle ping on this, thanks!

jeremiedbb

I pushed tiny nitpicks.

LGTM. Thanks !

lucyleeow · 2025-05-26T22:36:57Z

Thanks @jeremiedbb !

Co-authored-by: Jérémie du Boisberranger <[email protected]>

first commit

a1442e5

lucyleeow marked this pull request as draft December 3, 2024 11:27

github-actions bot added module:metrics module:utils labels Dec 3, 2024

lint

f4f9a98

lucyleeow commented Dec 3, 2024

View reviewed changes

This was referenced Dec 3, 2024

ENH add from_cv_results in RocCurveDisplay (Multi-display) #30359

Closed

ENH add from_cv_results in RocCurveDisplay (list of displays) #30370

Closed

fixes

9e87e13

lucyleeow mentioned this pull request Dec 19, 2024

ENH add from_cv_results in PrecisionRecallDisplay (single Display) #30508

Merged

lucyleeow added 2 commits December 31, 2024 15:10

factorize

f0908e1

fix

7e77d4c

lucyleeow mentioned this pull request Jan 7, 2025

Different(wrong?) meaning of pos_label=None #10010

Open

lucyleeow added 14 commits January 15, 2025 11:01

review changes

1a4a2f3

merge main

4e25ffc

lint

042c0f6

ignore mypy

e8a5073

fix validate plot param

4431c55

fix from predictions

34d8051

fix example in docstring

dc6adce

fix tests

bdf2e43

fix docstring example

f5dbb1d

black

144fd13

black

0aa7751

fix tests

70f0127

fix testst

b9e1b0b

fix tests

73984e3

glemaitre reviewed Apr 23, 2025

View reviewed changes

sklearn/metrics/_plot/tests/test_roc_curve_display.py Outdated Show resolved Hide resolved

glemaitre reviewed Apr 23, 2025

View reviewed changes

lucyleeow mentioned this pull request Apr 23, 2025

Use more complex data in test_roc_curve_display.py #31243

Closed

lucyleeow added 2 commits April 24, 2025 13:40

review

df177ab

add multi test

2de0f78

glemaitre approved these changes Apr 30, 2025

View reviewed changes

lucyleeow added 2 commits May 5, 2025 11:43

merge main

b53666b

fix tests for new data

960c683

glemaitre mentioned this pull request May 5, 2025

ENH add CAP curve #28972

Open

17 tasks

merge main

bc5fad4

lucyleeow added the Waiting for Second Reviewer First reviewer is done, need a second one! label May 19, 2025

nitpicks

1c57ef9

jeremiedbb approved these changes May 26, 2025

View reviewed changes

Merge remote-tracking branch 'upstream/main' into pr/lucyleeow/30399

ec04011

jeremiedbb merged commit 1b05e8f into scikit-learn:main May 26, 2025
36 checks passed

lucyleeow mentioned this pull request May 30, 2025

DOC Use from_cv_results in plot_roc_crossval.py #31455

Merged

jeremiedbb added a commit to jeremiedbb/scikit-learn that referenced this pull request May 30, 2025

FEA add from_cv_results in RocCurveDisplay (scikit-learn#30399)

7782c8f

Co-authored-by: Jérémie du Boisberranger <[email protected]>

elhambbi pushed a commit to elhambbi/scikit-learn that referenced this pull request Jun 1, 2025

FEA add from_cv_results in RocCurveDisplay (scikit-learn#30399)

4dd2f54

Co-authored-by: Jérémie du Boisberranger <[email protected]>

jeremiedbb added a commit that referenced this pull request Jun 5, 2025

FEA add from_cv_results in RocCurveDisplay (#30399)

d9432c6

Co-authored-by: Jérémie du Boisberranger <[email protected]>

lucyleeow deleted the cv_results3 branch June 13, 2025 02:39

lucyleeow mentioned this pull request Jun 18, 2025

Fix RocCurveDisplay docstring and parameter order #31578

Merged

JosephBARBIERDARNAL added a commit to JosephBARBIERDARNAL/scikit-learn that referenced this pull request Aug 20, 2025

rename **kwargs to curve_:dict kwargs to follow scikit-learn#30399 style

ff0f328

This was referenced Sep 21, 2025

ENH Add from_cv_results to DetCurveDisplay #32235

Draft

DOC Fix docstring in RocCurveDisplay and add from_cv_results to see also #32237

Merged

StefanieSenger added this to Visualization and displays Oct 13, 2025

StefanieSenger moved this to Done in Visualization and displays Oct 13, 2025

lucyleeow mentioned this pull request Feb 10, 2026

TST Refactor out helper in pos_label display curve tests #33223

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ENH add `from_cv_results` in `RocCurveDisplay` (single `RocCurveDisplay`)#30399

ENH add `from_cv_results` in `RocCurveDisplay` (single `RocCurveDisplay`)#30399
jeremiedbb merged 77 commits intoscikit-learn:mainfrom
lucyleeow:cv_results3

lucyleeow commented Dec 3, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Dec 3, 2024 •

edited

Loading

Uh oh!

Uh oh!

lucyleeow Dec 3, 2024

Uh oh!

jeremiedbb commented Dec 11, 2024

Uh oh!

lucyleeow commented Dec 31, 2024

Uh oh!

Uh oh!

glemaitre left a comment

Uh oh!

lucyleeow commented Apr 24, 2025

Uh oh!

lucyleeow commented May 19, 2025

Uh oh!

jeremiedbb left a comment

Uh oh!

lucyleeow commented May 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

lucyleeow commented Dec 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

Reference Issues/PRs

What does this implement/fix? Explain your changes.

TODO

Uh oh!

github-actions bot commented Dec 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✔️ Linting Passed

Uh oh!

Uh oh!

lucyleeow Dec 3, 2024

Choose a reason for hiding this comment

Uh oh!

jeremiedbb commented Dec 11, 2024

Uh oh!

lucyleeow commented Dec 31, 2024

Uh oh!

Uh oh!

glemaitre left a comment

Choose a reason for hiding this comment

Uh oh!

lucyleeow commented Apr 24, 2025

Uh oh!

lucyleeow commented May 19, 2025

Uh oh!

jeremiedbb left a comment

Choose a reason for hiding this comment

Uh oh!

lucyleeow commented May 26, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

lucyleeow commented Dec 3, 2024 •

edited

Loading

github-actions bot commented Dec 3, 2024 •

edited

Loading