Conversation

@n-poulsen (Contributor)

Evaluation refactor

Improvements to the evaluation code, as well as new tests to ensure that mAP scores match the pycocotools implementation.
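To make the parity check concrete, here is a hedged sketch of what such a test could look like. `compute_map` and the file names are hypothetical stand-ins, not the actual DeepLabCut API; only the pycocotools side uses real library calls:

```python
# Hypothetical parity test: a sketch, not the actual test code from this PR.
# Assumes a `compute_map(gt, preds)` helper standing in for DeepLabCut's metric API.
import numpy as np
from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval


def pycocotools_map(gt_file: str, results_file: str) -> float:
    """Reference mAP from pycocotools for keypoint predictions in COCO format."""
    coco_gt = COCO(gt_file)
    coco_dt = coco_gt.loadRes(results_file)
    evaluation = COCOeval(coco_gt, coco_dt, iouType="keypoints")
    evaluation.evaluate()
    evaluation.accumulate()
    evaluation.summarize()
    return evaluation.stats[0]  # AP averaged over OKS thresholds 0.50:0.95


def test_map_matches_pycocotools():
    reference = pycocotools_map("gt.json", "predictions.json")
    ours = compute_map("gt.json", "predictions.json")  # hypothetical DLC helper
    np.testing.assert_allclose(ours, reference, atol=1e-6)
```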

Change list:

  • Moved all metric computation code to a deeplabcut/core/metrics folder (metrics are computed with NumPy)
  • Cleaned up the metric computation code so that prediction/ground-truth matching always happens (see the matching sketch after this list)
    • Refactored so that no out-of-memory (OOM) errors should occur, even on very large datasets (>60k images); see the streaming-accumulation sketch after this list
  • Multi-animal RMSE: only compute RMSE using (ground-truth, detection) matches with non-zero RMSE
  • Added compute_detection_rmse to compute "detection" RMSE, matching the DeepLabCut 2.X implementation (sketched below)
  • Fixed the bug for PAF models documented in #2631 (Evaluation error with PAF heads: ValueError: matrix contains invalid numeric entries); see the cost-matrix guard below
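To illustrate the matching step, here is a hedged sketch of ground-truth/prediction matching with the Hungarian algorithm, scoring RMSE only over matched pairs. Function and variable names are illustrative, not the PR's actual code:

```python
# Illustrative sketch of matched-pair RMSE (not the PR's actual implementation).
import numpy as np
from scipy.optimize import linear_sum_assignment


def matched_pair_rmse(gt: np.ndarray, pred: np.ndarray) -> float:
    """RMSE over (ground truth, detection) matches for one image.

    gt:   (num_gt_animals, num_keypoints, 2) ground-truth keypoints
    pred: (num_pred_animals, num_keypoints, 2) predicted keypoints
    """
    # Cost matrix: mean keypoint distance between each (GT, prediction) pair
    cost = np.nanmean(
        np.linalg.norm(gt[:, None] - pred[None, :], axis=-1), axis=-1
    )
    # Hungarian matching; NaN costs are replaced so the solver sees finite values
    rows, cols = linear_sum_assignment(np.nan_to_num(cost, nan=1e9))
    errors = np.linalg.norm(gt[rows] - pred[cols], axis=-1)
    return float(np.sqrt(np.nanmean(errors**2)))
```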
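And a hedged sketch of how streaming accumulation can keep memory bounded on large datasets: errors are folded into running sums image by image instead of stacking every prediction into one giant array. The class name and interface are assumptions, not the PR's code:

```python
# Illustrative sketch of OOM-safe metric accumulation (names are hypothetical).
import numpy as np


class RunningRMSE:
    """Accumulates squared errors image-by-image, so memory use stays constant
    regardless of dataset size (e.g. >60k images)."""

    def __init__(self) -> None:
        self.sum_sq = 0.0
        self.count = 0

    def update(self, gt_xy: np.ndarray, pred_xy: np.ndarray) -> None:
        # Per-keypoint squared Euclidean error for one image; NaNs mark
        # unannotated keypoints and are skipped.
        sq_dist = np.sum((gt_xy - pred_xy) ** 2, axis=-1)
        valid = ~np.isnan(sq_dist)
        self.sum_sq += float(sq_dist[valid].sum())
        self.count += int(valid.sum())

    def compute(self) -> float:
        return float(np.sqrt(self.sum_sq / self.count)) if self.count else float("nan")
```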
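For the "detection" RMSE, one plausible reading of the DeepLabCut 2.X behaviour is that each ground-truth keypoint is scored against the closest detection of the same bodypart, ignoring animal identity. The sketch below encodes that assumption; it is not the PR's actual compute_detection_rmse:

```python
# Hedged sketch of a "detection" RMSE in the DeepLabCut 2.X spirit (an
# assumption about its semantics, not the PR's actual implementation).
import numpy as np


def detection_rmse(gt: np.ndarray, detections: list[np.ndarray]) -> float:
    """
    gt:         (num_animals, num_bodyparts, 2) ground-truth keypoints
    detections: list of length num_bodyparts, each (num_detections_b, 2)
    """
    sq_errors = []
    for b, candidates in enumerate(detections):
        if len(candidates) == 0:
            continue
        for animal_gt in gt[:, b]:
            if np.any(np.isnan(animal_gt)):
                continue  # skip unannotated keypoints
            # Closest detection of this bodypart, regardless of animal identity
            d2 = np.sum((candidates - animal_gt) ** 2, axis=-1)
            sq_errors.append(d2.min())
    return float(np.sqrt(np.mean(sq_errors))) if sq_errors else float("nan")
```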
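On the PAF bug: "ValueError: matrix contains invalid numeric entries" is the error scipy.optimize.linear_sum_assignment raises when its cost matrix contains NaN or inf. A plausible guard, which is an assumption and not necessarily the PR's exact fix, is to sanitize the PAF cost matrix before assignment:

```python
# Plausible guard against #2631 (an assumption, not necessarily the PR's fix):
# sanitize the cost matrix before Hungarian assignment.
import numpy as np
from scipy.optimize import linear_sum_assignment


def safe_assignment(cost: np.ndarray):
    # Replace non-finite affinity costs with a large finite penalty, so the
    # solver never sees invalid entries; such pairs become effectively unmatchable.
    finite = np.isfinite(cost)
    if not finite.all():
        fill = cost[finite].max() + 1e6 if finite.any() else 1e6
        cost = np.where(finite, cost, fill)
    return linear_sum_assignment(cost)
```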

@n-poulsen requested review from MMathisLab and jeylau on July 19, 2024 at 12:12
@MMathisLab (Member) left a comment:

lgtm, although I did not test

@MMathisLab (Member) left a comment:

but see suggested changes to docstrings

@MMathisLab (Member) left a comment:

maybe in main docs we need to add information about these new metrics as well @n-poulsen
