Conversation

@PGijsbers
Collaborator

It looks like the predictions loaded from an ARFF file are read as floats by the ARFF reader, which results in a different dtype (float vs. int). Because equality of the values is already checked, I figured the dtype is not as important. That said, I am not sure why there are so many redundant comparisons in the first place. Anyway, the difference should be due to pandas' inference behavior, and if that is what we want to test, then we should make a small isolated test case instead of integrating it into every prediction unit test.
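A minimal sketch of what such an isolated test could look like (the test name and the use of `pandas.testing` are illustrative, not the actual openml-python test suite):

```python
# Hypothetical, isolated test sketch -- not the actual openml-python test suite.
# It pins down the observation above: after an ARFF round trip the dtype changes
# (int -> float), but the values themselves are still equal.
import pandas as pd
import pandas.testing as pdt


def test_arff_roundtrip_changes_dtype_but_not_values():
    written = pd.Series([0, 1, 2], name="fold")           # int64, as produced by the for loops
    read_back = pd.Series([0.0, 1.0, 2.0], name="fold")   # float64, as read back from ARFF NUMERIC

    # The dtypes differ ...
    assert written.dtype != read_back.dtype

    # ... but the values are equal, which is what the prediction tests care about.
    pdt.assert_series_equal(written, read_back, check_dtype=False)
```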

Collaborator

@mfeurer mfeurer left a comment


Fine by me, but why did this fail in the first place 😕

@PGijsbers
Collaborator Author

If I remember correctly, the numbers are integers. When openml-python executes the experiments, they are also integers (they come from for loops). When read back from an ARFF file (with the attribute annotated as NUMERIC), they are read as floats, despite really being integers.
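A small sketch of that behaviour, assuming the liac-arff package (the ARFF reader openml-python relies on); the file content is a made-up miniature of a predictions file, not an actual OpenML artifact:

```python
# Sketch assuming the liac-arff package ("arff" on PyPI).
import arff  # liac-arff

content = "\n".join([
    "@RELATION predictions",
    "@ATTRIBUTE repeat NUMERIC",
    "@ATTRIBUTE fold NUMERIC",
    "@DATA",
    "0,0",
    "0,1",
    "1,0",
])

decoded = arff.loads(content)
first_row = decoded["data"][0]
print(first_row)           # [0.0, 0.0]  -- floats, not ints
print(type(first_row[0]))  # <class 'float'>
```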

@mfeurer
Collaborator

mfeurer commented Oct 25, 2022

That description makes sense, but why did this work at some point at all?

The behavior of the ARFF reader is correct, as the ARFF format has no separate integer type. Maybe OpenML should also move away from ARFF for internal data structures.
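For illustration only, a typed columnar format would sidestep this class of problem, since integer dtypes survive the round trip; a quick sketch (assuming pyarrow or fastparquet is installed as the pandas Parquet engine):

```python
# Illustration only: integer dtypes survive a Parquet round trip, unlike ARFF NUMERIC.
import io

import pandas as pd

predictions = pd.DataFrame({"repeat": [0, 0, 1], "fold": [0, 1, 0]})  # int64 columns

buffer = io.BytesIO()
predictions.to_parquet(buffer)
buffer.seek(0)

read_back = pd.read_parquet(buffer)
print(read_back.dtypes)  # repeat    int64
                         # fold      int64
```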

@PGijsbers
Collaborator Author

> That description makes sense, but why did this work at some point at all?

My best guess is a change in pandas' behaviour, but I haven't spent time nailing it down exactly (it's interesting, but didn't seem important).

@PGijsbers PGijsbers merged commit f37ebbe into develop Nov 24, 2022
@PGijsbers PGijsbers deleted the fix_test_runs branch November 24, 2022 18:18
PGijsbers added a commit to Mirkazemi/openml-python that referenced this pull request Feb 23, 2023
It looks like the predictions loaded from an ARFF file are read as floats by the ARFF reader, which results in a different dtype (float vs. int). Because equality of the values is already checked, I figured the dtype is not as important. That said, I am not sure why there are so many redundant comparisons in the first place. Anyway, the difference should be due to pandas' inference behavior, and if that is what we want to test, then we should make a small isolated test case instead of integrating it into every prediction unit test. Finally, over the next year we should move away from ARFF.