Provide clearer error when server provides bad data description XML #1178

PGijsbers · 2022-10-21T13:22:39Z

Debugging the errors of some dataset unit tests were particularly difficult, because the real reason for their errors (malformed XML) got swallowed as if the dataset is still in preprocessing (hence the change in test_dataset_functions). By our own definition, we should only catch OpenMLServerException there ("exception for when the result of the server was not 200").

The malformed XML error was still hard to debug, since only the xml parse ExpartError was provided, which does not provide information on the XML file. Restructuring the _get_dataset_description function has two purposes:

now a new XML file will be downloaded, even if a local cached description xml got malformed somehow, and
check that an XML file from the server is properly formed before storing it, and reporting a clearer error otherwise (provide the exact endpoint that serves the malformed URL).

I ranpytest pytest tests/test_datasets/test_dataset_functions.py::TestOpenMLDataset locally, and all tests still pass (except those with known server or parquet issues).

…penml#1178)

Provide clearer error when server provides bad data description XML

c03e2dc

PGijsbers requested a review from mfeurer October 21, 2022 13:22

mfeurer approved these changes Oct 24, 2022

View reviewed changes

PGijsbers merged commit e6250fa into develop Oct 24, 2022

PGijsbers deleted the improve_error_message_bad_dataset_description branch October 24, 2022 17:58

PGijsbers added a commit to Mirkazemi/openml-python that referenced this pull request Feb 23, 2023

Provide clearer error when server provides bad data description XML (o…

5e2be36

…penml#1178)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Provide clearer error when server provides bad data description XML #1178

Provide clearer error when server provides bad data description XML #1178

Uh oh!

PGijsbers commented Oct 21, 2022 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Provide clearer error when server provides bad data description XML #1178

Provide clearer error when server provides bad data description XML #1178

Uh oh!

Conversation

PGijsbers commented Oct 21, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

PGijsbers commented Oct 21, 2022 •

edited

Loading