
Conversation

@Neeratyoy
Contributor

Reference Issue

Addresses #261.

What does this PR implement/fix? Explain your changes.

Keeps track of entities uploaded to the test server and iteratively deletes them after all unit tests have completed.
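
A minimal sketch of the pattern, assuming a class-level tracker on TestBase and the private deletion helper openml.utils._delete_entity (the exact names here are assumptions, not necessarily what the diff uses):

```python
from collections import defaultdict

import openml


class TestBase:
    # entity_type -> list of IDs uploaded to the test server during the run
    publish_tracker = defaultdict(list)

    @classmethod
    def _mark_entity_for_removal(cls, entity_type, entity_id):
        cls.publish_tracker[entity_type].append(entity_id)


def delete_tracked_entities():
    # Called once after all unit tests have completed.
    for entity_type, entity_ids in TestBase.publish_tracker.items():
        for entity_id in entity_ids:
            try:
                openml.utils._delete_entity(entity_type, entity_id)
                print("Deleted ({}, {})".format(entity_type, entity_id))
            except openml.exceptions.OpenMLServerException as e:
                # Some entities refuse deletion, e.g. subflows still in use.
                print("Could not delete ({}, {}): {}".format(
                    entity_type, entity_id, e))
```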

How should this PR be tested?

By running pytest -s and observing the logs.

@Neeratyoy
Contributor Author

@mfeurer
A few things I need advice on:

  • The following IDs refuse to be deleted:
    • flows that are subflows of other flows: "flow is in use by other content (it is a subflow). Can not be deleted"
    • study 307 (the unit test benchmark suite): "Can only delete studies based on runs or studies in preparation"
  • Currently the only way to manually check whether the deletion works is to capture the logs, in which I have a bunch of print statements. Although these are structured and hence visually discernible, they will evidently make the logs considerably more verbose.

@Neeratyoy Neeratyoy requested a review from mfeurer July 11, 2019 21:24
@codecov-io

codecov-io commented Jul 12, 2019

Codecov Report

Merging #735 into develop will increase coverage by 0.15%.
The diff coverage is 97.36%.

Impacted file tree graph

@@             Coverage Diff             @@
##           develop     #735      +/-   ##
===========================================
+ Coverage    87.82%   87.97%   +0.15%     
===========================================
  Files           36       36              
  Lines         3999     4033      +34     
===========================================
+ Hits          3512     3548      +36     
+ Misses         487      485       -2
Impacted Files        Coverage Δ
openml/testing.py     96.38% <97.36%> (+0.17%) ⬆️
openml/_api_calls.py  88.31% <0%> (+3.89%) ⬆️

Continue to review full report at Codecov.

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 347c4a6...9208c4f. Read the comment docs.

Collaborator

@mfeurer mfeurer left a comment


Thanks, this looks good, although I think it needs a bit of simplification.

Also, could you please add to the contribution guide that the test server needs to be cleaned up after the unit tests?

@mfeurer
Collaborator

mfeurer commented Jul 12, 2019

The following IDs refuse to be deleted: flow is in use by other content (it is a subflow). Can not be deleted

I'm afraid that you need to recursively add all subflows to the tracker to make sure there are no leftovers.
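
A sketch of what that recursive tracking could look like; the helper name is hypothetical, but OpenMLFlow does expose its subflows via the components dict:

```python
def _track_flow_and_subflows(flow, tracker):
    # Hypothetical helper: register a flow and, recursively, all of its
    # subflows, so no flow is left behind on the test server.
    tracker.append(flow.flow_id)
    for subflow in flow.components.values():
        _track_flow_and_subflows(subflow, tracker)
```

Deletion would then have to run top-down (parent flows before their subflows), since the server refuses to delete a flow that is still referenced as a subflow.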

Can only delete studies based on runs or studies in preparation: study 307 (unit test benchmark suite)

@janvanrijn is this intended behavior?

Currently the only way to manually check whether the deletion works is to capture the logs, in which I have a bunch of print statements. Although these are structured and hence visually discernible, they will evidently make the logs considerably more verbose.

That's fine by me.

@Neeratyoy
Contributor Author

@mfeurer I tried to debug the clustering task failure. Here are my observations.

The task being uploaded has the following attributes, as initialized by the setUp of class OpenMLClusteringTaskTest:

  • task_type_id = 5
  • estimation_procedure = 17
  • dataset_id as generated by _get_compatible_rand_dataset()

This is attempted up to 100 times, until the publish() of the task succeeds. If all attempts fail, the current ValueError of Could not create a valid task for task type ID 5 is raised.

On the test server, the following commands:
l = openml.tasks.list_tasks(task_type_id=5, output_format='dataframe')
print(set(l.did))
result in {2, 10, 11, 14, 22, 24, 25, 35, 36, 37, 42, 91, 92}

If we record all 100 dataset_ids generated by _get_compatible_rand_dataset() during the test case's publish attempts, we get exactly {2, 10, 11, 14, 22, 24, 25, 35, 36, 37, 42, 91, 92} again: every dataset the generator can produce already has a clustering task on the server.
This explains why we cannot upload a task even in 100 trials.
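
For reference, this diagnosis can be reproduced with a short snippet (the test-server URL is assumed to match what TestBase configures):

```python
import openml

# Point the client at the OpenML test server, as the unit tests do.
openml.config.server = "https://test.openml.org/api/v1/xml"

# Dataset IDs that already have a clustering task (task_type_id=5).
tasks = openml.tasks.list_tasks(task_type_id=5, output_format="dataframe")
print(set(tasks.did))  # {2, 10, 11, 14, 22, 24, 25, 35, 36, 37, 42, 91, 92}
```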

Using the apikey we define in class TestBase, I explicitly deleted one of the tasks linked to dataset_id = 22 and then ran the unit test. It passed as expected, since a task with those attributes no longer existed.

Is all of this expected behaviour?

@mfeurer
Collaborator

mfeurer commented Jul 18, 2019

Is all of this expected behaviour?

No. It turns out that _get_compatible_rand_dataset only takes classification and regression into account. Clustering tasks can be created for any dataset, so the tests should not fail. Could you please extend that function?
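
A sketch of how the extension could look, assuming the function keeps its current signature (the compatibility checks are simplified stand-ins for whatever the real implementation does):

```python
import numpy as np
import openml

TASK_TYPE_CLUSTERING = 5  # matches the task_type_id used above


def _get_compatible_rand_dataset(task_type_id):
    # List the active datasets on the (test) server as a dataframe.
    datasets = openml.datasets.list_datasets(
        status="active", output_format="dataframe")
    if task_type_id == TASK_TYPE_CLUSTERING:
        # Clustering needs no target attribute, so any dataset qualifies.
        compatible = datasets
    else:
        # Simplified stand-in: classification/regression need datasets
        # with a known number of classes.
        compatible = datasets.dropna(subset=["NumberOfClasses"])
    return int(np.random.choice(compatible.did.values))
```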

@Neeratyoy
Contributor Author

@mfeurer

Most test case issues seem to be fixed, barring one test which fails consistently.
Could you please explain what this does: https://github.com/openml/openml-python/blob/develop/tests/test_flows/test_flow_functions.py#L277

My local sklearn version is 0.20.2, while flow_id=20 returns code 181: unknown flow from the test server. Please advise on how I can solve this.
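
A quick way to see that response directly (a sketch; OpenMLServerException carries the server's error code):

```python
import openml
from openml.exceptions import OpenMLServerException

openml.config.server = "https://test.openml.org/api/v1/xml"

try:
    flow = openml.flows.get_flow(20)
    print(flow.name)
except OpenMLServerException as e:
    # On the test server this raises code 181: unknown flow,
    # i.e. flow_id=20 does not exist there.
    print(e.code, e.message)
```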

@Neeratyoy
Contributor Author

@mfeurer just another tiny thing.
I should add instructions on tracking the files being uploaded when adding a unit test.
Should I edit contributing.rst or contributing.md for that? Where would you like that information to be included?

@mfeurer
Collaborator

mfeurer commented Jul 19, 2019

Where would you like that information to be included?

Actually, I think I meant this file: https://raw.githubusercontent.com/openml/openml-python/develop/PULL_REQUEST_TEMPLATE.md

@Neeratyoy
Contributor Author

I didn't edit contributing.rst since that page links to contributing.md. I also edited PULL_REQUEST_TEMPLATE.md with brief instructions on this.

@mfeurer
Collaborator

mfeurer commented Jul 19, 2019

Okay, let me know when I should have a look at this again.

@Neeratyoy Neeratyoy requested a review from mfeurer July 19, 2019 14:57
@mfeurer mfeurer merged commit 56fcc00 into develop Jul 19, 2019
@mfeurer mfeurer deleted the fix_261 branch July 19, 2019 15:03