
Conversation

@mfeurer (Collaborator) commented Dec 19, 2018

Fixes #392

@janvanrijn (Member) left a comment

LGTM

@mfeurer (Collaborator, Author) commented Dec 19, 2018

@janvanrijn (Member):

Will immediately make that error more XML-able.

@joaquinvanschoren (Contributor):

Seems like you're uploading a dataset with an ID that's already taken:

    Error Number: 1062
    Duplicate entry 'Pandas_testing_dataset-5133' for key 'nameID'

@janvanrijn (Member):

Looks like this is the result of the following:

  • Due to bad luck, two datasets with the same name were uploaded at the same time. This should not happen in practice.
  • Usually, on the main server this would not produce a MySQL error, as these errors are hidden and replaced with an XML error message (at least most, including this one, are). On the test server we run the database in debug mode, meaning that every error results in a verbose error message like this one. Given that this was only an incident, I feel we should keep it that way.
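The race described above can be sketched in miniature (hypothetical table and names, not OpenML server code): a unique key on the dataset name means the second of two same-named uploads fails with a duplicate-entry error, exactly like MySQL error 1062 here.

```python
import sqlite3

# Minimal sketch of a unique-key collision (sqlite stands in for MySQL).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE dataset (name TEXT, UNIQUE(name))")
conn.execute("INSERT INTO dataset VALUES ('Pandas_testing_dataset')")
try:
    # The second concurrent upload with the same name hits the UNIQUE key.
    conn.execute("INSERT INTO dataset VALUES ('Pandas_testing_dataset')")
except sqlite3.IntegrityError as e:
    print("duplicate entry:", e)  # sqlite's analogue of MySQL error 1062
```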

@codecov-io commented Dec 21, 2018

Codecov Report

Merging #609 into develop will increase coverage by 0.01%.
The diff coverage is 100%.


@@            Coverage Diff             @@
##           develop    #609      +/-   ##
==========================================
+ Coverage    89.88%   89.9%   +0.01%     
==========================================
  Files           32      32              
  Lines         3016    3020       +4     
==========================================
+ Hits          2711    2715       +4     
  Misses         305     305
Impacted Files       Coverage Δ
openml/testing.py    93.58% <100%> (+0.34%) ⬆️


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 7c0a77d...c90cd42.

@mfeurer (Collaborator, Author) commented Dec 21, 2018

Okay, to make sure this cannot happen again (it really confused me and required feedback from the admins), I added a (hopefully) unique sentinel so that all datasets have unique names.
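A sentinel of this kind might look like the following sketch (a hypothetical helper, not the exact code added in this PR): appending a short random suffix means two test runs uploading the same base name can no longer collide on the unique nameID key.

```python
import uuid

def add_sentinel(name: str) -> str:
    # Hypothetical helper: append a short random hex suffix so that
    # concurrently uploaded test datasets never share a name.
    return "%s_%s" % (name, uuid.uuid4().hex[:8])

print(add_sentinel("Pandas_testing_dataset"))
```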

@mfeurer (Collaborator, Author) commented Dec 21, 2018

It appears that there are some rather random test failures:

Unexpected failing examples:
/home/travis/build/openml/openml-python/examples/introduction_tutorial.py failed leaving traceback:
Traceback (most recent call last):
  File "/home/travis/build/openml/openml-python/examples/introduction_tutorial.py", line 70, in <module>
    run = openml.runs.run_flow_on_task(flow, task, avoid_duplicate_runs=False)
  File "/home/travis/build/openml/openml-python/openml/runs/functions.py", line 147, in run_flow_on_task
    "same: '%s' vs '%s'" % (str(flow.flow_id), str(flow_id))
ValueError: Result from API call flow_exists and flow.flow_id are not same: '58748' vs 'False'

and happens in the following code:

    # In case the flow does not exist, flow_id will be False (as returned
    # by flow_exists). Also check that there are no illegal flow.flow_id
    # values (compared to the result of openml.flows.flow_exists).
    if flow_id is False:
        if flow.flow_id is not None:
            raise ValueError('flow.flow_id is not None, but the flow does '
                             'not exist on the server according to '
                             'flow_exists')
        _publish_flow_if_necessary(flow)

    data_content, trace, fold_evaluations, sample_evaluations = res
    if not isinstance(flow.flow_id, int):
        # This is the usual behaviour, where the flow object was initiated off
        # line and requires some additional information (flow_id, input_id for
        # each hyperparameter) to be usable by this library
        server_flow = get_flow(flow_id)
        openml.flows.flow._copy_server_fields(server_flow, flow)
        openml.flows.assert_flows_equal(flow, server_flow,
                                        ignore_parameter_values=True)
    else:
        # This can only happen when the function is called directly, and not
        # through "run_model_on_task"
        if flow.flow_id != flow_id:
            # This should never happen, unless user made a flow-creation fault
            raise ValueError(
                "Result from API call flow_exists and flow.flow_id are not "
                "same: '%s' vs '%s'" % (str(flow.flow_id), str(flow_id))
            )

Could it be that we need to set flow_id after _publish_flow_if_necessary to avoid this error?

@ArlindKadra (Member):

@mfeurer I took a look at the code and I think you are right. If a flow is not yet on the server and does not have an id, and it then gets published successfully:
    _publish_flow_if_necessary(flow)
    data_content, trace, fold_evaluations, sample_evaluations = res
    if not isinstance(flow.flow_id, int):
        # This is the usual behaviour, where the flow object was initiated off
        # line and requires some additional information (flow_id, input_id for
        # each hyperparameter) to be usable by this library
        server_flow = get_flow(flow_id)
        openml.flows.flow._copy_server_fields(server_flow, flow)
        openml.flows.assert_flows_equal(flow, server_flow,
                                        ignore_parameter_values=True)
    else:
        # This can only happen when the function is called directly, and not
        # through "run_model_on_task"
        if flow.flow_id != flow_id:
            # This should never happen, unless user made a flow-creation fault
            raise ValueError(
                "Result from API call flow_exists and flow.flow_id are not "
                "same: '%s' vs '%s'" % (str(flow.flow_id), str(flow_id))
            )

The else branch above always gets triggered and fails, because flow_id is not updated after the flow is published. It is only assigned once, earlier in the function:

    flow_id = flow_exists(flow.name, flow.external_version)
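The fix this points at can be sketched with stub objects (not actual OpenML code; the stubbed id and server behaviour are assumptions for illustration): after publishing a flow that did not exist yet, the local flow_id must be refreshed, otherwise the later equality check compares the server-assigned id against False.

```python
# Stub sketch of the proposed fix: keep flow_id in sync after publishing.

class Flow:
    def __init__(self):
        self.flow_id = None  # offline flows start without a server id

def flow_exists(flow):
    # Stub: pretend the flow is not yet on the server.
    return False

def publish(flow):
    # Stub: the server assigns an id when the flow is published.
    flow.flow_id = 58748

flow = Flow()
flow_id = flow_exists(flow)
if flow_id is False:
    publish(flow)
    flow_id = flow.flow_id  # the missing step: refresh the local variable

assert flow.flow_id == flow_id  # the consistency check now passes
```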

@mfeurer (Collaborator, Author) commented Jan 29, 2019

Thanks @ArlindKadra, this appears to solve the issue. The build failures are unrelated.

@mfeurer mfeurer merged commit 4a7db0e into develop Jan 29, 2019
@mfeurer mfeurer deleted the fix_392_again branch January 29, 2019 08:15
janvanrijn added a commit that referenced this pull request Feb 11, 2019
Subclass all test classes from openml test helper (#609)