Skip to content

Conversation

@mfeurer
Copy link
Collaborator

@mfeurer mfeurer commented Feb 13, 2019

No description provided.

joaquinvanschoren and others added 30 commits April 25, 2018 13:26
Synchronize master and development branch again
added serialize run functionality
Bump version number for develop
* Added installs. Removed test for different bit-version of Python.

* For Appveyor, the APIKey should also be saved to run notebook tests.

* Allow a-f only for memory address, but both lower and uppercase.

* Exclude requirement that evaluation takes longer than 0ms for CI_WINDOWS, as these measurements are (probably) less accurate. This should only be a temporary solution, timing should really be accurate regardless of platform.

* Added clone folder.

* OS import
* Removing split pickling

* Refactoring the code

* Removing cache flag

* Fixing bug

* Refactoring code
* Bug fixes when uploading datasets, removed unnecessary variable from test method.

* Added create dataset function in datasets/functions.

* Refactored OpenMLDataset. 

* Refactored _api_calls. 

* Made the necessary changes to the dataset tutorial. Added the tutorial in the unit tests.
* Allow either order of arguments task and flow for method run_flow_on_task.

* Also allow task and model to have swapped order in .

* Two simple tests to check for normal behavior when calling run_model/flow_on_task with swapped arguments.

* Made swapped order default. Added deprecation warning for using old order.
* ADD raise exception when failing to create sklearn flow

* Update changelog
…ation (#300)

* flow deserilization keep defaults

* compatibility with python 2.7

* safe guard against fns without defaults

* improve documentation and remove kwargs
ArlindKadra and others added 26 commits October 17, 2018 14:24
* Add unit test for list of lists dataset upload

* Fixing xml pattern typo

* Fix pep8 no newline at the end of file

* Remove format from definitions

* Restoring format in dataset

* Fixing a couple of unused imports and fixings bugs with create_dataset call

* Adapting unit tests to changes

* Fixing failing unit tests

* fixing typo

* Enforce pep8 style guide, fix doc tutorial trying to invoke create_dataset with format attribute

* Workaround for pep8 style guide

* fix long time typo

* update pep8 failing statement and bug fix for dataset upload tutorial

* fixed problem with arff file

* Fix pep8 line too long

* Extending the unit test for dataset upload, changing upload tutorial

* Workaround for the dataset upload unit test

* Adding example with weather dataset into the dataset upload tutorial

* Fixing builds failure

* Adding support for sparse datasets, implementing corresponding unit tests

* fix bug

* More unit tests and bug fix

* Fixing bugs

* Fix bug and pep8 errors

* Enforcing pep8 and fixing changing the name of attribute format as it is a built-in

* Implementing change in a better way

* Fixing bugs introduced by changing the format in the constructor

* Another try to tackle the bugs

* Small refactor

* Fixing pep8 error

* Fix python2.7 bug

* making changes in accordance with Guillaume's suggestions

* Adding unit tests, small refactoring

* Enforcing pep8 style

* Following Matthias's suggestions

* Fixing bug introduced by variable name change

* Changing the breast_cancer dataset to diabetes, fixing typo with weather dataset, adding creator of weather dataset

* Further changes

* Adding more changes

* Fixing bug

* Pep8 enforce

* few changes

* Fixing typo in dataset name attributes
…545)

* EHN: allow to upload DataFrame and infer dtype and column name

* FIX: check that we raised an error when nominal has mixed type

* DOC: add documentation for the dataframe in the docstring

* FIX: make flake8 stop complaining for top import

* PEP8

* PEP8

* EHN: using pandas inference

* TST: check inference for dataframe

* TST: check bool case and override attributes with dict

* iter

* PEP8

* remove dataset publishing

* DOC: fix docstring numpydoc format

* TST: check that the new attributes is in the uploaded dataset
Fix typos in documentation and example
…et (#586)

* EHN: inferred row_id_attribute from dataframe to create a dataset

* reset the index of dataframe after inference

* TST: check the size of the dataset

* PEP8

* TST: check that an error is raised when row_id_attributes is not a known attribute

* DOC: Update the docstring

* PEP8
* EHN: support SparseDataFrame when creating a dataset

* TST: check attributes inference dtype

* PEP8

* EXA: add sparse dataframe in the example

* Fix typos.

* Fix typo.

* Refactoring task.py (#588)

* [MRG] EHN: inferred row_id_attribute from dataframe to create a dataset (#586)

* EHN: inferred row_id_attribute from dataframe to create a dataset

* reset the index of dataframe after inference

* TST: check the size of the dataset

* PEP8

* TST: check that an error is raised when row_id_attributes is not a known attribute

* DOC: Update the docstring

* PEP8

* add examples to the menu, remove double progress (#554)

* PEP8

* PEP8
* temporary fix for failing unit test

* Changing Jan's suggestion to a run on a binary classification task
* Partial starting fix for single input task, cache dir multiplatform change

* Reduce line size

* changing type to isinstance

* Refactoring the cache directory path to be more general

* Fixing problem with clustering task in accordance with the different tasks implementation

* Fixing flake8 problem, adding unit test for clustering task

* Fixing bug with regression tasks, adding more checks to the get_task unit tests
* DEPR: remove the format parameter from create_dataset

* EHN: check the type of dataframe before the conversion

* TST: remove the format parameter
* reinstantiate flow

* reinstantiate flow fix

* pep8 problems

* pep8 fix
…f each other). Also fixes #564 (#567)

* fixes minor indentation problems

* initial commit

* adds a function to deduce the flow structure

* removes sklearn converter from this PR

* added main functionality

* fix code quality

* adds flow name to setup test file

* adds functionality to return sklearn parameter name into openml flow name

* PEP8 fixes

* changed structure of PR, such that get_structure is not part of flow class.
updated unit tests accordingly

* pep8 fix

* fixes last typo

* flow name doc string

* also added additional filter for task list

* renamed id argument of parameter object (for code quality)

* fix reference to input id

* updated reinitialize model fn

* removed imputer (deprecated)

* fixes PEP8 problems

* pep8

* PEP8

* incorporated changes by Matthias

* fix 604

* bugfix

* flake fix

* import error

* removed sentence

* updated comment
* Subclass all test classes from openml test helper

* FIX inheritance issues

* TST add sentinel to dataset upload

* TEST redirect a few tests to the live server again

* MAINT fix pep8

* Trying simple solution
* added ability to obtain per fold evaluation measures

* added json loads

* updated unit test
* added documentation for running specific tests

* added more info on running specific unit tests

* minor fixes

* Update contributing.rst
* extended check to include missing values

* added more tests

* modularized tests

* extended unit tests

* small fixes

* removed flow check on scikit-learn representation -- bad idea

* exposed sentinel,
incorporated test case according to #602

* work on fixing column transformer bug

* logging output to flow_to_sklearn

* overrides default values in openml flow in case a setup needs to be initialized

* fix unit test

* PEP8

* fix unit tests Python 3.x

* solved unicode issues

* fix 3.5 issue
* MAINT prepare new release

* MAINT update changelog

* MAINT update release notes

* Fix dataframe spelling
* TST add connection retries test-wise

* Improve file style

* MAINT update changelog

* MAINT simplify unit test, change code as requested by Jan

* TST fix python2/3 bug

* please flake
@codecov-io
Copy link

codecov-io commented Feb 14, 2019

Codecov Report

Merging #617 into master will increase coverage by 0.34%.
The diff coverage is 88.88%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #617      +/-   ##
==========================================
+ Coverage   89.41%   89.76%   +0.34%     
==========================================
  Files          32       32              
  Lines        2703     3077     +374     
==========================================
+ Hits         2417     2762     +345     
- Misses        286      315      +29
Impacted Files Coverage Δ
openml/tasks/__init__.py 100% <100%> (ø) ⬆️
openml/setups/setup.py 86.36% <100%> (+0.64%) ⬆️
openml/datasets/__init__.py 100% <100%> (ø) ⬆️
openml/flows/__init__.py 100% <100%> (ø) ⬆️
openml/__version__.py 100% <100%> (ø) ⬆️
openml/tasks/split.py 91.91% <100%> (+1.44%) ⬆️
openml/evaluations/evaluation.py 100% <100%> (ø) ⬆️
openml/utils.py 92.85% <100%> (+1.19%) ⬆️
openml/setups/__init__.py 100% <100%> (ø) ⬆️
openml/runs/__init__.py 100% <100%> (ø) ⬆️
... and 18 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 7a612ea...cefd097. Read the comment docs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.