Skip to content

Conversation

@janvanrijn
Copy link
Member

Reference Issue

What does this PR implement/fix? Explain your changes.

How should this PR be tested?

Any other comments?

glemaitre and others added 12 commits October 23, 2018 09:46
…545)

* EHN: allow to upload DataFrame and infer dtype and column name

* FIX: check that we raised an error when nominal has mixed type

* DOC: add documentation for the dataframe in the docstring

* FIX: make flake8 stop complaining for top import

* PEP8

* PEP8

* EHN: using pandas inference

* TST: check inference for dataframe

* TST: check bool case and override attributes with dict

* iter

* PEP8

* remove dataset publishing

* DOC: fix docstring numpydoc format

* TST: check that the new attributes is in the uploaded dataset
Fix typos in documentation and example
…et (#586)

* EHN: inferred row_id_attribute from dataframe to create a dataset

* reset the index of dataframe after inference

* TST: check the size of the dataset

* PEP8

* TST: check that an error is raised when row_id_attributes is not a known attribute

* DOC: Update the docstring

* PEP8
* EHN: support SparseDataFrame when creating a dataset

* TST: check attributes inference dtype

* PEP8

* EXA: add sparse dataframe in the example

* Fix typos.

* Fix typo.

* Refactoring task.py (#588)

* [MRG] EHN: inferred row_id_attribute from dataframe to create a dataset (#586)

* EHN: inferred row_id_attribute from dataframe to create a dataset

* reset the index of dataframe after inference

* TST: check the size of the dataset

* PEP8

* TST: check that an error is raised when row_id_attributes is not a known attribute

* DOC: Update the docstring

* PEP8

* add examples to the menu, remove double progress (#554)

* PEP8

* PEP8
* temporary fix for failing unit test

* Changing Jan's suggestion to a run on a binary classification task
* Partial starting fix for single input task, cache dir multiplatform change

* Reduce line size

* changing type to isinstance

* Refactoring the cache directory path to be more general

* Fixing problem with clustering task in accordance with the different tasks implementation

* Fixing flake8 problem, adding unit test for clustering task

* Fixing bug with regression tasks, adding more checks to the get_task unit tests
* DEPR: remove the format parameter from create_dataset

* EHN: check the type of dataframe before the conversion

* TST: remove the format parameter
@janvanrijn janvanrijn merged commit 8485090 into add_#564 Dec 7, 2018
@mfeurer mfeurer deleted the fix604 branch December 7, 2018 08:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants