Skip to content

Conversation

@glemaitre
Copy link
Contributor

Add support for SparseDataFrame when creating a matrix.

@glemaitre
Copy link
Contributor Author

@amueller @mfeurer @janvanrijn

Here come the support for sparse dataframe during dataset creation.

@glemaitre
Copy link
Contributor Author

ping @amueller @mfeurer @janvanrijn any feedback?

@janvanrijn
Copy link
Member

Personally, I am not super familiar with sparse dataframes. I can have a look at your test-cases if you'd like and see whether they cover interesting test-cases.

Copy link
Collaborator

@mfeurer mfeurer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good to me. Could you add this to the example, too?

@codecov-io
Copy link

codecov-io commented Nov 14, 2018

Codecov Report

❗ No coverage uploaded for pull request base (develop@c69b0a6). Click here to learn what that means.
The diff coverage is 100%.

Impacted file tree graph

@@            Coverage Diff             @@
##             develop     #583   +/-   ##
==========================================
  Coverage           ?   89.94%           
==========================================
  Files              ?       32           
  Lines              ?     2995           
  Branches           ?        0           
==========================================
  Hits               ?     2694           
  Misses             ?      301           
  Partials           ?        0
Impacted Files Coverage Δ
openml/datasets/functions.py 92% <100%> (ø)

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update c69b0a6...a9a7e2d. Read the comment docs.

@glemaitre
Copy link
Contributor Author

@mfeurer I added a new section in the tutorial

Copy link
Collaborator

@mfeurer mfeurer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot for adding the section to the documentation. I think this can be merged if

  1. you resolve the conflict
  2. you fix the flake8 error

PGijsbers and others added 6 commits November 16, 2018 15:00
…et (openml#586)

* EHN: inferred row_id_attribute from dataframe to create a dataset

* reset the index of dataframe after inference

* TST: check the size of the dataset

* PEP8

* TST: check that an error is raised when row_id_attributes is not a known attribute

* DOC: Update the docstring

* PEP8
@glemaitre
Copy link
Contributor Author

Done!

@mfeurer mfeurer merged commit b9035c4 into openml:develop Nov 16, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants