Option to return dataframes for listing functions #662

Neeratyoy · 2019-04-05T18:22:04Z

What does this PR implement/fix? Explain your changes.

Addresses #364.
An optional parameter has been added to the get_X functions called output_format. Which is by default dict so that nothing breaks. When output_format='dataframe' the output is a pandas data frame.

How should this PR be tested?

The following commands should suffice in demonstrating the change:
openml.datasets.list_datasets(size=10)
openml.datasets.list_datasets(size=10, output_format='dataframe')
openml.tasks.list_tasks(task_type_id=1, size = 10)
openml.tasks.list_tasks(task_type_id=1, size = 10, output_format='dataframe')
openml.evaluations.list_evaluations('predictive_accuracy', size = 10)
openml.evaluations.list_evaluations('predictive_accuracy', size = 10, output_format='object')
openml.evaluations.list_evaluations('predictive_accuracy', size = 10, output_format='dataframe')
openml.flows.list_flows(size=10)
openml.flows.list_flows(size=10, output_format='dataframe')
openml.runs.list_runs(size=10)
openml.runs.list_runs(size=10, output_format='dataframe')
openml.setups.list_setups(size=10)
openml.setups.list_setups(size=10, output_format='object')
openml.setups.list_setups(size=10, output_format='dataframe')

Any other comments?

For design parity, all the __list_X functions have been edited to produce the desired output format. In effect, every list function has an added parameter called output_format including the _list_all in utils.py.
The defaults have been kept as 'dict' and 'object' where applicable to ensure nothing breaks.
Examples which had listing as an example, has been updated.
A small unit test is added for all the listing calls that already being tested

codecov-io · 2019-04-30T07:57:07Z

Codecov Report

Merging #662 into develop will decrease coverage by 0.15%.
The diff coverage is 89.38%.

@@             Coverage Diff             @@
##           develop     #662      +/-   ##
===========================================
- Coverage     90.5%   90.35%   -0.16%     
===========================================
  Files           36       36              
  Lines         3730     3785      +55     
===========================================
+ Hits          3376     3420      +44     
- Misses         354      365      +11

Impacted Files	Coverage Δ
openml/utils.py	`91.66% <100%> (+0.39%)`	⬆️
openml/study/functions.py	`88.23% <66.66%> (-1.69%)`	⬇️
openml/evaluations/functions.py	`90.74% <82.35%> (-4.82%)`	⬇️
openml/datasets/functions.py	`95.66% <90%> (-0.28%)`	⬇️
openml/runs/functions.py	`81.48% <90.9%> (-0.03%)`	⬇️
openml/tasks/functions.py	`84.93% <90.9%> (-0.16%)`	⬇️
openml/flows/functions.py	`87.78% <90.9%> (-0.31%)`	⬇️
openml/setups/functions.py	`93.84% <96.87%> (-0.18%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 72f131a...0a947af. Read the comment docs.

Neeratyoy requested a review from mfeurer April 5, 2019 18:22

Neeratyoy changed the title ~~Adding dataframe output option to listing functions~~ Option to return dataframes for listing functions Apr 5, 2019

Neeratyoy marked this pull request as ready for review April 24, 2019 12:08

Neeratyoy added 6 commits April 25, 2019 16:33

Adding dataframe output option to listing functions

b2c43d7

Adding 'object' as new output format for listing

afe46c3

Editing examples for dataframe output option

898c3a2

Implementing coding standards as per suggestions.

c11a780

Adding test cases for listing as dataframe

a5a4b9d

Convert list to List

155bf80

Neeratyoy force-pushed the fix364 branch from 7898547 to 155bf80 Compare April 25, 2019 14:57

Neeratyoy and others added 5 commits April 29, 2019 23:02

Fixing rebase bugs, flake errors and test cases

04165a9

Fixing new unit test for flow

c0c5100

Fixing bug in unit test for flow

131d3d4

Fixing test case bug

58f2003

Merge branch 'develop' into fix364

8bcd3b4

mfeurer added 3 commits April 30, 2019 10:22

Update functions.py

ead6b9a

Update functions.py

21611d0

Update functions.py

0a947af

mfeurer approved these changes Apr 30, 2019

View reviewed changes

mfeurer merged commit 7129cf0 into develop Apr 30, 2019

mfeurer deleted the fix364 branch April 30, 2019 10:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Option to return dataframes for listing functions #662

Option to return dataframes for listing functions #662

Uh oh!

Neeratyoy commented Apr 5, 2019 •

edited

Loading

Uh oh!

codecov-io commented Apr 30, 2019 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Option to return dataframes for listing functions #662

Option to return dataframes for listing functions #662

Uh oh!

Conversation

Neeratyoy commented Apr 5, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR implement/fix? Explain your changes.

How should this PR be tested?

Any other comments?

Uh oh!

codecov-io commented Apr 30, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Neeratyoy commented Apr 5, 2019 •

edited

Loading

codecov-io commented Apr 30, 2019 •

edited

Loading