Configuration ci #1049

PGijsbers · 2021-04-08T09:34:22Z

Adding a CLI which can be used to configure the fields in the configuration file.
This is a draft PR because it's feature incomplete (some fields are missing), but I wanted some input to see if this is an acceptable way to do it.

Installing openml with pip install openml will also add a cli script openml. It'll expect something like openml ROUTINE ROUTINE_ARGS, the current routine is only configure, but I figured some day we might use it for e.g. dataset download.
The signature of openml configure looks like this:

usage: openml configure [-h] [{apikey,server,cachedir,avoid_duplicate_runs,connection_n_retries,max_retries,all,none}]

Set or read variables in your configuration file. For more help also see 'https://openml.github.io/openml-
python/master/usage.html#configuration'.

positional arguments:
  {apikey,server,cachedir,avoid_duplicate_runs,connection_n_retries,max_retries,all,none}
                        The field you wish to edit.Choosing 'all' lets you configure all fields one by one.Choosing
                        'none' will print out the current configuration.

optional arguments:
  -h, --help            show this help message and exit

Video of current functionality.
~~I'm also planning for a direct option that avoids the explanation/interactive routine (e.g. openml config apikey 1234567890abcdef1234567890abcdef).~~
Update: It's also possible to use a non-interactive direct command, e.g.: openml configure apikey 1234567890abcdef1234567890abcdef.

- Add function to set any field in the configuration file - Add function to read out the configuration file - Towards full configurability from CLI

Autocomplete seems to be incompatible with `choices`, so I'll ignore that for now. We also use `config._defaults` instead of an explicit list to avoid duplication.

PGijsbers · 2021-04-08T09:46:00Z

I do notice some downsides to using the configparser, e.g. you can't see which values are explicitly set or which are default (if the explicitly set value matches the default value). It also seems to complicate the code (e.g. having to create a virtual file with FAKE_SECTION). Considering our configurations haven't gotten any more complicated for years, I think we should be safe to assume the simple field = value format will last us a while longer, and it might be more easily readable to just write a custom parser for it. It should not take more than a few lines of codes, e.g.

set_fields ={}
with open(file, 'r') as fh:
    for line in fh.readlines():
        field, value = line.split('=', 1)
        set_fields[field.strip()] = value.strip()
config = {**_defaults, **set_fields}

Not entirely sure if I want to do that right now (I'd like this CLI done before my leave next week, and while simple I feel it should have extensive testing), but do you think we should consider that?

With the `openml configure FIELD VALUE` command.

Otherwise you have to duplicate all checks in the error message function.

Max_retries is excluded because it should not be user configurable, and will most likely be removed. Verbosity is configurable but is currently not actually used.

And extend it to the bool inputs.

PGijsbers · 2021-04-09T13:50:37Z

Remarks while working on this:

max_retries is contained in _defaults but seems like it should be set by developers only. After discussing we probably want to avoid having such a number at all as long as we employ a decent retry policy (that waits increasingly long times).
the configparser note from above.
verbosity is currently not used in openml-python, I think this is a bug introduced by Rework local openml directory #987.

* Add a non-functional entry point * Allow setting of API key through CLI - Add function to set any field in the configuration file - Add function to read out the configuration file - Towards full configurability from CLI * Remove autocomplete promise, use _defaults Autocomplete seems to be incompatible with `choices`, so I'll ignore that for now. We also use `config._defaults` instead of an explicit list to avoid duplication. * Add server configuration * Allow fields to be set directly non-interactively With the `openml configure FIELD VALUE` command. * Combine error and check functionalities Otherwise you have to duplicate all checks in the error message function. * Share logic about setting/collecting the value * Complete CLI for other fields. Max_retries is excluded because it should not be user configurable, and will most likely be removed. Verbosity is configurable but is currently not actually used. * Bring back sanitizing user input And extend it to the bool inputs. * Add small bit of info about the command line tool * Add API key configuration note in the introduction * Add to progress log * Refactor flow of wait_until_valid_input

PGijsbers added 4 commits April 6, 2021 15:18

Add a non-functional entry point

436b314

Allow setting of API key through CLI

0f812f2

- Add function to set any field in the configuration file - Add function to read out the configuration file - Towards full configurability from CLI

Remove autocomplete promise, use _defaults

ef8434b

Autocomplete seems to be incompatible with `choices`, so I'll ignore that for now. We also use `config._defaults` instead of an explicit list to avoid duplication.

Add server configuration

a64b3e9

PGijsbers requested a review from mfeurer April 8, 2021 09:34

PGijsbers added 9 commits April 9, 2021 10:23

Allow fields to be set directly non-interactively

2ff55d9

With the `openml configure FIELD VALUE` command.

Combine error and check functionalities

8c00fc0

Otherwise you have to duplicate all checks in the error message function.

Share logic about setting/collecting the value

a41c41f

Complete CLI for other fields.

7703d3b

Max_retries is excluded because it should not be user configurable, and will most likely be removed. Verbosity is configurable but is currently not actually used.

Bring back sanitizing user input

75e3783

And extend it to the bool inputs.

Add small bit of info about the command line tool

a41b144

Add API key configuration note in the introduction

fb2c56b

Merge branch 'develop' into configuration_ci

cd36117

Add to progress log

149065d

PGijsbers marked this pull request as ready for review April 9, 2021 13:40

Refactor flow of wait_until_valid_input

935ee25

mfeurer approved these changes Apr 9, 2021

View reviewed changes

Merge branch 'develop' into configuration_ci

8560d8b

PGijsbers merged commit dafe5ac into develop Apr 20, 2021

PGijsbers deleted the configuration_ci branch April 20, 2021 07:49

github-actions bot pushed a commit that referenced this pull request Apr 20, 2021

PGijsbers: Configuration ci (#1049)

6ba4eaf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Configuration ci #1049

Configuration ci #1049

Uh oh!

PGijsbers commented Apr 8, 2021 •

edited

Loading

Uh oh!

PGijsbers commented Apr 8, 2021 •

edited

Loading

Uh oh!

PGijsbers commented Apr 9, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Configuration ci #1049

Configuration ci #1049

Uh oh!

Conversation

PGijsbers commented Apr 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PGijsbers commented Apr 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PGijsbers commented Apr 9, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

PGijsbers commented Apr 8, 2021 •

edited

Loading

PGijsbers commented Apr 8, 2021 •

edited

Loading