Check Settings, Spec, and Coefficients Files #1

andkay · 2025-03-20T22:15:30Z

This PR will address #784.

The proposed approach includes some relatively simple "smoke test" of YAML, SPEC, and coefficients config files:

The component settings are first loaded into the relevant Pydantic data model.
Then, the coefficients file is then read into memory, if defined at the top level of the data model.
Then, the SPEC file is read into memory, similarly, if defined at the top level of the data model.
Finally, one of the two available methods (simulate.eval_coefficients or simulate.eval_nest_coefficients) is run.

As scoped, validating expressions is not included, and the code isn't worried about tables at all.

Currently, there is a small subset of models that get tested, which include simpler configs, one with a preprocessor, and one with nested coefficients. Some basic logging is included. There may be more cases that need to be solved, but I did want to get some feedback / validation before pressing onward. I'll highlight some of my own thoughts below, but also want to get some eyes on this to make sure the contribution meets the issue-brief and will be useful.

Autodiscovery of Model Settings YAML Files
Currently, there is a dict that maps the names of component models to a Pydantic model class, as well as the relevant model file name. This will be pretty cumbersome to maintain.

A better way forward would be identify the default arguments to model_settings (Pydantic) and model_settings_file_name (YAML file) from the signature of each step. I'm not totally sure how to extract this information from the step wrapper around the callable.

Error Handling
An important design decision (which is still TBD) will be if any custom errors or warnings are raised by the settings_checker module (currently, this is not included. In my thinking, the main point of the settings checker is to surface any exceptions or warnings that would otherwise be raised at runtime earlier in the run process -- not necessarily to introduce any new error handling. If that is a decent assumption, responsibility for raising problems should be delegated upstream to the underlying Pydantic models and ActivitySim methods.

Nested Coefficients
Currently, I am using simulate.eval_nested_coefficients to evaluate models with the NESTS property. So far, this seems to behave as I expect -- but I am a little wary that it is misguided. With the settings checker turned off, I have not been been able to identify anything in the prototype_mtc model that actually calls this function.

Calling the checker
The checker is currently being called from cli.run for convenience of development. But I suspect that it needs to be migrated (or added) to the State.run method.

…(accessibility)

… add second model to validate

… settings to independent function

…nction called from main checker. run formatter

jpn-- · 2025-03-24T19:09:28Z

activitysim/abm/models/settings_checker.py

+
+def check_model_settings(state: State) -> None:
+
+    components = state.settings.models  # _RUNNABLE_STEPS.keys() may be better?


state.settings.models is probably the right thing here not _RUNNABLE_STEPS, as the latter includes every step that might be run, but the former includes only that which we want to run right now. It's totally legit to have incomplete/invalid settings for steps we are not trying to run right now.

jpn-- · 2025-03-24T19:11:34Z

activitysim/abm/models/settings_checker.py

+logger = logging.getLogger(__name__)
+file_logger = logger.getChild("logfile")
+
+COMPONENTS_TO_SETTINGS = {


This is probably a necessary evil for now.

…n empty dataframe, even if no top level spec exists

…or disaggregate accessibility (needs more testing)

This reverts commit be5d093.

…main model spec

…in model settings

missing values. because the fields in the settings classes are inherited, it is not usually possible for the settings checker to determine if a path to an external file is actually required for a particular model component. the workaround is to issue a specific warning that a path *may* be expected and users should check the YAML file. the ultimate solution would be to define more robust Pydantic data models that ensure that fields are marked as required when appropriate.

…s to checker entrypoint as argument.

…ettings as dict.

…ot loadable

* initial batch of models with pre and post processing added * nonmand sched and transit pass models * first pass at preprocessing and annotate functionality in all models * fixing bugs in jtf and trip purpose * adding persons back in to locals_d in jtc * model name missing in tour scheduling * missing expressions import in tour sched prob * ci unit tests & fixing estimation test error * addressing review comments --------- Co-authored-by: Ali Etezady <[email protected]>

* first uv sync * explicitly set np.int64 dtype before calling pd set_index() Pandas 2 got rid of the `Int64Index` class, it was downcasting to int32 for the input land use table * Add all dependencies from Conda environments Consider removing unused ones at a later time * Remove orca This dependency was replaced with “state” for workflow orchestration. * Update release instructions for `uv` * Remove `--no-default-groups` from `uv sync` instructions * Trial updating Github Action to uv instead of conda * Bug fixes * Try downgrading sharrow * change core test to windows runner * use win for all core tests * index type mismatch * formatting * unlock sharrow to 2.14 * correct a typo * Update Model Setup page in user guide * Update Ways to Run the Model page * Remove duplicates * Simplify pyproject.toml for github action and use only this group * Update remainder of user guide docs * Update dev guide [makedocs] * Update lockfile from changes to dependencies * Remove conda from other github actions * Debug doc building [makedocs] * Update install instructions [makedocs] * update installation instructions [makedocs] * notes on uv options [makedocs] * address review comment --------- Co-authored-by: Josie Kressner <[email protected]>

into settings-checker

* identify and flag naming conflicts in 2d vs 3d skims * add skims doc * run on workflow_dispatch * add link to docs * write error message in exception also * add error message example * code block formatting * unit tests

…g.yaml from MWCOG example model

* update cat columns in df when new cat exists * blacken * sort two categoricals before union

* minor fixes for SimOR development * blacken * Using PNUM if available in JTFC Required to maintain backwards compatibility in tour indexes in SANDAG models --------- Co-authored-by: Jeffrey Newman <[email protected]>

…activitysim version to logfile (ActivitySim#963) * adding activitysim version to log file * fix duplicate columns in parking location choice as described in ActivitySim#633 * making tour mode choice logsum optional in destionation choice models ActivitySim#847 * logsum settings in location choice, trip dest, tour_od * optional logsums in trip destination * blacken --------- Co-authored-by: Ali Etezady <[email protected]>

andkay · 2025-09-04T23:01:03Z

Closed by ActivitySim#950

andkay added 13 commits March 19, 2025 10:03

feat: adds initial setting_checker module to abm/models

48b7696

chore: add .vscode settings to gitignore

92fd304

feat: adds first test case for prepopulating settings pydantic model …

c7a1ca3

…(accessibility)

feat: adds initial spec checker - wip

fb7d6f1

feat: update settings checker to load spec and evaluate coefficients.…

9053107

… add second model to validate

fix: adds missing arg to eval_coefficients

404190d

feat: adds atworksubtour_frequency to settings checker

2bda8ac

feat: moves loading spec and coefficients to independent functions

0e88b9e

refactor: renames main settings checker function. moves loading model…

fe65f3a

… settings to independent function

refactor: moves load, spec, and coef eval checks to an independent fu…

7333c0b

…nction called from main checker. run formatter

feat: adds load SPEC check for preprocessor settings

138b2a4

chore: run formatter

637e98d

feat: initial checking for model setting with nested coefficients

19af41f

andkay requested review from jpn-- and seanmcatee March 20, 2025 22:16

jpn-- reviewed Mar 24, 2025

View reviewed changes

andkay and others added 14 commits March 30, 2025 15:42

feat: adds additional model settings. force try_load_spec to return a…

f78723c

…n empty dataframe, even if no top level spec exists

feat: adds additional model settings to check. adds special handler f…

47da4b3

…or disaggregate accessibility (needs more testing)

fixing disaggregate accessibility bug in zone sampler

be5d093

Revert "fixing disaggregate accessibility bug in zone sampler"

de6d0ef

This reverts commit be5d093.

feat: adds settings check for initalize_landuse

625908d

refactor: reorders checking for joint tour composition settings

269301e

feat: adds setting check for joint tour destination

e14cc9c

feat: adds settings check for joint tour frequency composition

b1ca08a

feat: adds settings check for joint tour frequency

c724fd5

feat: adds settings check for joint tour participation

172e5be

feat: adds setting check for joint tour scheduling

3e4779b

feat: adds settings checks for workplace location and school location

9dd5073

feat: adds setting check for mandatory tour frequency

8f7b84f

feat: adds settings check for non mandatory tour destination

a25706c

andkay added 9 commits June 7, 2025 18:01

feat: check settings for write_data_dictionary

e7d7b8d

feat: allow for detailed checks for SPEC_SEGMENTS with PTYPE against …

7b1c948

…main model spec

feat: allow for arbitrary loading of spec files from any subsettings …

605c238

…in model settings

feat: allow custom checks for unusual spec/coefficient pairs

6c8ba86

feat: adds custom SettingCheckerError exception for improved logging

471faf7

chore: fix spelling.

003071b

refactor: renames COMPONENTS_TO_SETTINGS -> CHECKER_SETTINGS and feed…

3f386ab

…s to checker entrypoint as argument.

refactor: single entry for settings checker. import extension check s…

351836f

…ettings as dict.

andkay mentioned this pull request Jun 10, 2025

Check YAML, Spec, Coefficients Settings Before Model Runs ActivitySim/activitysim#950

Merged

andkay and others added 18 commits June 9, 2025 23:24

chore: run black formatter

f7de6c2

refactor: remove commented code

dbc2abf

fix: skip errors for write_data_dictionary if optional yaml file is n…

66797a8

…ot loadable

chore: run black formatter

866a114

Merge branch 'main' into settings-checker

e3daa8f

fix(tests): remove unused COEFFICIENTS from MWCOG test model

95e33f4

fix(tests): re-add deleted test_mtc.py test script

2fa9636

Merge branch 'settings-checker' of https://github.com/camsys/activitysim

031b98f

into settings-checker

fix: make model registry for checking a copy

6c8063e

Skim name conflicts (ActivitySim#939)

1027026

* identify and flag naming conflicts in 2d vs 3d skims * add skims doc * run on workflow_dispatch * add link to docs * write error message in exception also * add error message example * code block formatting * unit tests

fix(test): remove unused general SPEC in non_mandatory_tour_schedulin…

e5316b5

…g.yaml from MWCOG example model

Pandas 2 assign_in_place() fix with categoricals (ActivitySim#948)

58dc347

* update cat columns in df when new cat exists * blacken * sort two categoricals before union

Merge branch 'main' into settings-checker

ce0470b

Merge branch 'main' into settings-checker

c306a06

andkay closed this Sep 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Check Settings, Spec, and Coefficients Files #1

Check Settings, Spec, and Coefficients Files #1

Uh oh!

andkay commented Mar 20, 2025

Uh oh!

jpn-- Mar 24, 2025

Uh oh!

jpn-- Mar 24, 2025

Uh oh!

andkay commented Sep 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants


		def check_model_settings(state: State) -> None:

		components = state.settings.models # _RUNNABLE_STEPS.keys() may be better?

Check Settings, Spec, and Coefficients Files #1

Check Settings, Spec, and Coefficients Files #1

Uh oh!

Conversation

andkay commented Mar 20, 2025

Uh oh!

jpn-- Mar 24, 2025

Choose a reason for hiding this comment

Uh oh!

jpn-- Mar 24, 2025

Choose a reason for hiding this comment

Uh oh!

andkay commented Sep 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants