[WIP] MPI update by quaquel · Pull Request #328 · quaquel/EMAworkbench

quaquel · 2023-12-08T15:18:11Z

This pull request makes the MPIEvaluator feature complete. It adds

- centralized logging, analogous to the MultiprocessingEvaluator and the IpyparallelEvaluator
- handling of WorkingDirectoryModels
- a fix for the use of initializer functions for worker processes
- support for chunksize as a kwarg on perform_experiments
- various performance enhancements
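To make the role of chunksize concrete: a chunk size controls how many experiments are handed to a worker per task, which trades scheduling overhead against load balance. The following is a minimal standard-library sketch of that batching idea only. The helper name `chunked` is hypothetical, and this is not the workbench's actual implementation.

```python
from itertools import islice


def chunked(experiments, chunksize):
    """Yield successive lists of at most `chunksize` experiments.

    Hypothetical helper for illustration; not part of the EMA workbench.
    """
    if chunksize < 1:
        raise ValueError("chunksize must be at least 1")
    iterator = iter(experiments)
    while True:
        chunk = list(islice(iterator, chunksize))
        if not chunk:
            return
        yield chunk


# Illustrative input: 10 experiments dispatched in batches of 4.
batches = list(chunked(range(10), 4))
```

Larger chunks mean fewer messages between the coordinating process and the workers, while smaller chunks spread uneven run times more evenly across workers.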

Some more thoughts

- The code has been tested on DelftBlue already using the new example_mpi_lake_model and associated slurm file.
- DelftBlue sometimes seems to fail to spawn all workers. This results in a deadlock for the logging because creating an MPI intercommunicator is a blocking operation. The MPI intercommunicator that is created is only used to centralize logging. There are two possible fixes: have a timeout and fail the process, or make centralized logging a user-specifiable choice.
- The handling of WorkingDirectoryModels is implemented using the same structure as the MultiprocessingEvaluator but still needs to be tested.
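The two fixes proposed for the logging deadlock can be sketched generically with the standard library. This is only an illustration under stated assumptions: `run_with_timeout`, `setup_logging`, and the `connect` callable are hypothetical names standing in for whatever blocking call creates the intercommunicator, and no real MPI call is made. Note also that the stuck thread is abandoned rather than cancelled, so a real MPI program would probably still have to abort the job after the timeout.

```python
import threading


def run_with_timeout(blocking_call, timeout):
    """Run `blocking_call` in a daemon thread and wait at most `timeout` seconds.

    Raises TimeoutError if the call has not finished in time.
    """
    outcome = {}

    def target():
        try:
            outcome["value"] = blocking_call()
        except BaseException as exc:  # pass errors back to the caller
            outcome["error"] = exc

    worker = threading.Thread(target=target, daemon=True)
    worker.start()
    worker.join(timeout)
    if worker.is_alive():
        # The blocking call is still running; give up instead of deadlocking.
        raise TimeoutError(f"operation did not complete within {timeout} s")
    if "error" in outcome:
        raise outcome["error"]
    return outcome["value"]


def setup_logging(connect, centralized=True, timeout=30.0):
    """Return the logging connection, or None if centralized logging is off.

    `connect` is a hypothetical stand-in for the blocking setup call.
    """
    if not centralized:
        return None
    return run_with_timeout(connect, timeout)
```

With `centralized=False` the blocking call is never attempted, which corresponds to the second fix; otherwise a hang turns into a `TimeoutError` that the caller can use to fail the process.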

In preparation for future updates to the MPI code, all evaluator code has been reorganized. Each evaluator goes into its own module, and code shared among two or more evaluators goes into a new util package. All relevant modules have also been renamed to futures_{style of parallelization}.

review-notebook-app · 2023-12-08T15:18:16Z

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

coveralls · 2023-12-08T15:21:29Z

Changes unknown when pulling 7482825 on mpi_update into master.

EwoutH · 2023-12-08T15:36:55Z

Looks like an impressive effort. If you want me to review at some point, let me know!

quaquel · 2023-12-08T15:55:08Z

No rush. This is just in preparation for our meeting next week. I hope to have tested WorkingDirectory models by then as well.

Most of the commits were just stupid tests on DelftBlue. I learned a lot about MPICH and OpenMPI as well as about how slurm interacts with MPI.

EwoutH · 2023-12-08T16:02:44Z

> Most of the commits were just stupid tests on DelftBlue. I learned a lot about MPICH and OpenMPI as well as about how slurm interacts with MPI.

That really describes my experience in Q1! There are so many hidden assumptions everywhere. So much trial and error.

Even for me, and I really like trial and error.

EwoutH

TODO: Test working directories

quaquel added 30 commits November 20, 2023 20:38

change to log message and log level in feature scoring

1739302

fixes #261 columns with the same value for all entries don't matter for feature scoring, so they can be ignored.

adds a simple example using the MPI evaluator

d22414a

first step for making MPI work with WorkingDirectoryModels

be58e00

bunch of things happening at the same time * code reorganization, moving mpi-specific code into a separate module * Return of initializer * update to logging (hacky) to include rank information in all log messages

Update example_mpi_lake_model.py

d088b48

ongoing work

506999b

fixes location of mpi tests

2681924

some mocking fixes

f69cf80

Update futures_mpi.py

ff6d6d6

Update futures_multiprocessing.py

4fede84

attemp for doc fix

510c4fd

and one more

7e19066

another attempt

222d4f5

Update example_mpi_lake_model.py

278b684

Merge branch 'reorganization' into mpi_update

284ecd6

cleanup and merging reorganization into mpi_update

ca0d66a

logging works

0d144d9

unit testing for mpi logging

d87f3f0

Update ci.yml

5fd2214

Update futures_util.py

4953b03

unit test for intializer

59d8c6c

more unit tests

e7b1b30

more unit tests

39b004a

backport from mpi_update

04d02d8

Merge branch 'reorganization' into mpi_update

9f26e71

Update test_futures_mpi.py

88c1318

Merge branch 'master' into mpi_update

d365ab0

Update __init__.py

7ec29fc

attempted import error fix

start of testing in delfblue

d5b6158

another attempted fix

042667d

quaquel added 12 commits December 6, 2023 19:32

temporary disabling loggin

d340cf8

more delftblue testing

def91f5

more testing

68c8a0b

Update futures_mpi.py

0d6017e

reenable logging

94b03e4

Update futures_mpi.py

d9b9594

another attempt

d07571e

Update futures_mpi.py

f233075

Update futures_mpi.py

f60973d

Update futures_mpi.py

d0d43fa

updated mpi tutorial

41839bf

make it possible to control chunksize

a185fd0

quaquel added this to the 2.5.0 milestone Dec 8, 2023

quaquel added enhancement performance labels Dec 8, 2023

quaquel and others added 2 commits December 8, 2023 16:53

Merge branch 'master' into mpi_update

bb7f245

[pre-commit.ci] auto fixes from pre-commit.com hooks

9959e87

for more information, see https://pre-commit.ci

quaquel added 2 commits December 8, 2023 16:57

Update test_futures_mpi.py

a690592

Merge branch 'mpi_update' of https://github.com/quaquel/EMAworkbench …

8633407

…into mpi_update

test work again

7482825

EwoutH marked this pull request as draft December 13, 2023 20:56

quaquel marked this pull request as ready for review December 20, 2023 12:29

EwoutH approved these changes Dec 20, 2023

View reviewed changes

quaquel merged commit a968cc8 into master Dec 20, 2023

quaquel deleted the mpi_update branch December 20, 2023 12:32

quaquel merged 47 commits into master from mpi_update

3 participants
