-
Notifications
You must be signed in to change notification settings - Fork 46
Closed
Description
It appears clear that a hostfile is needed for multi-node execution. Although a host file can be automatically generated by PBS systems, and be picked up automatically by commands such as sos execute and sos run, it is necessary to allow this option so that users can specify it manually to allow multi-node execution of workflows and tasks.
This option should work like this:
- Without it, everything is run locally.
- With it, it should be a name to a host file, similar to the
--hostfileoption of SCOOP, with a similar or identical format. The workers will be created on these hosts. - Under a cluster system with appropriate environmental variables, the hostfiles will be picked up automatically, similar to what SCOOP is doing
The problem is that sos run does not support -- options so we will have to reuse an existing option or find another option.
Once this option is specified, users can use
sos run -j hostfile
to run work flow on multiple hosts.
Use
%PBS ...
sos run workflow
to run entire workflow on a cluster system.
The same mechanism will be used for the execution of tasks, something like
%PBS
sos execute task
Metadata
Metadata
Assignees
Labels
No labels