fsl.check_first(): Utilise PID by Lestropie · Pull Request #2597 · MRtrix3/mrtrix3

Lestropie · 2023-03-07T23:06:18Z

Attempt at resolving #2595.

If possible, it will take the PID reported on stdout and wait for its completion.
Looking at the run_first_all code, I think that fsl_sub is used in such a way that each spawned processes inherits the previously executed processes as its children, as it's the PID of the last executed job that is sent to stdout.

This does as intended on my own system, in that the PID reported by run_first_all no longer exists and so the function can proceed as normal. However it really requires testing on a system with SGE configured. @glasserm is this something you're in a position to do easily?

Also, I've left the path.wait_for() call in place where SGE is easily detected just as a failsafe; I don't expect that code to actually be used anymore since it will only execute if the PID can't be queried.

As per discussion in #2595. Function will utilise the PID reported by the run_first_all script, as this seems to be intended for use in determining when all processing tasks have been completed.

Lestropie · 2023-03-08T00:13:38Z

Note that this solution uses the psutil module, which is seemingly not a "default" and needs to be explicitly installed in some environments. This would need to be addressed not only for CI, but also in package generation, containers, and installation-from-source documentation. It may not be an absolute requisite (I can have the code skip those steps if the module is not available), but we should try to propagate the dependency nevertheless.

glasserm · 2023-03-08T02:17:30Z

Happy to help if I can easily. Currently I have this as a conda install, but I am pretty illiterate with conda and python. What is the quickest way to get my current setup in a state to test this patch?

Lestropie · 2023-03-08T03:39:50Z

Because it's not part of a tagged release, you'd need to clone the feature branch called check_first and then build from source.

glasserm · 2023-03-08T04:11:27Z

I'll see if I can get to this this weekend...

glasserm · 2023-03-20T02:51:03Z

Sorry for the delay. I realized you just changed python files so I applied the patch manually. Unfortunately it doesn't work. SGE does not spit out a process ID, rather it spits out a job ID. You could check to see if it is still in the queue list like this qstat | grep jobID. If it returns the job ID, it hasn't finished running, if it returns nothing, it has.

Lestropie · 2023-03-21T06:19:00Z

Ah OK, it has to be queried much like a SLURM job.
I have no experience with SGE and no ability to test, so I'm taking shots a little blindly here.

Unfortunately there doesn't seem to be any established / mature Python package for querying SGE jobs. So I would indeed be forced to subprocess regular qstat calls. I don't think that just querying the presence of the qstat command in PATH as a proxy for potential use of SGE works.

Alternatively I could change strategy entirely, and directly access & query the various FIRST log files, looking for any errors. I didn't want to go this way as it could theoretically change easily between FSL versions, and would be entirely inapplicable to any other context (as opposed to path.wait_for()). But it might turn out to be simpler...

glasserm · 2023-03-21T12:22:33Z

Could you just fsl_sub -j something and then check for it to finish? Perhaps that would be agnostic to SGE vs SLURM (with fsl_sub handling the queuing system). Your fsl_sub -j command would not run until all of the FIRST jobs had completed (or failed) telling you when it is time to check the outputs.

If possible, use fsl_sub command to halt execution until all jobs have completed. Result of discussion in #2597. Addresses #2595.

If possible, use fsl_sub command to halt execution until all jobs have completed. Result of discussion in #2597. Addresses #2595. Replicates some contents of bd3f19e.

Lestropie · 2023-04-09T06:15:01Z

Closed in favour of #2609.

fsl.check_first(): Utilise PID

bd3f19e

As per discussion in #2595. Function will utilise the PID reported by the run_first_all script, as this seems to be intended for use in determining when all processing tasks have been completed.

Lestropie added bug scripts test wanted labels Mar 7, 2023

Lestropie self-assigned this Mar 7, 2023

fsl.check_first(): Do not use psutil if not available

9677561

Lestropie mentioned this pull request Mar 21, 2023

path.wait_for(): Fix symbol aliasing #2607

Closed

Lestropie added a commit that referenced this pull request Mar 22, 2023

fsl.check_first(): Second attempt at refinement

4d8f158

If possible, use fsl_sub command to halt execution until all jobs have completed. Result of discussion in #2597. Addresses #2595.

Lestropie mentioned this pull request Mar 22, 2023

fsl.check_first(): Second attempt at refinement #2609

Merged

Lestropie closed this Apr 9, 2023

Lestropie deleted the check_first branch April 9, 2023 06:15

Lestropie restored the check_first branch August 26, 2025 07:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fsl.check_first(): Utilise PID#2597

fsl.check_first(): Utilise PID#2597
Lestropie wants to merge 2 commits intomasterfrom
check_first

Lestropie commented Mar 7, 2023

Uh oh!

Lestropie commented Mar 8, 2023

Uh oh!

glasserm commented Mar 8, 2023

Uh oh!

Lestropie commented Mar 8, 2023

Uh oh!

glasserm commented Mar 8, 2023

Uh oh!

glasserm commented Mar 20, 2023 •

edited

Loading

Uh oh!

Lestropie commented Mar 21, 2023

Uh oh!

glasserm commented Mar 21, 2023

Uh oh!

Lestropie commented Apr 9, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Lestropie commented Mar 7, 2023

Uh oh!

Lestropie commented Mar 8, 2023

Uh oh!

glasserm commented Mar 8, 2023

Uh oh!

Lestropie commented Mar 8, 2023

Uh oh!

glasserm commented Mar 8, 2023

Uh oh!

glasserm commented Mar 20, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Lestropie commented Mar 21, 2023

Uh oh!

glasserm commented Mar 21, 2023

Uh oh!

Lestropie commented Apr 9, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

glasserm commented Mar 20, 2023 •

edited

Loading