fix: Close fds in child processes by MinetaS · Pull Request #914 · Disservin/fastchess

MinetaS · 2025-09-16T16:56:47Z

No description provided.

MinetaS · 2025-09-16T17:07:18Z

Since we dup pipe fds to stdio fds with setup_spawn_file_actions, all pipe fds should be closed after dup otherwise they are leaked.

"self-pipe trick" thing is removed here; I honestly don't see how engines are supposed to use those pipes for signal processing. Can you explain about its use case, I'm curious about it.

EDIT 1:
Now setup_close_file_actions is redundant as O_CLOEXEC flag does clean up fds in child processes during exec. But I'm not that familiar with POSIX libraries so I will leave it to you to arrange.

Disservin · 2025-09-17T09:00:44Z

"self-pipe trick" thing is removed here; I honestly don't see how engines are supposed to use those pipes for signal processing. Can you explain about its use case, I'm curious about it.

Okay so the current implementation uses a poll + read, the timeout on the poll is the remaining thinking time of the engine, meaning if poll times out, the engines time ran out and we return, otherwise we know there is data to be read meaning we can read from the pipe.
If a user specified a long tc like 1h, engines that are about to play the first move get a lot of time, if during that time fastchess receives CTRL + C, the poll will wait for a very long time making the sigint seem unresponsive. In this case the self pipe trick is used, where we write some dummy data so that poll gets notified about the data and we break out of the otherwise blocking poll call.

Meaning in your implementation CTRL + C sent to fastchess will take longer, growing with the engines TC. This is not ideal.

Does your change actually lead to child processes being killed when fastchess receives SIGKILL, I assume so right ?

Maybe we don't need the self pipe trick and instead poll every few cycles (TBD) and implement a timeout in some other cheap way.

Disservin · 2025-09-17T09:00:57Z

didn't mean to close

MinetaS · 2025-09-17T11:46:55Z

Okay so the current implementation uses a poll + read, the timeout on the poll is the remaining thinking time of the engine, meaning if poll times out, the engines time ran out and we return, otherwise we know there is data to be read meaning we can read from the pipe.
If a user specified a long tc like 1h, engines that are about to play the first move get a lot of time, if during that time fastchess receives CTRL + C, the poll will wait for a very long time making the sigint seem unresponsive. In this case the self pipe trick is used, where we write some dummy data so that poll gets notified about the data and we break out of the otherwise blocking poll call.

Thank you for the explanation. Although I still think the comment is somewhat misplaced, because both write/read a NULL byte is done in fastchess (if this is what you mean by "write some dummy data") and completely unrelated to child processes' file descriptors.

Meaning in your implementation CTRL + C sent to fastchess will take longer, growing with the engines TC. This is not ideal.

If I understand correctly (as above), my implementation will never affect to how fastchess manages Ctrl+C signals.

Does your change actually lead to child processes being killed when fastchess receives SIGKILL, I assume so right ?

Yes. But I'd like to reiterate that child processes are not "killed". They terminate once fastchess is killed, because the pipes are closed and getline fails because they receive EOF. The reason why they are not terminated in the current version is that there are open file descriptors inherited from fastchess, and among them especially out_pipe_ is still connected to stdin of stockfish and getline is blocked forever.

https://github.com/official-stockfish/Stockfish/blob/fc54d8730174cdb5cfc4f7074b90128e706e4040/src/uci.cpp#L94-L98

    do
    {
        if (cli.argc == 1
            && !getline(std::cin, cmd))  // Wait for an input or an end-of-file (EOF) indication
            cmd = "quit";

Disservin · 2025-09-17T12:01:29Z

oh i think i might have misunderstood the actual behavior.. the comment (// keep open for self to pipe trick) was there because I thought it stops working when there's a close but it seems this is not the case and O_CLOEXEC seems to be a good choice here too, didn't have this in mind

MinetaS · 2025-09-17T12:08:44Z

Oh macOS doesn't have pipe2... need to fix that. But I don't own any macOS environments so cannot test it myself.

Disservin · 2025-09-17T12:12:19Z

you can probably use the fcntl way instead everywhere instead of relying on pipe2

MinetaS · 2025-09-17T12:22:03Z

Should be fixed by now.
Also removed setup_close_file_actions as it doesn't work and is unnecessary when there are multiple child processes.

Disservin · 2025-09-17T14:52:48Z

Haven't tested it but I think this looks good

Disservin · 2025-09-21T09:43:52Z

thanks again got around to finally test this and it works splendid

Disservin · 2025-09-23T17:28:54Z

okay just for documentation purposes..
I tested the KILL behavior on Windwos too and it looks like there it already works as expected, at least I see no stockfish processes lying around after taskkill /F /IM fastchess.exe

Disservin closed this Sep 17, 2025

Disservin reopened this Sep 17, 2025

fix: Close fds in child processes

506eaa7

MinetaS force-pushed the fix-fd-leak branch from 7ad7274 to 506eaa7 Compare September 17, 2025 12:19

Disservin merged commit 3c33636 into Disservin:master Sep 17, 2025
18 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Close fds in child processes#914

fix: Close fds in child processes#914
Disservin merged 1 commit intoDisservin:masterfrom
MinetaS:fix-fd-leak

MinetaS commented Sep 16, 2025

Uh oh!

MinetaS commented Sep 16, 2025 •

edited

Loading

Uh oh!

Disservin commented Sep 17, 2025

Uh oh!

Disservin commented Sep 17, 2025

Uh oh!

MinetaS commented Sep 17, 2025 •

edited

Loading

Uh oh!

Disservin commented Sep 17, 2025

Uh oh!

MinetaS commented Sep 17, 2025

Uh oh!

Disservin commented Sep 17, 2025

Uh oh!

MinetaS commented Sep 17, 2025

Uh oh!

Uh oh!

Disservin commented Sep 17, 2025

Uh oh!

Disservin commented Sep 21, 2025

Uh oh!

Disservin commented Sep 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

MinetaS commented Sep 16, 2025

Uh oh!

MinetaS commented Sep 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Disservin commented Sep 17, 2025

Uh oh!

Disservin commented Sep 17, 2025

Uh oh!

MinetaS commented Sep 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Disservin commented Sep 17, 2025

Uh oh!

MinetaS commented Sep 17, 2025

Uh oh!

Disservin commented Sep 17, 2025

Uh oh!

MinetaS commented Sep 17, 2025

Uh oh!

Uh oh!

Disservin commented Sep 17, 2025

Uh oh!

Disservin commented Sep 21, 2025

Uh oh!

Disservin commented Sep 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

MinetaS commented Sep 16, 2025 •

edited

Loading

MinetaS commented Sep 17, 2025 •

edited

Loading