ci/eval: add darwin support by winterqt · Pull Request #406307 · NixOS/nixpkgs

winterqt · 2025-05-12T00:31:40Z

Things done


winterqt · 2025-05-12T00:45:57Z

~~No idea why the code owner action can't request y'all's reviews, maybe because it's a draft? GH's API doesn't document such a limitation, though...~~ Ah, dry mode.

~~Happy to split the last commit into a different PR -- in fact, it may make sense to off the bat. Let me know.~~ Last commit has been split off.

lilyball

Just some minor comments on the bash scripting

ci/eval/default.nix

wolfgangwalther · 2025-05-12T19:21:52Z

I made some changes in my fork to run a big matrix of 4x4 jobs, each eval-system on each runner-type.

Made two runs:

before this PR: https://github.com/wolfgangwalther/nixpkgs/actions/runs/14980572553 - the macos runners all fail at the .links stuff
after this PR: https://github.com/wolfgangwalther/nixpkgs/actions/runs/14980579828 - they fail with an error about the sandbox

I guess, I'll need to turn the sandbox off for this test. Will continue later.

winterqt · 2025-05-12T19:24:08Z

Read the error in the later run: it tells you to set the `sandbox` configuration option to `relaxed`. That's what you have to do to get this to work.

wolfgangwalther · 2025-05-12T19:25:53Z

Here's the run with sandbox=relaxed, but still running: https://github.com/wolfgangwalther/nixpkgs/actions/runs/14980771666/job/42084126711

wolfgangwalther

According to the latest run above, this works on both x86_64-darwin and aarch64-darwin.

I'm thinking that we might want to have some way to confirm this in CI every now and then - eval should keep working on all platforms, so that all contributors can run it locally, too. I have a few ideas, but we don't need to do this in here.

winterqt · 2025-05-12T20:35:05Z

Rebased. Not sure if we should wait until eval.full is fixed on Darwin (without allowing bad platforms) via bumping the pinned Nixpkgs, or if we should just do it.

wolfgangwalther · 2025-05-12T20:53:15Z

> via bumping the pinned Nixpkgs

I'd like to do that soon anyway, as I'd like to get #405853 into the pin.

winterqt · 2025-05-15T02:34:35Z

Dropped the x86_64-linux eval system change in favor of waiting for #406825, but everything else should be good to go now that I've updated the pin.

Made a few other changes I'd love some eyes on, including addressing @lilyball's suggestions. (Thanks!)

Before this change, the eval derivations would never actually complete on any platform other than sandboxed Linux because the mem stats job would never be terminated. For reasons I don't know, the Linux sandbox somehow results in Bash terminating the job on its own. On every other platform (and when the Linux sandbox is disabled), we have to do it correctly and kill it manually.
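A minimal sketch of the pattern described above, not the actual code from ci/eval/default.nix: a background loop stands in for the memory stats job, and the script kills it explicitly once the work is done. The loop body, the sleep durations, and the names `stats_pid` and `stats_status` are all assumptions made for illustration.

```bash
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical stand-in for the memory stats loop: it runs until it is killed.
(
  while true; do
    sleep 0.1
  done
) &
stats_pid=$!

# Hypothetical stand-in for the real work (the evaluation itself).
sleep 0.3

# Terminate the background job explicitly instead of relying on the
# environment (for example a sandbox) to tear it down.
kill "$stats_pid"

# `wait` returns the job's status; a job killed by SIGTERM reports 128 + 15.
stats_status=0
wait "$stats_pid" || stats_status=$?
echo "stats job exited with status $stats_status"
```

Capturing `$!` right after starting the job keeps the kill independent of any background jobs that might be added later.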

winterqt · 2025-05-15T03:05:17Z

ci/eval/default.nix

+            # For some reason, `kill` (or `kill -9`) won't
+            # actually kill an entire process tree on macOS
+            # as it does on Linux. So instead, we'll just
+            # use xargs' native termination feature, which
+            # will at least guarantee that the iteration
+            # will be stopped.
+            #
+            # (Yes, technically we can just keep the `kill`
+            # command so that xargs dies and then the pending
+            # chunks will finish, but I'd rather just do this
+            # if it's going to have the same effect anyways.)
+            exit 255


Maybe there is a way to get this to work if there's a way to get the parent's process group ID? Not sure.

As discussed on Discord, I believe this should actually just be `kill 0` on all platforms. AFAICT `kill $PPID` only works on Linux because of the sandbox: the command fails with an error, `set -e` then causes the builder to exit, and presumably that causes the PID namespace to get torn down. xargs does not proactively kill its spawned processes when it dies.

If you do use `kill 0`, that signals the entire process group, which is going to include the shell itself since it's a non-interactive shell (which means job control defaults to off). If you want just the xargs process tree to shut down, you can enable job control first with `set -m`; that way the xargs pipeline will be its own process group and the `kill 0` will only affect it, at which point bash's `set -e` behavior will take over. So doing this really just depends on whether there are any benefits for log output (or whether Nix cares at all about whether the builder exits with a code or a signal).
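The `set -m` plus `kill 0` combination described above could look roughly like the following sketch. The worker is a hypothetical stand-in for an eval chunk (chunk 3 pretends to fail), and `set -e` is deliberately left out so that the status can be captured and printed instead of ending the script.

```bash
#!/usr/bin/env bash
# Sketch only: the worker below is a hypothetical stand-in for an eval chunk.

# Enable job control so the pipeline below runs in its own process group.
set -m

status=0
seq 1 5 | xargs -P 2 -I{} bash -c '
  if [ "$1" -eq 3 ]; then
    # Signal every process in this process group: xargs, seq if it is
    # still running, the other workers, and this worker itself.
    kill 0
  fi
  sleep 0.2
' _ {} || status=$?

# Job control is no longer needed once the pipeline has finished.
set +m

# The outer shell is in a different process group, so it survives and can
# inspect the status. A value above 128 means xargs died from a signal.
echo "pipeline status: $status"
```

With `set -e` active and no `||`, the non-zero pipeline status would end the script at that point, which is the errexit behavior the comment refers to.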

> AFAICT kill $PPID only works on linux because of the sandbox

This would match the issue with the memory stats job that I observed and also bisected down to the Linux sandbox. Fun.

Thanks, I’ll give that a shot. I don’t think we want to set -m at least without somehow explicitly detecting whether the xargs pipeline was killed or not (which I’m not sure is possible?).

`set -m` just enables job control (monitor mode), which makes every pipeline get its own process group (and bash will print a line when background jobs exit). It shouldn't actually hurt anything; it just makes bash behave a bit more like interactive mode. You can presumably turn it off again after the xargs finishes if you like.

I'm not sure what you mean about explicitly detecting whether the xargs pipeline was killed? If job control is enabled, the kill 0 will just kill xargs (and seq if it hasn't already finished) and its child processes, and bash will treat that as an error and exit (due to set -e). If you want to intercept that, you can wrap the xargs pipeline in an if or add a || after and test the status code (and a code over 128 indicates a signal, where the signal number is the exit status minus 128, e.g. SIGTERM is 143). But normal errexit behavior should be fine here? It's what you get with kill $PPID already.
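For illustration, intercepting the status with `||` and decoding the signal number might look like this sketch. The child command is a hypothetical stand-in that sends SIGTERM to itself; in the real script it would be the xargs pipeline.

```bash
#!/usr/bin/env bash
set -euo pipefail

status=0
# Hypothetical stand-in for the xargs pipeline: a child that terminates
# itself with SIGTERM. The `||` keeps errexit from ending the script here.
bash -c 'kill -TERM $$' || status=$?

signal_number=0
signal_name=""
if (( status > 128 )); then
  # By shell convention, 128 + N means "killed by signal N".
  signal_number=$(( status - 128 ))
  signal_name=$(kill -l "$signal_number")
  echo "killed by SIG$signal_name (signal $signal_number)"
elif (( status != 0 )); then
  echo "failed with exit code $status"
fi
```

A process can also call `exit 143` on its own, so a status above 128 is a strong hint rather than proof that a signal was involved.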

lilyball · 2025-05-18T20:31:14Z

ci/eval/default.nix

          done
        ) &

+        trap "kill %%" EXIT


This isn't going to work if anyone adds another background job later on, since the jobspec here is evaluated at EXIT time instead of right now.

Suggested change

- trap "kill %%" EXIT
+ trap "kill $(jobs -p %%)" EXIT

Also I just want to point out that stdenv installs a default exit handler that this overwrites. That may not matter at all, if nothing installs a failureHook or exitHook, but if you want to preserve that you could switch this over to adding both a failureHook and exitHook (or just a failureHook and adding an explicit kill after the xargs finishes).
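A small sketch of the expansion-time difference behind the suggested change. The `sleep` commands are hypothetical stand-ins for the stats job and for a background job added later, and the trap command is held in a variable (`cleanup_cmd`, an illustrative name) only so that it can be printed.

```bash
#!/usr/bin/env bash

# First background job: hypothetical stand-in for the memory stats loop.
sleep 30 &
stats_pid=$!

# Inside double quotes, $(jobs -p %%) is expanded right now, so the trap
# command contains the literal PID of the stats job.
cleanup_cmd="kill $(jobs -p %%)"
trap "$cleanup_cmd" EXIT

# A background job added later becomes the new current job (%%).
sleep 30 &
later_pid=$!

# This is what a jobspec evaluated at EXIT time would resolve to.
current_now=$(jobs -p %%)

echo "trap command:      $cleanup_cmd"
echo "stats job PID:     $stats_pid"
echo "%% now resolves to $current_now (the later job, PID $later_pid)"

# Clean up the later job; the EXIT trap takes care of the stats job.
kill "$later_pid"
wait "$later_pid" || true
```

The plain `kill %%` form would signal the later job at exit and leave the stats job running, which is the failure mode the review comment warns about.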

wolfgangwalther

> Dropped the x86_64-linux eval system change in favor of waiting for #406825, but everything else should be good to go now that I've updated the pin.

Do we still need to wait for #406825 or has the required change maybe already been merged (since I have been splitting off smaller parts repeatedly)?

Is there anything else, especially from the latest comments, that you'd want to address / change in here?

We just had #418153, too, so making eval work on darwin would be valuable to others, too.

ofborg bot added the 6.topic: darwin Running or building packages on Darwin label May 12, 2025

github-actions bot added 6.topic: continuous integration Affects continuous integration (CI) in Nixpkgs, including Ofborg and GitHub Actions backport release-24.11 labels May 12, 2025

winterqt force-pushed the push-lyyvlttunyvk branch from f5ecea6 to 0cc2922 Compare May 12, 2025 00:39

winterqt requested review from infinisil, philiptaron and wolfgangwalther May 12, 2025 00:45

lilyball reviewed May 12, 2025


winterqt force-pushed the push-lyyvlttunyvk branch from 0cc2922 to 383d569 Compare May 12, 2025 02:57

github-actions bot added 10.rebuild-darwin: 11-100 This PR causes between 11 and 100 packages to rebuild on Darwin. 10.rebuild-linux: 101-500 This PR causes between 101 and 500 packages to rebuild on Linux. labels May 12, 2025

winterqt force-pushed the push-lyyvlttunyvk branch from 383d569 to 6678847 Compare May 12, 2025 03:07

wolfgangwalther approved these changes May 12, 2025


winterqt force-pushed the push-lyyvlttunyvk branch from 6678847 to 3a6da35 Compare May 12, 2025 20:31

This was referenced May 13, 2025

ci/eval/release-checks: init #406825

Open

ci: Update pinned Nixpkgs #406909

Merged

winterqt force-pushed the push-lyyvlttunyvk branch from 3a6da35 to a4c79d1 Compare May 15, 2025 02:33

winterqt requested review from lilyball and wolfgangwalther May 15, 2025 02:34

winterqt added 2 commits May 14, 2025 22:38

ci/eval: add darwin support

c4b5bec

winterqt force-pushed the push-lyyvlttunyvk branch from a4c79d1 to 8d49a2f Compare May 15, 2025 02:38

github-actions bot removed the 10.rebuild-darwin: 11-100 This PR causes between 11 and 100 packages to rebuild on Darwin. label May 15, 2025

winterqt commented May 15, 2025


wolfgangwalther added backport release-25.05 and removed backport release-24.11 labels May 16, 2025

lilyball reviewed May 18, 2025


philiptaron removed their request for review June 5, 2025 23:40

wegank added the 2.status: merge conflict This PR has merge conflicts with the target branch label Jun 16, 2025

github-actions bot added the 12.approvals: 1 This PR was reviewed and approved by one person. label Jun 16, 2025

wolfgangwalther mentioned this pull request Jun 19, 2025

ci/eval: specify temp Nix store for Nix commands #418153

Closed

7 tasks

wolfgangwalther reviewed Jun 19, 2025


wolfgangwalther mentioned this pull request Sep 4, 2025

ci: eval.baseline does not execute on Darwin #440002

Open

3 tasks

nixpkgs-ci bot added the 2.status: stale https://github.com/NixOS/nixpkgs/blob/master/.github/STALE-BOT.md label Dec 16, 2025

mdaniels5757 added backport release-25.11 Backport PR automatically and removed backport release-25.05 labels Jan 2, 2026


5 participants
