[26.1 backport] gha: add guardrails timeouts on all jobs by austinvazquez · Pull Request #48647 · moby/moby

austinvazquez · 2024-10-12T01:53:11Z

- What I did

Backports gha: add guardrails timeouts on all jobs #48629 to 26.1
Backports gha: restrict cross and bin-image to 20 minutes #48645 to 26.1

- How I did it

git cherry-pick -xsS 6b7e2783d1e68c4da3764525e3e8e74b85d0d8c8
git cherry-pick -xsS c68c9aed8cb3916669de6d7f2c564279ec83663f

- Description for the changelog

n/a

- A picture of a cute animal (not mandatory but encouraged)

We had a few "runaway jobs" recently, where the job got stuck and kept running for 6 hours (in one case even 24 hours, probably due to a GitHub outage). Some of those jobs could not be terminated.

While running these actions on public repositories doesn't cost us anything, it's still not desirable to have jobs running for that long, as they can still hold up the queue.

This patch adds a blanket 2-hour time limit to all jobs that didn't have a limit set. We should look at tweaking those limits to the actually expected duration, but having a default is at least a start.

Also changed the position of some existing timeouts so that they are set in a consistent order, making it easier to spot locations where no limit is defined.

Signed-off-by: Sebastiaan van Stijn <[email protected]>
(cherry picked from commit 6b7e278)
Signed-off-by: Austin Vazquez <[email protected]>
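As a rough illustration of the guardrail this commit describes: a job-level `timeout-minutes` key caps how long a job may run. Without it, GitHub Actions falls back to its default of 360 minutes, which lines up with the 6-hour runaway jobs mentioned above. The workflow name, job name, and steps below are hypothetical placeholders, not copied from the moby workflows.

```yaml
# Hypothetical workflow fragment; names and steps are placeholders.
name: example

on:
  pull_request:

jobs:
  unit-tests:
    runs-on: ubuntu-22.04
    timeout-minutes: 120 # guardrail for the whole job (2 hours)
    steps:
      - name: Checkout
        uses: actions/checkout@v4
      - name: Run tests
        run: make test # placeholder command
```

Putting the key at the same position in every job (here, directly after `runs-on`, which is an assumption about the chosen order) makes a job without a limit easier to spot when skimming the file.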

We had a couple of runs where these jobs got stuck and GitHub Actions didn't allow terminating them, so they were only terminated after 120 minutes.

These jobs usually complete in 5 minutes, so let's give them a shorter timeout. 20 minutes should be enough (don't @ me).

Signed-off-by: Sebastiaan van Stijn <[email protected]>
(cherry picked from commit c68c9ae)
Signed-off-by: Austin Vazquez <[email protected]>
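The tighter limit for these fast jobs might look like the sketch below. The job name `cross` only echoes the commit title; the actual job layout in the moby workflows is not shown in this PR and is assumed here, as are the runner and the step.

```yaml
# Hypothetical fragment: only the 20-minute value comes from the commit message.
jobs:
  cross:
    runs-on: ubuntu-22.04
    timeout-minutes: 20 # job usually completes in about 5 minutes
    steps:
      - name: Checkout
        uses: actions/checkout@v4
      - name: Build
        run: echo "cross-compile step goes here" # placeholder command
```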

austinvazquez · 2024-10-12T01:56:16Z

I was thinking it would be good to have these in the maintenance branches.

thaJeztah · 2024-10-12T14:17:34Z

> I was thinking it would be good to have these in the maintenance branches.

Yes, it is! I thought it wasn't critical, but it's definitely good to have. And ... evidently we need to be even more aggressive: one of the bin-image jobs in this PR hung and was only terminated after 2 hours. Looks like we can set those a lot shorter as well; https://github.com/moby/moby/actions/runs/11301696824/job/31436480660?pr=48647

Running it again completed in less than 4 minutes.

Not sure what's causing these, though; they only started showing up recently. Either something changed in the GHA runners, or there's a deadlock somewhere (but I don't think the Docker Engine versions changed in GHA).

thaJeztah

LGTM

thaJeztah added 2 commits October 12, 2024 01:39

thaJeztah added status/4-merge area/testing labels Oct 12, 2024

thaJeztah added this to the 26.1.5 milestone Oct 12, 2024

thaJeztah approved these changes Oct 12, 2024

thaJeztah merged commit 95807d2 into moby:26.1 Oct 12, 2024

thaJeztah mentioned this pull request Oct 12, 2024

gha: more limits, update alpine version, and some minor improvements #48654

Merged

austinvazquez deleted the cherry-pick-c68c9aed8cb3916669de6d7f2c564279ec83663f-to-26.1 branch October 13, 2024 03:04

thaJeztah merged 2 commits into moby:26.1 from austinvazquez:cherry-pick-c68c9aed8cb3916669de6d7f2c564279ec83663f-to-26.1
