-
Notifications
You must be signed in to change notification settings - Fork 29.7k
Closed
Labels
c: contributor-productivityTeam-specific productivity, code health, technical debt.Team-specific productivity, code health, technical debt.infra: metricsInfrastructure metrics-related issuesInfrastructure metrics-related issuesteam-infraOwned by Infrastructure teamOwned by Infrastructure team
Description
I've seen this several times now on #74922 and a couple other recent PRs, particularly on Windows bots.
AFAICT, what's happening is that the "sharding" bot gets scheduled and starts. It spawns 3 more builders and waits. Those other 3 builders take most of the ~1hr the original bot has to run to even get scheduled and start. the original bot sees it is out of time and cancels the other builders, and is marked as a failure.
I think we need some combination of:
- More bots
- Fewer shards (or more shards so they finish/fail faster?)
- No timeout on shard waiting bots
- No dedicated bot waiting for the sharded bots.
/cc @godofredoc @keyonghan @CaseyHillers
I've seen this on several PRs recently.
Metadata
Metadata
Assignees
Labels
c: contributor-productivityTeam-specific productivity, code health, technical debt.Team-specific productivity, code health, technical debt.infra: metricsInfrastructure metrics-related issuesInfrastructure metrics-related issuesteam-infraOwned by Infrastructure teamOwned by Infrastructure team