Skip to content

linux-aarch64 CI tests are being timed out resulting in test failures #136192

@nikhil-arm

Description

@nikhil-arm

NOTE: Remember to label this issue with "ci: sev"

Blocks merging of : #134124 and #135857

Current Status

Status could be: preemptive, ongoing, mitigated, closed. Also tell people if they need to take action to fix it (i.e. rebase).

Error looks like

https://github.com/pytorch/pytorch/actions/workflows/linux-aarch64.yml
https://github.com/pytorch/pytorch/actions/runs/10894440149/job/30233134789

Incident timeline (all times pacific)

Linux-aarch64 time out is observed since last week and its still on going

User impact

Blocks testing / merging

Root cause

What was the root cause of this issue?

Mitigation

How did we mitigate the issue?

Prevention/followups

How do we prevent issues like this in the future?

cc @ezyang @gchanan @zou3519 @kadeng @msaroufim @seemethere @malfet @pytorch/pytorch-dev-infra @snadampal @milpuz01

Metadata

Metadata

Assignees

Labels

module: armRelated to ARM architectures builds of PyTorch. Includes Apple M1module: ciRelated to continuous integrationtriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

Type

No type

Projects

Status

Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions