Skip to content

[CI][CUDA][Distributed] test_ring_attention_sdpa Runtime Error "The size of tensor a (64) must match the size of tensor b (8) at non-singleton dimension 2 " #162743

@nWEIdia

Description

@nWEIdia

🐛 Describe the bug

Tracking this in Umbrella Bug: #162178

Job link: https://github.com/pytorch/pytorch/actions/runs/17470577491/job/49628024468

Failure snippets including reproducer command:

E0911 12:33:44.425000 580942 torch/testing/_internal/common_distributed.py:818] RuntimeError: The size of tensor a (64) must match the size of tensor b (8) at non-singleton di
mension 2
E0911 12:33:44.425000 580942 torch/testing/_internal/common_distributed.py:818]
E0911 12:33:44.425000 580942 torch/testing/_internal/common_distributed.py:818] To execute this test, run the following from the base repo dir:
E0911 12:33:44.425000 580942 torch/testing/_internal/common_distributed.py:818] python test/distributed/tensor/test_attention.py RingAttentionTest.test_ring_attention_sdpa

Versions

Test performed with pytorch commit: 145a3a7bda15e3963a33eb1b54bba5d4a270b225

cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @pragupta @ezyang @msaroufim @dcci

Metadata

Metadata

Assignees

No one assigned

    Labels

    oncall: distributedAdd this issue/PR to distributed oncall triage queue

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions