[ROCm] [Flex attention] Memory access fault on nested_tensor UT #139754

@jataylo

🐛 Describe the bug

#136792 accidentally disabled the flex attention UTs on ROCm. We are re-enabling them with #139632, but one unit test hits a memory access fault.

We will skip the failing unit test so the ROCm flex attention UTs can start running again, and we are opening this issue to track the fault; a minimal skip sketch follows below.
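
For reference, this kind of skip is typically expressed with the `skipIfRocm` decorator from PyTorch's test utilities. The snippet below is only an illustrative sketch, not the actual change in #139632: the class name and the placeholder test body are made up here, and the real test lives in test_nestedtensor.py.

```python
# Illustrative sketch only -- not the actual patch in #139632.
# Shows how a single failing case can be skipped on ROCm while the
# remaining flex attention UTs keep running.
import torch
from torch.testing._internal.common_utils import TestCase, run_tests, skipIfRocm


class FlexAttentionSkipExample(TestCase):  # placeholder class name
    @skipIfRocm  # memory access fault on ROCm, tracked in this issue
    def test_flex_attention_noncontig_with_holes(self):
        # Placeholder body; the real UT exercises flex attention over NJTs.
        x = torch.ones(2, 2)
        self.assertEqual(x.sum().item(), 4.0)


if __name__ == "__main__":
    run_tests()
```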

cc: @drisspg @jbschlosser

https://hud.pytorch.org/pr/pytorch/pytorch/139632#32490576539

2024-11-04T18:11:36.8788561Z test_nestedtensor.py::TestNestedTensorSubclassCUDA::test_flex_attention_noncontig_with_holes_False_cuda_float32 Memory exception on virtual address 0x7fa191005000, node id 3 : Address does not belong to a known buffer
2024-11-04T18:11:36.8789792Z Memory access fault by GPU node-3 (Agent handle: 0xaf07060) on address 0x7fa191005000. Reason: Unknown.
2024-11-04T18:11:36.8790324Z Fatal Python error: Aborted
2024-11-04T18:11:36.8790497Z 
2024-11-04T18:11:36.8790643Z Thread 0x00007fa189cff700 (most recent call first):
2024-11-04T18:11:36.8790978Z   <no Python frame>
2024-11-04T18:11:36.8791118Z 
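
The failing test exercises flex attention over a jagged nested tensor (NJT). The sketch below only illustrates that general usage pattern under stated assumptions: the sequence lengths, head counts, and the `make_njt` helper are invented for illustration, and it assumes the documented `flex_attention` / `create_nested_block_mask` APIs from `torch.nn.attention.flex_attention`; it is not the body of the failing UT.

```python
# Rough, hypothetical sketch of the kind of workload the failing UT exercises:
# flex attention over a jagged (NJT) query/key/value. All shapes and the
# make_njt helper are illustrative, not taken from the actual test.
import torch
from torch.nn.attention.flex_attention import create_nested_block_mask, flex_attention

device, dtype = "cuda", torch.float32
n_heads, head_dim = 4, 16
seq_lens = (3, 7, 5)  # per-sequence (ragged) lengths, chosen arbitrarily


def make_njt():
    # Build a (B, S*, H, D) jagged nested tensor, then transpose to the
    # (B, H, S*, D) layout flex attention expects.
    return torch.nested.nested_tensor(
        [torch.randn(s, n_heads, head_dim, device=device, dtype=dtype) for s in seq_lens],
        layout=torch.jagged,
    ).transpose(1, 2)


def causal(b, h, q_idx, kv_idx):
    return q_idx >= kv_idx


q = make_njt()
# Self-attention with a shared NJT so q/k/v have the same ragged structure.
block_mask = create_nested_block_mask(causal, 1, 1, q)
out = torch.compile(flex_attention)(q, q, q, block_mask=block_mask)
print(out.shape)
```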

Will be reassessed with Triton 3.2 (#139175).

Versions

CI

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @dllehr-amd @hongxiayang @naromero77amd @ezyang @chauhang @penguinwu @zou3519 @ydwu4 @bdhirsh @yf225 @Chillee @drisspg @yanboliang @BoyuanFeng

Labels

- module: flex attention
- module: higher order operators (torch.cond and similar)
- module: pt2-dispatcher (PT2 dispatcher-related issues, e.g. aotdispatch, functionalization, faketensor, custom-op)
- module: rocm (AMD GPU support for PyTorch)
- oncall: pt2
- triaged (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
