🐛 Describe the bug
Symptom: for the nn.LazyLinear → F.gumbel_softmax pattern below, the eager and Inductor outputs diverge. With hard=True the difference is obvious (different one-hot vectors); with hard=False the inconsistency still exists and the outputs fail the allclose check.
Device backend: reproduces with both the Triton (CUDA) and the C++ (CPU) Inductor backends.
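For context, F.gumbel_softmax draws fresh Gumbel(0, 1) noise on every call, roughly as in the minimal sketch below (paraphrased from the public PyTorch source, not the verbatim implementation; gumbel_softmax_sketch is an illustrative name). With hard=True the forward output is a straight-through one-hot of the noisy argmax, so any divergence in how eager and Inductor consume RNG flips which index wins:

```python
import torch

def gumbel_softmax_sketch(logits, tau=1.0, hard=False, dim=-1):
    # -log(Exponential(1)) is Gumbel(0, 1)-distributed noise.
    gumbels = -torch.empty_like(logits).exponential_().log()
    y_soft = ((logits + gumbels) / tau).softmax(dim)
    if not hard:
        return y_soft
    # Straight-through: one-hot of the argmax in the forward pass,
    # gradients flow through y_soft.
    index = y_soft.max(dim, keepdim=True)[1]
    y_hard = torch.zeros_like(logits).scatter_(dim, index, 1.0)
    return y_hard - y_soft.detach() + y_soft
```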
Repro:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch._inductor import config

# Ask Inductor to fall back to eager RNG so random ops should match eager.
config.fallback_random = True
torch.set_grad_enabled(False)
torch.manual_seed(0)

class Model(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.lazy_linear = torch.nn.LazyLinear(out_features=10)

    def forward(self, x):
        x = self.lazy_linear(x)
        x = F.gumbel_softmax(x, tau=0.5, hard=True)
        return x

model = Model().eval().cuda()
x = torch.randn(1, 10).cuda()
inputs = [x]

def run_test(model, inputs, backend):
    if backend != "eager":
        model = torch.compile(model, backend=backend)
    # Reseed before each run so both backends start from the same RNG state.
    torch.manual_seed(0)
    output = model(*inputs)
    print(output)
    return output

output = run_test(model, inputs, 'eager')
c_output = run_test(model, inputs, 'inductor')
print(torch.allclose(output, c_output, 1e-3, 1e-3, equal_nan=True))
print(torch.max(torch.abs(output - c_output)))
```

Error logs
```
tensor([[0., 1., 0., 0., 0., 0., 0., 0., 0., 0.]], device='cuda:0')
tensor([[0., 0., 0., 1., 0., 0., 0., 0., 0., 0.]], device='cuda:0')
False
tensor(1., device='cuda:0')
```
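The max abs difference of 1.0 is consistent with hard=True producing one-hot vectors whose argmax index differs between the two backends. As a hedged workaround sketch (an assumption, not a confirmed fix; gumbel_softmax_with_noise is a hypothetical helper), the random sampling can be moved out of the compiled region so that eager and Inductor consume the exact same noise tensor:

```python
import torch

def gumbel_softmax_with_noise(logits, gumbel_noise, tau=0.5, hard=True, dim=-1):
    # Same math as F.gumbel_softmax, but the Gumbel(0, 1) noise is supplied
    # by the caller instead of being drawn inside the op.
    y_soft = ((logits + gumbel_noise) / tau).softmax(dim)
    if not hard:
        return y_soft
    index = y_soft.max(dim, keepdim=True)[1]
    y_hard = torch.zeros_like(logits).scatter_(dim, index, 1.0)
    return y_hard - y_soft.detach() + y_soft

logits = torch.randn(1, 10)
# Sampled once, outside torch.compile, so no RNG runs in the compiled graph.
noise = -torch.empty_like(logits).exponential_().log()
compiled = torch.compile(gumbel_softmax_with_noise)
print(torch.allclose(gumbel_softmax_with_noise(logits, noise),
                     compiled(logits, noise)))
```

With the randomness removed from the compiled region, any remaining difference would point at the compiled math rather than at RNG ordering.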
Versions
PyTorch nightly 20250225