
at::parallel_for does not propagate thread-local variables to child threads in embedding_renorm_ #28370

Description

@albanD

See repro below:

import torch

def run(val):
    print("Running :", val)
    weights = torch.rand(100, 64, requires_grad=True)
    # `val` random indices into the 100 rows of `weights`; the element
    # count controls whether the kernel takes the multi-threaded path.
    inp = torch.rand(val).mul(100).long()
    with torch.no_grad():
        # In-place renorm: under no_grad() this must not record autograd history.
        torch.embedding_renorm_(weights, inp, 1.0, 2)
    print("This should be None: ", weights.grad_fn)

run(1000)  # grad_fn stays None
run(1001)  # a CopySlices grad_fn leaks despite no_grad()

Outputs:

Running : 1000
This should be None:  None
Running : 1001
This should be None:  <CopySlices object at 0x7f02faa5e7a8>

The current supposition is that the interaction between multi-threaded operations and the thread-local NoGradGuard is the problem: at::parallel_for does not propagate the guard to its worker threads, so work executed on those threads records autograd history even inside a no_grad() block.
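A minimal sketch of the suspected mechanism: grad mode in PyTorch is thread-local, so a freshly spawned thread starts with the default (enabled) state rather than inheriting the parent's no_grad() state. If at::parallel_for's worker threads behave like the plain Python thread below, the in-place copy inside embedding_renorm_ would run with gradients enabled on those workers:

import threading

import torch

def worker():
    # Grad mode is thread-local: this new thread starts with the
    # default (enabled) state, not the parent's no_grad() state.
    print("worker sees grad enabled:", torch.is_grad_enabled())  # True

with torch.no_grad():
    print("main sees grad enabled:", torch.is_grad_enabled())  # False
    t = threading.Thread(target=worker)
    t.start()
    t.join()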

cc @ezyang @gchanan @zou3519 @jerryzh168 @ssnl @albanD @gqchen @VitalyFedyunin @ngimel @mruberry

Labels: high priority, module: autograd, module: internals, module: multithreading, module: performance, triaged
