>>> import torch
>>> a = torch.rand(5).cuda(1)  # both inputs live on GPU 1
>>> b = torch.rand(5).cuda(1)
>>> c = torch.cat([a, b], 0)
>>> c.get_device()             # output lands on GPU 0, the current device
0
>>> torch.cuda.set_device(1)
>>> c = torch.cat([a, b], 0)
>>> c.get_device()             # output follows the current device, not the inputs
1
This is a problem for nn.DataParallel and similar multi-GPU code: the output of torch.cat is allocated on the current device rather than on the inputs' device. Paszke says it might be a bug in the wrapper, because device switching based on the arguments is exactly what THCPAutoGPU is supposed to deal with.
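
Until that is fixed, one workaround is to pin the current device to the inputs' device around the call. A minimal sketch, assuming the torch.cuda.device context manager (which temporarily changes the current device) is available in the installed version:

import torch

a = torch.rand(5).cuda(1)
b = torch.rand(5).cuda(1)

# Temporarily make the inputs' device the current device, so the
# output of cat is allocated on the same GPU as a and b.
with torch.cuda.device(a.get_device()):
    c = torch.cat([a, b], 0)

assert c.get_device() == 1  # output now stays on GPU 1

This just does by hand what THCPAutoGPU is expected to do automatically from the arguments' device.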