
batch_first broken in AutogradRNN #253

@jekbradbury

The last line here fails on CPU, or whenever cuDNN is otherwise unavailable:

import torch
import torch.nn as nn
from torch.autograd import Variable

l, b, t, x, h = 2, 3, 5, 10, 20  # layers, batch, time, input size, hidden size

rnn = nn.LSTM(x, h, l, batch_first=True)
inpt = Variable(torch.randn(b, t, x))
h0 = Variable(torch.randn(l, b, h))
c0 = Variable(torch.randn(l, b, h))
output, hn = rnn(inpt, (h0, c0))

This is because AutogradRNN.forward accidentally assumes Tensor's in-place transpose semantics rather than the functional semantics of Variable, whose transpose returns a new Variable (cudnn.rnn.forward gets it right):

def forward(input, weight, hidden):
    if batch_first:
        input.transpose(0, 1)   # result is discarded: a no-op on a Variable
    nexth, output = func(input, hidden, weight)
    if batch_first:
        output.transpose(0, 1)  # same problem here
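
For reference, a minimal sketch of the semantic difference (the variable names here are just for illustration): transpose on a Variable returns a new Variable and leaves the original untouched, so calling it without binding the result does nothing:

import torch
from torch.autograd import Variable

v = Variable(torch.randn(3, 5))
w = v.transpose(0, 1)  # functional: returns a new, transposed Variable
print(v.size())        # torch.Size([3, 5]) -- v is unchanged
print(w.size())        # torch.Size([5, 3])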

I can push a PR that fixes this, or one of the devs can put it in the next bugfix PR:

def forward(input, weight, hidden):
    if batch_first:
        input = input.transpose(0, 1)    # rebind: transpose is functional on Variable
    nexth, output = func(input, hidden, weight)
    if batch_first:
        output = output.transpose(0, 1)
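
With that change, the repro at the top should run without cuDNN as well; assuming the fix, output comes back batch-first with the expected shape:

output, hn = rnn(inpt, (h0, c0))
print(output.size())  # torch.Size([3, 5, 20]) -- (batch, time, hidden)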
