
Conversation

@mruberry
Collaborator

Currently, when _apply() is called on RNNBase (or one of its children, like LSTM), the _flat_weights attribute may or may not be updated. In particular, when using .to() to send a module like LSTM to XLA, a third-party device type, the tensors in _flat_weights are not updated and remain on CPU. This causes the LSTM forward to fail, since the forward call receives a mix of XLA and CPU tensors (see the repro sketch below).
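
For context, a minimal repro sketch of this failure mode; it assumes torch_xla is installed, and the model sizes are arbitrary (not taken from this PR):

```python
# Hedged repro sketch: requires the torch_xla package; sizes are arbitrary.
import torch
import torch.nn as nn
import torch_xla.core.xla_model as xm

device = xm.xla_device()                      # an XLA device, e.g. a TPU core
lstm = nn.LSTM(input_size=8, hidden_size=16)
lstm = lstm.to(device)                        # parameters are moved to XLA...
x = torch.randn(5, 3, 8, device=device)
out, _ = lstm(x)                              # ...but before this fix _flat_weights
                                              # still held CPU tensors, so forward()
                                              # saw a mix of XLA and CPU tensors.
```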

This occurs because tensors on third-party device types, like XLA, may not have a shallow-copy type compatible with native tensors. When that is the case and _apply is called, Module parameters are replaced rather than updated in place (see the sketch below). RNNBase did not resync _flat_weights with its parameters in this case, so the references in _flat_weights no longer reflected the module's current parameters.
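
For reference, a simplified sketch of the in-place-vs-replace decision in Module._apply (grad and buffer handling omitted; this paraphrases the behavior and is not the verbatim implementation):

```python
import torch

def _apply_sketch(module, fn):
    # Recurse into children first, as Module._apply does.
    for child in module.children():
        _apply_sketch(child, fn)

    for key, param in module._parameters.items():
        if param is None:
            continue
        with torch.no_grad():
            param_applied = fn(param)
        if torch._has_compatible_shallow_copy_type(param, param_applied):
            # Native tensor types: the existing Parameter is updated in place, so
            # outstanding references (e.g. _flat_weights) see the new data.
            param.data = param_applied
        else:
            # Incompatible types (e.g. XLA): the Parameter object is replaced, so
            # cached references like RNNBase._flat_weights keep pointing at the
            # old CPU tensors unless they are explicitly resynced.
            module._parameters[key] = torch.nn.Parameter(
                param_applied, param.requires_grad)
    return module
```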

This small change forces a resync of _flat_weights with the actual parameters on each _apply, as sketched below. This lets .to('xla') work for LSTMs, for example. A test will be added to PyTorch/XLA (which runs in our CI) to validate this behavior after the change appears in PyTorch.
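
The fix amounts to rebuilding _flat_weights from the module's current parameters every time _apply runs. A minimal sketch of the override (attribute names paraphrased; the real RNNBase also defines flatten_parameters() and the list of flat weight names):

```python
# Sketch of RNNBase._apply after this change (names paraphrased, not the exact diff).
def _apply(self, fn):
    ret = super(RNNBase, self)._apply(fn)
    # _apply may have replaced the Parameter objects outright (e.g. on .to('xla')),
    # so rebuild _flat_weights from the module's current attributes before
    # re-flattening the weights for cuDNN.
    self._flat_weights = [getattr(self, name) for name in self._flat_weights_names]
    self.flatten_parameters()
    return ret
```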

@mruberry mruberry requested a review from apaszke as a code owner October 24, 2019 00:05
@mruberry mruberry requested review from ngimel and removed request for apaszke October 24, 2019 00:50
Collaborator

@ngimel ngimel left a comment


lgtm

Contributor

@facebook-github-bot facebook-github-bot left a comment


@mruberry has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@mruberry merged this pull request in 0c48092.
