Only call into reducer if torch.is_grad_enabled() #19897

pietern · 2019-04-29T01:43:14Z

Stack:
:black_circle: #19901 Finer grained consistency check in reducer 💛
:white_circle: #19897 Only call into reducer if torch.is_grad_enabled() 💚

During validation, gradient reduction is not needed, and autograd is
never called. The model output will always be a detached tensor. After
the new reducer was merged, this meant that it would find all model
parameters unused, and kick off reduction for them. When #19799 and
#19821 were merged it looked like model output during validation is an
output where no parameters are used and it tries to kick off reduction
of zeroed gradients. Test for torch.is_grad_enabled() and
self.training before calling into the reducer.

Differential Revision: D15118726

Differential Revision: D15118726 Differential Version: 80866244

Differential Revision: D15118726 Differential Version: 80866256

Differential Revision: D15118726 Differential Version: 80866265

torch/nn/parallel/distributed.py

Differential Revision: D15118726 Differential Version: 80866864

facebook-github-bot · 2019-04-29T07:04:52Z

This pull request has been merged in 5525c41.

Summary: Pull Request resolved: #19897 During validation, gradient reduction is not needed, and autograd is never called. The model output will always be a detached tensor. After the new reducer was merged, this meant that it would find all model parameters unused, and kick off reduction for them. When #19799 and output where no parameters are used and it tries to kick off reduction of zeroed gradients. Test for `torch.is_grad_enabled()` and `self.training` before calling into the reducer. Reviewed By: mrshenli Differential Revision: D15118726 fbshipit-source-id: b0208f632a61cbe8110fa626fa427937b7f05924

Summary: Pull Request resolved: pytorch#19897 During validation, gradient reduction is not needed, and autograd is never called. The model output will always be a detached tensor. After the new reducer was merged, this meant that it would find all model parameters unused, and kick off reduction for them. When pytorch#19799 and output where no parameters are used and it tries to kick off reduction of zeroed gradients. Test for `torch.is_grad_enabled()` and `self.training` before calling into the reducer. Reviewed By: mrshenli Differential Revision: D15118726 fbshipit-source-id: b0208f632a61cbe8110fa626fa427937b7f05924

V1: Initial commit

27e3da2

Differential Revision: D15118726 Differential Version: 80866244

pietern requested review from apaszke and mrshenli as code owners April 29, 2019 01:43

pytorchbot added oncall: distributed Add this issue/PR to distributed oncall triage queue module: nn Related to torch.nn labels Apr 29, 2019

pietern added 2 commits April 28, 2019 18:45

V2: Rename test module

481370c

Differential Revision: D15118726 Differential Version: 80866256

V3: Lint

2e99317

Differential Revision: D15118726 Differential Version: 80866265

pietern mentioned this pull request Apr 29, 2019

Runtime Error when using DistributedDataParallel with torch.no_grad() #19896

Closed

mrshenli approved these changes Apr 29, 2019

View reviewed changes

soumith suggested changes Apr 29, 2019

View reviewed changes

torch/nn/parallel/distributed.py Outdated Show resolved Hide resolved

V4: Remove self.training check

2434d68

Differential Revision: D15118726 Differential Version: 80866864

soumith approved these changes Apr 29, 2019

View reviewed changes

pietern mentioned this pull request Apr 29, 2019

Finer grained consistency check in reducer #19901

Closed

facebook-github-bot closed this in 5525c41 Apr 29, 2019

facebook-github-bot added the merged label Apr 29, 2019

pietern deleted the export-D15118726 branch April 29, 2019 16:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Only call into reducer if torch.is_grad_enabled() #19897

Only call into reducer if torch.is_grad_enabled() #19897

Uh oh!

pietern commented Apr 29, 2019 •

edited

Loading

Uh oh!

Uh oh!

facebook-github-bot commented Apr 29, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Only call into reducer if torch.is_grad_enabled() #19897

Only call into reducer if torch.is_grad_enabled() #19897

Uh oh!

Conversation

pietern commented Apr 29, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

facebook-github-bot commented Apr 29, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

pietern commented Apr 29, 2019 •

edited

Loading