
Conversation

@pietern pietern commented Apr 27, 2019

Stack:
    #19821 Allow for iterations where no module parameter is used (this PR)

It is possible that not a single parameter is used during an
iteration. If this is the case, the prepare_for_backward function
marks all parameters as unused, kicks off reduction of all buckets,
and finalizes the reduction.

This is different from the prior implementation where we assumed that
autograd would produce a gradient for at least a single parameter.
We then used the autograd callback mechanism to queue a finalizer
callback. Now, this finalizer may be executed inline.
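
To make the new control flow concrete, here is a minimal, self-contained sketch. It is not the actual reducer code: `Reducer`, `any_parameter_used`, `mark_all_unused_and_reduce`, and the callback vector below are simplified stand-ins for the real implementation, which has a different interface.

```cpp
// Simplified sketch of the control flow described above. The callback vector
// merely stands in for the autograd engine's post-backward callback queue.
#include <functional>
#include <iostream>
#include <vector>

struct Reducer {
  // Stand-in for callbacks queued on the autograd engine.
  std::vector<std::function<void()>> autograd_callbacks;

  void finalize_backward() {
    std::cout << "reduction finalized\n";
  }

  void mark_all_unused_and_reduce() {
    std::cout << "all parameters marked unused; all buckets reduced\n";
  }

  void prepare_for_backward(bool any_parameter_used) {
    if (!any_parameter_used) {
      // No gradient will ever arrive, so no autograd hook would fire.
      // Mark everything unused, reduce every bucket, and finalize inline.
      mark_all_unused_and_reduce();
      finalize_backward();
      return;
    }
    // At least one gradient is expected; defer finalization to a callback
    // that runs once autograd has produced it (the prior behavior).
    autograd_callbacks.push_back([this] { this->finalize_backward(); });
  }
};
```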

Differential Revision: D15113272

@pytorchbot pytorchbot added the oncall: distributed label Apr 27, 2019
@pietern pietern changed the title Stop relying on autograd finalizer hook in reducer Allow for iterations where no module parameter is used Apr 27, 2019
      // Queued as an autograd engine callback; runs after a gradient is produced.
      this->finalize_backward();
    });
  } else {
    // No gradient will fire the callback, so finalize inline.
    finalize_backward();
  }

I am debating with myself whether we should support this or error out with an explicit message. What are the use cases for DDP without a backward pass?


We should definitely support it. It's possible that for a few iterations there is no backward pass and for a few iterations there is one (for example, when doing MCTS / rollouts).
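
To picture this usage pattern, here is a hypothetical driver loop reusing the simplified `Reducer` sketched after the PR description above; it is not the real DDP API, only an illustration of iterations that alternate between forward-only rollouts and training steps.

```cpp
// Hypothetical driver, continuing the simplified Reducer sketch above:
// even-numbered steps are forward-only rollouts (no backward pass), while
// odd-numbered steps train and therefore run the queued callbacks.
int main() {
  Reducer reducer;
  for (int step = 0; step < 4; ++step) {
    const bool is_rollout = (step % 2 == 0);
    reducer.prepare_for_backward(/*any_parameter_used=*/!is_rollout);
    if (!is_rollout) {
      // In real training, loss.backward() would run these via autograd.
      for (auto& callback : reducer.autograd_callbacks) {
        callback();
      }
      reducer.autograd_callbacks.clear();
    }
  }
  return 0;
}
```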

@pietern pietern deleted the export-D15113272 branch April 28, 2019 06:10
@facebook-github-bot

This pull request has been merged in 9b69da2.

zhangguanheng66 pushed a commit to zhangguanheng66/pytorch that referenced this pull request May 6, 2019
Summary:
Pull Request resolved: pytorch#19821

Reviewed By: mrshenli

Differential Revision: D15113272

fbshipit-source-id: dc91458b569cd8c106ddaeea558464b515683550