[c10d] Added the finer bucketing option for DDP #13607

teng-li · 2018-11-06T03:05:25Z

We only need this for backward, for FWD cast, the non-fine-grained bucketing should be better since it's sequential anyway.

Test should be covered all by c10d test, reduced bucket size to make bucketing happen in c10d test.

teng-li · 2018-11-06T06:27:05Z

@pytorchbot retest this please

improvements

pietern

Nice. One reserve call missing. Did you do a perf comparison for this? Curious to hear what happens.

torch/csrc/distributed/c10d/ddp.cpp

+  std::vector<std::vector<at::Tensor>> bucketedTensors;
+  auto tensorGroups =
+      torch::utils::take_tensors(tensors, bucketSize, fineGrained);
+  for (auto& tensorGroup : tensorGroups) {


facebook-github-bot

@teng-li is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot

@teng-li is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot

@teng-li has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

teng-li added the oncall: distributed Add this issue/PR to distributed oncall triage queue label Nov 6, 2018

teng-li requested review from apaszke and pietern as code owners November 6, 2018 03:05

teng-li closed this Nov 6, 2018

teng-li reopened this Nov 6, 2018

teng-li closed this Nov 6, 2018

teng-li reopened this Nov 6, 2018

teng-li force-pushed the take_tensor branch from f7b2a45 to 6e6d033 Compare November 6, 2018 06:29

[c10d] Let DDP to use finer bucketing strategy for fp16 perf

15d9aba

improvements

teng-li force-pushed the take_tensor branch from 6e6d033 to 15d9aba Compare November 6, 2018 07:24

pietern approved these changes Nov 6, 2018

View reviewed changes

torch/csrc/distributed/c10d/ddp.cpp

std::vector<std::vector<at::Tensor>> bucketedTensors;

auto tensorGroups =

torch::utils::take_tensors(tensors, bucketSize, fineGrained);

for (auto& tensorGroup : tensorGroups) {

This comment was marked as off-topic.

Sign in to view

Addressed comments

f34721d

teng-li force-pushed the take_tensor branch from b06b64c to f34721d Compare November 6, 2018 19:06

facebook-github-bot reviewed Nov 6, 2018

View reviewed changes

Use non-finegrained by default

5d952c2

facebook-github-bot reviewed Nov 6, 2018

View reviewed changes

teng-li changed the title ~~[c10d] Let DDP to use finer bucketing strategy for fp16 perf improvement~~ [c10d] Added the finer bucketing option for DDP Nov 6, 2018

facebook-github-bot reviewed Nov 7, 2018

View reviewed changes

facebook-github-bot closed this in 1413dd4 Nov 7, 2018

pietern mentioned this pull request Mar 7, 2019

Device agnostic gradient reduction #17757

Closed

ezyang added the merged label Jun 25, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[c10d] Added the finer bucketing option for DDP #13607

[c10d] Added the finer bucketing option for DDP #13607

Uh oh!

teng-li commented Nov 6, 2018

Uh oh!

teng-li commented Nov 6, 2018

Uh oh!

pietern left a comment

Uh oh!

This comment was marked as off-topic.

Uh oh!

facebook-github-bot left a comment

Uh oh!

facebook-github-bot left a comment

Uh oh!

facebook-github-bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[c10d] Added the finer bucketing option for DDP #13607

[c10d] Added the finer bucketing option for DDP #13607

Uh oh!

Conversation

teng-li commented Nov 6, 2018

Uh oh!

teng-li commented Nov 6, 2018

Uh oh!

pietern left a comment

Choose a reason for hiding this comment

Uh oh!

This comment was marked as off-topic.

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants