Skip to content

Conversation

@goldsborough
Copy link
Contributor

The next function I'm moving to C++ is sync_params. It is stacked on top of #9729, so some changes will go away when it lands and I rebase.

I also split code into a .h and .cpp file for better code organization.

@teng-li @pietern @apaszke

@ezyang
Copy link
Contributor

ezyang commented Jul 25, 2018

@pytorchbot retest this please

@weiyangfb
Copy link
Contributor

cc @teng-li would you like to review this PR too?

@teng-li
Copy link
Contributor

teng-li commented Jul 31, 2018

@goldsborough We do have a test covering this change right?

@goldsborough
Copy link
Contributor Author

@teng-li I ran the c10d DDP test that I wrote. Should we also write some unit tests for these individual functions?

@goldsborough goldsborough force-pushed the ddp-sync-params branch 3 times, most recently from e115f43 to 3523820 Compare August 6, 2018 21:33
Copy link
Contributor

@teng-li teng-li left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Look great, thank you for adding those test cases.

Please see my comments.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

This comment was marked as off-topic.

@goldsborough
Copy link
Contributor Author

@apaszke @ezyang @soumith looking for codeowner approval

@goldsborough
Copy link
Contributor Author

@pytorchbot retest this please

@goldsborough goldsborough force-pushed the ddp-sync-params branch 2 times, most recently from c3851ae to c031d29 Compare August 7, 2018 22:03
@goldsborough
Copy link
Contributor Author

@pytorchbot retest this please

@soumith
Copy link
Contributor

soumith commented Aug 28, 2018

might possibly need a rebase. cc: @teng-li

@yf225
Copy link
Contributor

yf225 commented Aug 28, 2018

@pytorchbot retest this please

1 similar comment
@goldsborough
Copy link
Contributor Author

@pytorchbot retest this please

@goldsborough goldsborough force-pushed the ddp-sync-params branch 3 times, most recently from 21cd574 to 46bf4f1 Compare September 6, 2018 02:29
Rebase fixes

Rebase

Wrote unit test for dist_broadcast_coalesced

Wrote unit test for sync_params

Make the bucke size 20 bytes and increase test timeout to 10s

5 tensors per device

Increase timeout
Copy link
Contributor

@facebook-github-bot facebook-github-bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

goldsborough has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

PenghuiCheng pushed a commit to PenghuiCheng/pytorch that referenced this pull request Sep 11, 2018
Summary:
The next function I'm moving to C++ is `sync_params`. It is stacked on top of pytorch#9729, so some changes will go away when it lands and I rebase.

I also split code into a `.h` and `.cpp` file for better code organization.

The controller you requested could not be found. pietern apaszke
Pull Request resolved: pytorch#9805

Differential Revision: D9688604

Pulled By: goldsborough

fbshipit-source-id: 4467104d3f9e2354425503b9e4edbd59603e20a8
@ezyang ezyang added the merged label Jun 26, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants