Skip to content

Conversation

@jithunnair-amd
Copy link
Collaborator

No description provided.

@iotamudelta iotamudelta requested review from bddppq and ezyang October 31, 2019 21:42
@iotamudelta iotamudelta added module: rocm AMD GPU support for Pytorch open source labels Oct 31, 2019
@ezyang
Copy link
Contributor

ezyang commented Nov 1, 2019

@pytorchbot rebase this please

@jithunnair-amd
Copy link
Collaborator Author

@ezyang I believe this PR is ready to be merged. Can you please take a look?

Copy link
Contributor

@facebook-github-bot facebook-github-bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@ezyang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

@ezyang merged this pull request in 7073ee2.

@jithunnair-amd jithunnair-amd deleted the enable_test_distributed_nccl_backend branch February 7, 2020 15:23
facebook-github-bot pushed a commit that referenced this pull request Feb 10, 2020
…32551)

Summary:
This is a redux of the original PR #28814 which was reverted in PR #29736 due to test_DistributedDataParallel being suspected as being flaky. Further investigation revealed it wasn't flakiness, but a bug in the PyTorch source code which has been now fixed in PR #32356. This PR is another attempt at enabling the test_distributed unit test suite only for the nccl backend.
Pull Request resolved: #32551

Differential Revision: D19729966

Pulled By: bddppq

fbshipit-source-id: 12a0d850991a903cc7723d63693b6157071d7115
ttumiel pushed a commit to ttumiel/pytorch that referenced this pull request Mar 4, 2020
…ytorch#32551)

Summary:
This is a redux of the original PR pytorch#28814 which was reverted in PR pytorch#29736 due to test_DistributedDataParallel being suspected as being flaky. Further investigation revealed it wasn't flakiness, but a bug in the PyTorch source code which has been now fixed in PR pytorch#32356. This PR is another attempt at enabling the test_distributed unit test suite only for the nccl backend.
Pull Request resolved: pytorch#32551

Differential Revision: D19729966

Pulled By: bddppq

fbshipit-source-id: 12a0d850991a903cc7723d63693b6157071d7115
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Merged module: rocm AMD GPU support for Pytorch open source

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants