Enable distributed package on windows, Gloo backend supported only #42897
gunandrose4u wants to merge 67 commits into pytorch:master
💊 CI failures summary and remediations
As of commit 8404d1e (more details on the Dr. CI page):
1 failure confirmed as flaky and can be ignored:
Extra GitHub checks: 1 failed
codecov.io: 1 failed
This comment was automatically generated by Dr. CI.
We might need to add NOT MSVC to the USE_NCCL flag as well. It looks like NCCL does not officially support Windows yet: https://forums.developer.nvidia.com/t/is-there-a-nccl-2-x-for-windows/55659/3
Yes, we should add one.
Curious: I don't see any special handling for NCCL, yet there are no test failures for the NCCL backend. Is that because NCCL now works on Windows, or because those tests were never actually triggered?
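To make the gating discussed in this thread concrete, here is a minimal sketch of how a backend choice and an NCCL test skip could be keyed on the platform. `pick_backend` and `should_skip_nccl_tests` are hypothetical helpers written for illustration; they are not part of `torch.distributed` or of this PR, and the `nccl_built` flag is an assumed input.

```python
import sys


def pick_backend(platform: str = sys.platform, nccl_built: bool = False) -> str:
    """Return the backend name to request on the given platform.

    Hypothetical helper for illustration only; not a PyTorch API.
    """
    if platform.startswith("win"):
        # Per this PR, Gloo is the only backend supported on Windows.
        return "gloo"
    # Assumption: elsewhere, prefer NCCL when the build includes it.
    return "nccl" if nccl_built else "gloo"


def should_skip_nccl_tests(platform: str = sys.platform, nccl_built: bool = False) -> bool:
    """Return True when NCCL tests should not run (hypothetical helper)."""
    return platform.startswith("win") or not nccl_built


if __name__ == "__main__":
    print(pick_backend("win32", nccl_built=True))
    print(should_skip_nccl_tests("win32", nccl_built=True))
```

In a real test suite the `nccl_built` value would likely come from something like `torch.distributed.is_nccl_available()`. If that returns False on Windows builds, NCCL tests guarded this way would be skipped rather than fail, which may be why no failures show up.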
Update my fork from Pytorch repo
f5d9c76 to e8e8556
Rebase from pytorch/pytorch master
This reverts commit defd3ac.
47ed8fa to 8404d1e
Merged again by PR f07ac6a
Fixes #42095
The test cases will be committed to this PR later.
@mrshenli, please help review.