[c10d] Make reduce as a custom op#79686
[c10d] Make reduce as a custom op#79686alanwaketan wants to merge 2 commits intogh/alanwaketan/37/basefrom
Conversation
Summary: This patch makes reduce as a custom op such that it's dispatcher passable. It's one part of the effort to route comm ops to the dispatcher such that tracing mechanisms that relies on the dispatcher can trace them, e.g., LazyTensor and AOTAutograd. Test Plan: python test/distributed/test_c10d_nccl.py -k test_reduce_ops python test/distributed/test_c10d_gloo.py -k test_reduce_basics ...and other existing distributed tests. [ghstack-poisoned]
🔗 Helpful links
✅ No Failures (0 Pending)As of commit 38f90ce (more details on the Dr. CI page): Expand to see more💚 💚 Looks good so far! There are no failures yet. 💚 💚 This comment was automatically generated by Dr. CI (expand for details).Please report bugs/suggestions to the (internal) Dr. CI Users group. |
Summary: This patch makes reduce as a custom op such that it's dispatcher passable. It's one part of the effort to route comm ops to the dispatcher such that tracing mechanisms that relies on the dispatcher can trace them, e.g., LazyTensor and AOTAutograd. Test Plan: python test/distributed/test_c10d_nccl.py -k test_reduce_ops python test/distributed/test_c10d_gloo.py -k test_reduce_basics ...and other existing distributed tests. ghstack-source-id: 6386217 Pull Request resolved: #79686
Summary: This patch makes reduce as a custom op such that it's dispatcher passable. It's one part of the effort to route comm ops to the dispatcher such that tracing mechanisms that relies on the dispatcher can trace them, e.g., LazyTensor and AOTAutograd. Test Plan: python test/distributed/test_c10d_nccl.py -k test_reduce_ops python test/distributed/test_c10d_gloo.py -k test_reduce_basics ...and other existing distributed tests. [ghstack-poisoned]
| .def( | ||
| "reduce", | ||
| [](::c10d::ProcessGroup& pg, | ||
| [](const c10::intrusive_ptr<::c10d::ProcessGroup>& pg, |
|
@pytorchbot merge |
|
@pytorchbot successfully started a merge job. Check the current status here |
|
Hey @alanwaketan. |
Summary: This patch makes reduce as a custom op such that it's dispatcher passable. It's one part of the effort to route comm ops to the dispatcher such that tracing mechanisms that relies on the dispatcher can trace them, e.g., LazyTensor and AOTAutograd. Pull Request resolved: #79686 Approved by: https://github.com/mrshenli Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/3367e632b2613bfd26bee8b8d74951b791687346 Test plan from GitHub: python test/distributed/test_c10d_nccl.py -k test_reduce_ops python test/distributed/test_c10d_gloo.py -k test_reduce_basics ...and other existing distributed tests. Reviewed By: atalman Differential Revision: D37455697 Pulled By: alanwaketan fbshipit-source-id: a0167adecf6d044c0fa503a3fd5379b169ce0c63
Stack from ghstack (oldest at bottom):
Summary:
This patch makes reduce as a custom op such that it's dispatcher
passable. It's one part of the effort to route comm ops to the dispatcher
such that tracing mechanisms that relies on the dispatcher can trace them,
e.g., LazyTensor and AOTAutograd.
Test Plan:
python test/distributed/test_c10d_nccl.py -k test_reduce_ops
python test/distributed/test_c10d_gloo.py -k test_reduce_basics
...and other existing distributed tests.