-
Notifications
You must be signed in to change notification settings - Fork 26.3k
Fix ir._WaitKernel #137401
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix ir._WaitKernel #137401
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/137401
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit 391c985 with merge base 4830bd0 ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
| @fresh_inductor_cache() | ||
| def test_wait_tensor_temp(self): | ||
| def func(arg: torch.Tensor) -> torch.Tensor: | ||
| return funcol.wait_tensor(arg) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I believe in the current unit test, arg is a normal torch.Tensor and calling .wait_tensor() on it has unclear semantics. Would it work if we pass an AsyncCollectiveTensor into this function (e.g. passing in the output of funcol.all_reduce(buf0, "avg", "0") from eager mode)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Discussed offline. I am landing this now to unblock #136534. @benjaminglass1 please follow up on the unit test improvement.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@yf225 @desertfire See #137692
|
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Summary: #137401 didn't fix the underlying inplace output issue. [ghstack-poisoned]
Summary: #137401 didn't fix the underlying inplace output issue. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames chauhang [ghstack-poisoned]
Summary: #137401 didn't fix the underlying inplace output issue. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames chauhang [ghstack-poisoned]
…tor" Summary: #137401 didn't fix the underlying inplace output issue. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames chauhang [ghstack-poisoned]
Summary: #137401 didn't fix the underlying inplace output issue. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames chauhang [ghstack-poisoned]
…tor" Summary: #137401 didn't fix the underlying inplace output issue. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames chauhang [ghstack-poisoned]
Summary: #137401 didn't fix the underlying inplace output issue. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames chauhang [ghstack-poisoned]
Summary: #137401 didn't fix the underlying inplace output issue. Pull Request resolved: #137660 Approved by: https://github.com/chenyang78
…tor" Summary: #137401 didn't fix the underlying inplace output issue. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames chauhang [ghstack-poisoned]
Summary: #137401 didn't fix the underlying inplace output issue. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames chauhang [ghstack-poisoned]
Summary: #137401 didn't fix the underlying inplace output issue. Pull Request resolved: #137660 Approved by: https://github.com/chenyang78
In ABI-compatible mode, AOTInductor could not compile _WaitKernel due to an incorrect outputs list. Add the correct set of outputs, as done in ir._CollectiveKernel.create_out_of_place. ghstack-source-id: 696daaa Pull Request resolved: pytorch/pytorch#137401
Summary: pytorch/pytorch#137401 didn't fix the underlying inplace output issue. ghstack-source-id: 93cc076 Pull Request resolved: pytorch/pytorch#137660
Stack from ghstack (oldest at bottom):
In ABI-compatible mode, AOTInductor could not compile _WaitKernel due to
an incorrect outputs list. Add the correct set of outputs, as done in
ir._CollectiveKernel.create_out_of_place.
cc @XilunWu @H-Huang @awgu @kwen2501 @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @c-p-i-o @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang