[invoke_subgraph] Do not cache fake tensors for AOTDispatcher first pass #150450
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/150450
Note: Links to docs will display an error until the docs builds have been completed.
✅ You can merge normally! (3 Unrelated Failures)
As of commit 513c1a2 with merge base 15dbad2:
BROKEN TRUNK - The following job failed but was present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures
UNSTABLE - The following jobs are marked as unstable, possibly due to flakiness on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
    # AOTDispatcher first pass does not run make_fx on
    # dynamo graphs. As a result, it can have non OpOverload
    # ops.
    if not isinstance(op, torch._ops.OpOverload):
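To illustrate the guard in the hunk above, here is a minimal, torch-free sketch. `StandInOpOverload` and `should_skip_cache` are hypothetical names that do not exist in PyTorch, and the sketch assumes (the hunk does not show the body of the `if`) that non-`OpOverload` targets are simply excluded from caching.

```python
import operator


class StandInOpOverload:
    """Hypothetical stand-in for torch._ops.OpOverload; not the real class."""

    def __init__(self, name: str) -> None:
        self.name = name


def should_skip_cache(op: object) -> bool:
    # Assumption: anything that is not an OpOverload is excluded from caching.
    # The real body of the `if` is not shown in the hunk above.
    return not isinstance(op, StandInOpOverload)


# A graph that was not run through make_fx can still carry plain Python callables.
targets = [operator.add, StandInOpOverload("aten.add.Tensor")]
skipped = [should_skip_cache(t) for t in targets]
print(skipped)
```

Here `operator.add` plays the role of the non-`OpOverload` op mentioned in the code comment, and the stand-in object plays the role of an op that has already been lowered to an overload.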
This looks fine to me but I'm not sure if there are any other problems from doing so, so will leave it to Richard for the stamp :3
zou3519
left a comment
Seems better than the previous state. But I think we should be tracing through the inside of the HOP subgraph so that the operator.add call gets desugared into an aten.add call, instead of manually iterating through the nodes and calling _validate_cache_key on each node. Does that make sense?
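The desugaring described in this comment can be seen in a small sketch. It uses `torch.fx.symbolic_trace` as a stand-in for a Dynamo-produced graph (an assumption made for brevity; Dynamo itself is not invoked here) and `make_fx` for the traced form. The function `f` and the helper `call_targets` are hypothetical and not part of this PR.

```python
import operator

import torch
from torch.fx import symbolic_trace
from torch.fx.experimental.proxy_tensor import make_fx


def f(x, y):
    # Hypothetical subgraph body that uses a Python-level operator.
    return operator.add(x, y)


def call_targets(gm):
    # Collect the targets of all call_function nodes in an fx.GraphModule.
    return [n.target for n in gm.graph.nodes if n.op == "call_function"]


# Symbolic tracing records the Python callable itself as the node target.
python_level = call_targets(symbolic_trace(f))

# make_fx traces through the dispatcher, so the same call becomes an ATen overload.
aten_level = call_targets(make_fx(f)(torch.ones(2), torch.ones(2)))

print(python_level)
print(aten_level)
```

In the first graph the target is `operator.add`, which is not a `torch._ops.OpOverload`; in the second it is an `aten.add` overload, which is. That difference is why tracing through the subgraph would make the per-node `isinstance` check unnecessary.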
I plan to work on this. Until then, it would be good to land this to make trunk healthy, since Lazos, Angela, and I have been working with invoke_subgraph recently.
zou3519
left a comment
sgtm
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
…alse (#150486) I am not sure if this is the right way. Pull Request resolved: #150486 Approved by: https://github.com/zou3519 ghstack dependencies: #150082, #150450
…t module (#150556) Pull Request resolved: #150556 Approved by: https://github.com/bdhirsh, https://github.com/zou3519 ghstack dependencies: #150082, #150450, #150486
…150561) I am unable to come up with a testcase. It passes many end-to-end tests that fail with ReshapeError at https://ossci-raw-job-status.s3.amazonaws.com/log/39717218372  Pull Request resolved: #150561 Approved by: https://github.com/zou3519, https://github.com/bdhirsh ghstack dependencies: #150082, #150450, #150486, #150556
…ass (pytorch#150450) Pull Request resolved: pytorch#150450 Approved by: https://github.com/zou3519 ghstack dependencies: pytorch#150082
…alse (pytorch#150486) I am not sure if this is the right way. Pull Request resolved: pytorch#150486 Approved by: https://github.com/zou3519 ghstack dependencies: pytorch#150082, pytorch#150450
…t module (pytorch#150556) Pull Request resolved: pytorch#150556 Approved by: https://github.com/bdhirsh, https://github.com/zou3519 ghstack dependencies: pytorch#150082, pytorch#150450, pytorch#150486
…ytorch#150561) I am unable to come up with a testcase. It passes many end-to-end tests that fail with ReshapeError at https://ossci-raw-job-status.s3.amazonaws.com/log/39717218372  Pull Request resolved: pytorch#150561 Approved by: https://github.com/zou3519, https://github.com/bdhirsh ghstack dependencies: pytorch#150082, pytorch#150450, pytorch#150486, pytorch#150556
ghstack-source-id: fd95d6d Pull Request resolved: pytorch/pytorch#150450
Stack from ghstack (oldest at bottom):