@shunting314 shunting314 commented Mar 3, 2025

Stack from ghstack (oldest at bottom):

Fix #148356

This is a short-term fix that restores the default behavior of applying a layout constraint to custom ops when they have no layout tags.

A longer-term attempt to make sure Inductor always gets the correct eager strides is here: #148104

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov
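To make the intent concrete, here is a minimal sketch of the decision this PR restores, including the backward-only refinement for ATen implicit fallbacks discussed later in this thread. This is not Inductor's actual code: plain strings stand in for the real `torch._C.Tag` values, and the function name and return values are illustrative only.

```python
# Hypothetical sketch of the layout-constraint decision this PR restores.
# Plain strings stand in for torch._C.Tag values; names are illustrative,
# not Inductor's actual API.

def decide_layout_constraint(tags, is_builtin=False, is_backward=False):
    """Pick a layout constraint for an op that falls back to eager."""
    if "needs_fixed_stride_order" in tags:
        return "constrain_to_fx_strides"  # op declared it needs eager strides
    if "flexible_layout" in tags:
        return "flexible_layout"          # Inductor may choose any layout
    if is_builtin:
        # ATen implicit fallback: forward ops tolerate any strides, so the
        # constraint only matters for the backward graph.
        return "require_contiguous" if is_backward else "flexible_layout"
    # Custom op with no tags: apply the default constraint (the behavior
    # this PR recovers) instead of leaving the layout unconstrained.
    return "constrain_to_fx_strides"

# An untagged custom op gets the default constraint back:
print(decide_layout_constraint(frozenset()))  # constrain_to_fx_strides
```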


pytorch-bot bot commented Mar 3, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/148367

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit bf76d44 with merge base 1c544a9:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

shunting314 added a commit that referenced this pull request Mar 4, 2025
ghstack-source-id: 3d2340e
Pull Request resolved: #148367
@shunting314 shunting314 requested a review from zou3519 March 4, 2025 00:09

@eellison eellison left a comment


I would kind of rather we fix the relevant weight norm kernels than land this PR, which is a not-fully-featured version of what Richard was doing in the other PR.

Giving ATen kernels this extra stride constraint is bad for things like channels-last optimization.

@shunting314

> I would kind of rather we fix the relevant weight norm kernels than land this PR, which is a not-fully-featured version of what Richard was doing in the other PR.

But this is a general problem, not just for weight norm. Other ATen ops may have layout constraints as well, and Inductor does not know about them if the op is an implicit fallback.

> Giving ATen kernels this extra stride constraint is bad for things like channels-last optimization.

This only affects implicit fallbacks. For explicitly registered ATen kernels like convolution, we still have full control of the input layouts.

This PR only brings back the behavior for custom ops. The behavior for implicit fallbacks of ATen ops was already added last November.

@eellison

eellison commented Mar 4, 2025

@shunting314 I don't think this is a general problem for ATen ops. The error is only for specific backward ops, which assume as an invariant that the forward has already applied the layout constraints.

@shunting314

> I don't think this is a general problem for ATen ops. The error is only for specific backward ops, which assume as an invariant that the forward has already applied the layout constraints.

Yeah, that's what I mean by 'general'. New ops like this could be added in the future, right?

@eellison

eellison commented Mar 4, 2025

@shunting314 we are not adding new core ATen ops at any fast rate, especially ones with backward passes and unchecked invariants around striding.

@shunting314

Chatted with @eellison offline. The default layout constraint for implicit-fallback ATen ops is now only applied to the backward graph, since forward ATen ops should work with any strides.

```python
# For ATen ops, only apply the constraint for backward
# ops since fwd ops should work for any strides.
if torch._library.utils.is_builtin(target) and self.is_backward:
    decided_constraint = require_contiguous  # type: ignore[assignment]
```
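For illustration, here is a minimal sketch of what a `require_contiguous` constraint amounts to in practice: every tensor argument of the fallback op is forced into contiguous layout before the eager kernel runs. The helper name is hypothetical; only `torch.Tensor.contiguous()` is a real API here.

```python
import torch

# Illustrative only: apply_require_contiguous is a hypothetical helper, not
# Inductor's implementation. It shows the effect of a require_contiguous
# constraint: tensor inputs are made contiguous before the fallback runs.
def apply_require_contiguous(args):
    return tuple(
        a.contiguous() if isinstance(a, torch.Tensor) else a
        for a in args
    )

# A transposed tensor is non-contiguous; the constraint copies it.
t = torch.arange(6).reshape(2, 3).t()
fixed, scale = apply_require_contiguous((t, 1.5))
print(t.is_contiguous(), fixed.is_contiguous())  # False True
```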

Hmm, I guess in the future we could change this to matching the FX strides.


Do you mean `needs_exact_strides`?

shunting314 added a commit that referenced this pull request Mar 4, 2025
ghstack-source-id: cbfc7e7
Pull Request resolved: #148367
@shunting314

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Mar 5, 2025
@pytorchmergebot

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@pytorchmergebot

Merge failed

Reason: 1 jobs have failed, first few of them are: linux-binary-manywheel / manywheel-py3_9-cuda12_6-build / build


@shunting314

@pytorchbot merge -i

@pytorchmergebot

Merge started

Your change will be merged while ignoring the following 1 checks: linux-binary-manywheel / manywheel-py3_9-cuda12_6-build / build

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@github-actions github-actions bot deleted the gh/shunting314/199/head branch April 11, 2025 02:30