[AOTI] Fix an autotune block grid computation issue #143098
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/143098
Note: Links to docs will display an error until the doc builds have completed.
✅ You can merge normally! (1 Unrelated Failure)
As of commit 8eacd41 with merge base 84f7913:
BROKEN TRUNK - The following job failed but was already failing on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D67120987
Force-pushed from 492b7b4 to 8eacd41
@pytorchbot merge (Initiating merge automatically since Phabricator Diff has merged)
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Summary: There is a grid computation issue after switching to one-pass codegen in pytorch#141980. When max-autotune is turned on, the generated grid is incorrect in some cases.
Reviewed By: henrylhtsang
Differential Revision: D67120987
Pull Request resolved: pytorch#143098
Approved by: https://github.com/henrylhtsang
@desertfire I don't know if this is connected. With this PR, compile + AOTI export with autotuning now works for me, but I still hit this issue:

E1216 site-packages/torch/_inductor/select_algorithm.py:1756] [0/0] Exception out of resource: shared memory, Required: 131072, Hardware limit: 101376. Reducing block sizes or `num_stages` may help. for benchmark choice TritonTemplateCaller(/tmp/torchinductor_root/tc/.....py, ACC_TYPE='tl.float32', ALLOW_TF32=True, BLOCK_K=64, BLOCK_M=128, BLOCK_N=128, B_PROLOGUE_CAST_TYPE=None, EVEN_K=True, GROUP_M=8, num_stages=5, num_warps=8)
W1216 site-packages/torch/_inductor/select_algorithm.py:1997] [0/0] out of resource: shared memory, Required: 131072, Hardware limit: 101376. Reducing block sizes or `num_stages` may help.
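For reference, a rough back-of-the-envelope estimate of why this choice overflows shared memory. This is my own approximation, not something from the log: `smem_estimate` and `num_buffers` are hypothetical names, and the real number depends on Triton's pipelining details.

```python
def smem_estimate(block_m: int, block_n: int, block_k: int,
                  dtype_size: int, num_buffers: int) -> int:
    # Each pipelined buffer holds one A tile (BLOCK_M x BLOCK_K) and one
    # B tile (BLOCK_K x BLOCK_N) in shared memory.
    return (block_m * block_k + block_k * block_n) * dtype_size * num_buffers

# Failing choice from the log: BLOCK_M=128, BLOCK_N=128, BLOCK_K=64.
# Assuming fp32 tiles (dtype_size=4) and double buffering, the estimate
# already matches the reported requirement of 131072 bytes, well over the
# 101376-byte hardware limit -- hence "reducing block sizes or num_stages
# may help".
print(smem_estimate(128, 128, 64, dtype_size=4, num_buffers=2))  # 131072
```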
Summary: There is a grid computation issue after switching to one-pass codegen in #141980. When max-autotune is turned on, the generated grid is incorrect in some cases.
Reviewed By: henrylhtsang
Differential Revision: D67120987
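For context, the launch grid of a Triton GEMM template is a function of the autotuned block sizes, so the generated grid computation has to use the block constants of the config that actually won autotuning. Below is a minimal sketch of that computation (illustrative names and shapes, not the exact Inductor helper):

```python
def cdiv(a: int, b: int) -> int:
    # Ceiling division: number of blocks needed to cover `a` elements.
    return (a + b - 1) // b

def mm_grid(m: int, n: int, meta: dict) -> tuple:
    # One Triton program per (BLOCK_M x BLOCK_N) output tile, flattened
    # into a 1D grid; the K dimension is looped over inside the kernel.
    return (cdiv(m, meta["BLOCK_M"]) * cdiv(n, meta["BLOCK_N"]), 1, 1)

# With the block sizes from the log above (BLOCK_M=128, BLOCK_N=128) and a
# hypothetical 512x512 output, the grid is (16, 1, 1). Pairing the grid
# with the wrong config's block sizes launches the wrong number of programs.
print(mm_grid(512, 512, {"BLOCK_M": 128, "BLOCK_N": 128}))  # (16, 1, 1)
```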
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @chauhang @aakhundov