Skip to content

[Inductor XPU GEMM] Step 5/N: Refactor CUDACombinedScheduling and CUDACppScheduling.#160688

Closed
etaf wants to merge 42 commits intogh/etaf/159/basefrom
gh/etaf/159/head
Closed

[Inductor XPU GEMM] Step 5/N: Refactor CUDACombinedScheduling and CUDACppScheduling.#160688
etaf wants to merge 42 commits intogh/etaf/159/basefrom
gh/etaf/159/head

Conversation

@pytorch-bot
Copy link

pytorch-bot bot commented Aug 14, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/160688

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (9 Unrelated Failures)

As of commit 52c90e0 with merge base a2352cc (image):

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…ling and CUDACppScheduling."

cc jeffdaily sunway513 jithunnair-amd pruthvistony ROCmSupport dllehr-amd jataylo hongxiayang naromero77amd voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov coconutruben

[ghstack-poisoned]
etaf added a commit that referenced this pull request Aug 14, 2025
…DACppScheduling.

ghstack-source-id: 5a58c75
Pull Request resolved: #160688
@etaf etaf changed the title [Inductor XPU GEMM] Step 6/N: Refactor CUDACombinedScheduling and CUDACppScheduling. [Inductor XPU GEMM] Step 6/N: Refactor CUDACombinedScheduling and CUDACppScheduling. Aug 14, 2025
…ing and CUDACppScheduling."

cc jeffdaily sunway513 jithunnair-amd pruthvistony ROCmSupport dllehr-amd jataylo hongxiayang naromero77amd voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov coconutruben

[ghstack-poisoned]
etaf added 3 commits August 15, 2025 02:57
…ing and CUDACppScheduling."

cc jeffdaily sunway513 jithunnair-amd pruthvistony ROCmSupport dllehr-amd jataylo hongxiayang naromero77amd voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov coconutruben

[ghstack-poisoned]
…ing and CUDACppScheduling."

cc jeffdaily sunway513 jithunnair-amd pruthvistony ROCmSupport dllehr-amd jataylo hongxiayang naromero77amd voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov coconutruben

[ghstack-poisoned]
…ing and CUDACppScheduling."






[ghstack-poisoned]
…ing and CUDACppScheduling."






[ghstack-poisoned]
@etaf etaf changed the title [Inductor XPU GEMM] Step 6/N: Refactor CUDACombinedScheduling and CUDACppScheduling. [Inductor XPU GEMM] Step 5/N: Refactor CUDACombinedScheduling and CUDACppScheduling. Sep 2, 2025
…ing and CUDACppScheduling."






[ghstack-poisoned]
@etaf etaf added the topic: not user facing topic category label Sep 3, 2025
@etaf etaf marked this pull request as ready for review September 3, 2025 01:07
etaf added 2 commits December 2, 2025 00:31
…ing and CUDACppScheduling."



Following the design of #160175, This PR renamed `CUDACppScheduling` to `CUTLASSScheduling`  so that they can be reused for XPU.

[ghstack-poisoned]
…ing and CUDACppScheduling."



Following the design of #160175, This PR renamed `CUDACppScheduling` to `CUTLASSScheduling`  so that they can be reused for XPU.

[ghstack-poisoned]
tiendatngcs pushed a commit to tiendatngcs/pytorch-Dec25 that referenced this pull request Dec 10, 2025
etaf added 3 commits January 7, 2026 02:37
…ing and CUDACppScheduling."



Following the design of #160175, This PR renamed `CUDACppScheduling` to `CUTLASSScheduling`  so that they can be reused for XPU.

[ghstack-poisoned]
…ing and CUDACppScheduling."



Following the design of #160175, This PR renamed `CUDACppScheduling` to `CUTLASSScheduling`  so that they can be reused for XPU.

[ghstack-poisoned]
…ing and CUDACppScheduling."



Following the design of #160175, This PR renamed `CUDACppScheduling` to `CUTLASSScheduling`  so that they can be reused for XPU.

[ghstack-poisoned]
etaf added a commit to etaf/pytorch-inductor-xpu that referenced this pull request Jan 12, 2026
…DACppScheduling.

ghstack-source-id: c31c1da
Pull Request resolved: pytorch#160688
[ghstack-poisoned]
etaf added a commit to etaf/pytorch-inductor-xpu that referenced this pull request Jan 16, 2026
…DACppScheduling.

ghstack-source-id: c4ed8ec
Pull Request resolved: pytorch#160688
SergeyTyshkevich pushed a commit to SergeyTyshkevich/chart2 that referenced this pull request Jan 19, 2026
etaf added 3 commits January 26, 2026 17:37
…ing and CUDACppScheduling."



Following the design of #160175, This PR renamed `CUDACppScheduling` to `CUTLASSScheduling`  so that they can be reused for XPU.

[ghstack-poisoned]
…ing and CUDACppScheduling."



Following the design of #160175, This PR renamed `CUDACppScheduling` to `CUTLASSScheduling`  so that they can be reused for XPU.

[ghstack-poisoned]
…ing and CUDACppScheduling."



Following the design of #160175, This PR renamed `CUDACppScheduling` to `CUTLASSScheduling`  so that they can be reused for XPU.

[ghstack-poisoned]
etaf added a commit to etaf/pytorch-inductor-xpu that referenced this pull request Jan 29, 2026
…DACppScheduling.

ghstack-source-id: cf1b799
Pull Request resolved: pytorch#160688
…ing and CUDACppScheduling."



Following the design of #160175, This PR renamed `CUDACppScheduling` to `CUTLASSScheduling`  so that they can be reused for XPU.

[ghstack-poisoned]
etaf added a commit to etaf/pytorch-inductor-xpu that referenced this pull request Feb 11, 2026
…DACppScheduling.

ghstack-source-id: c659cce
Pull Request resolved: pytorch#160688
@etaf etaf added the ciflow/trunk Trigger trunk jobs on your pull request label Feb 11, 2026
…ing and CUDACppScheduling."



Following the design of #160175, This PR renamed `CUDACppScheduling` to `CUTLASSScheduling`  so that they can be reused for XPU.

[ghstack-poisoned]
etaf added a commit to etaf/pytorch-inductor-xpu that referenced this pull request Feb 12, 2026
…DACppScheduling.

ghstack-source-id: 320c7bd
Pull Request resolved: pytorch#160688
@etaf
Copy link
Collaborator Author

etaf commented Feb 12, 2026

@pytorchbot merge

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

…ing and CUDACppScheduling."



Following the design of #160175, This PR renamed `CUDACppScheduling` to `CUTLASSScheduling`  so that they can be reused for XPU.

cc jeffdaily sunway513 jithunnair-amd pruthvistony ROCmSupport jataylo hongxiayang naromero77amd pragupta jerrymannil xinyazhang voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy kadeng muchulee8 amjames chauhang aakhundov coconutruben

[ghstack-poisoned]
pytorchmergebot pushed a commit that referenced this pull request Feb 12, 2026
This PR is part of #160175. It extracts the CUDA-independent functionality from `CUDACodeCache` into `CUTLASSCodeCache`, which `CUDACodeCache` then inherits and extends with CUDA-specific logic. This design allows `CUTLASSCodeCache` to be reused by XPU as well. In addition, CUDA compilation logic has been moved into torch/_inductor/codegen/cuda/compile_utils.py, making codecache.py cleaner.

Pull Request resolved: #160706
Approved by: https://github.com/EikanWang, https://github.com/mlazos
ghstack dependencies: #160688
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/inductor ciflow/trunk Trigger trunk jobs on your pull request Merged module: inductor module: rocm AMD GPU support for Pytorch open source topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants