[Inductor XPU GEMM] Step 7/N: Refactor CUDABenchmarkRequest#160729
[Inductor XPU GEMM] Step 7/N: Refactor CUDABenchmarkRequest#160729etaf wants to merge 41 commits intogh/etaf/161/basefrom
Conversation
[ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/160729
Note: Links to docs will display an error until the docs builds have been completed. ✅ You can merge normally! (5 Unrelated Failures)As of commit 67fc2df with merge base 98a4d7b ( FLAKY - The following jobs failed but were likely due to flakiness present on trunk:
UNSTABLE - The following jobs are marked as unstable, possibly due to flakiness on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov coconutruben [ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov coconutruben [ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov coconutruben [ghstack-poisoned]
This PR is part of #160175. It refactors the CUDA-specific code in CUDABenchmarkRequest, renaming it to CUTLASSBenchmarkRequest so that it can be reused for XPU. [ghstack-poisoned]
This PR is part of #160175. It refactors the CUDA-specific code in CUDABenchmarkRequest, renaming it to CUTLASSBenchmarkRequest so that it can be reused for XPU. [ghstack-poisoned]
This PR is part of #160175. It refactors the CUDA-specific code in CUDABenchmarkRequest, renaming it to CUTLASSBenchmarkRequest so that it can be reused for XPU. [ghstack-poisoned]
This PR is part of #160175. It refactors the CUDA-specific code in CUDABenchmarkRequest, renaming it to CUTLASSBenchmarkRequest so that it can be reused for XPU. [ghstack-poisoned]
This PR is part of #160175. It refactors the CUDA-specific code in CUDABenchmarkRequest, renaming it to CUTLASSBenchmarkRequest so that it can be reused for XPU. [ghstack-poisoned]
This PR is part of #160175. It refactors the CUDA-specific code in CUDABenchmarkRequest, renaming it to CUTLASSBenchmarkRequest so that it can be reused for XPU. [ghstack-poisoned]
This PR is part of #160175. It refactors the CUDA-specific code in CUDABenchmarkRequest, renaming it to CUTLASSBenchmarkRequest so that it can be reused for XPU. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy chenyang78 kadeng muchulee8 amjames chauhang aakhundov coconutruben [ghstack-poisoned]
|
@pytorchbot drci |
|
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
|
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
|
@pytorchbot drci |
This PR is part of #160175. It refactors the CUDA-specific code in CUDABenchmarkRequest, renaming it to CUTLASSBenchmarkRequest so that it can be reused for XPU. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy kadeng muchulee8 amjames chauhang aakhundov coconutruben jataylo chenyang78 [ghstack-poisoned]
This PR is part of #160175. It refactors the CUDA-specific code in CUDABenchmarkRequest, renaming it to CUTLASSBenchmarkRequest so that it can be reused for XPU. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy kadeng muchulee8 amjames chauhang aakhundov coconutruben jataylo chenyang78 [ghstack-poisoned]
|
@pytorchbot drci |
|
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
This PR is part of #160175. It refactors the CUDA-specific code in CUDABenchmarkRequest, renaming it to CUTLASSBenchmarkRequest so that it can be reused for XPU. cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy kadeng muchulee8 amjames chauhang aakhundov coconutruben jataylo chenyang78 [ghstack-poisoned]
ghstack-source-id: 7d6f9f1 Pull Request resolved: pytorch#160729
Stack from ghstack (oldest at bottom):
This PR is part of #160175. It refactors the CUDA-specific code in CUDABenchmarkRequest, renaming it to CUTLASSBenchmarkRequest so that it can be reused for XPU.
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @jataylo @chenyang78