Skip to content

Conversation

@sanchitintel
Copy link
Collaborator

@sanchitintel sanchitintel commented Nov 6, 2024

@frost-intel discovered that some Inductor auto-tuning UTs for CPU are currently broken on machines supporting AMX ISA. That's because in #136688, I had reverted a change in the AMX GEMM micro-kernel that was introduced in #131887, but it looks like some other implementations introduced after the aforementioned change rely upon it, so it should not have been reverted.

Added a fix.

Ideally, a CI machine that supports AMX should cover these UTs (test/inductor/test_cpu_select_algorithm.py). We do have at least one CI machines that support AMX.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov

… of dequantized B

Some Inductor auto-tuning UTs for CPU are currently broken on machines supporting AMX because I had reverted a change in the AMX micro-kernel, but it looks like some other implementations also rely upon that change now, so it should not have been reverted.

Adding a workaround. 

Ideally, a CI machine that supports AMX should cover these UTs (test/inductor/test_cpu_select_algorithm.py).
@pytorch-bot
Copy link

pytorch-bot bot commented Nov 6, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/139906

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 8cd5bdd with merge base 99deedf (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@sanchitintel
Copy link
Collaborator Author

sanchitintel commented Nov 6, 2024

Hi @chuanqi129 @WeizhuoZhang-intel, can we add test/inductor/test_cpu_select_algorithm.py to the test-plan of the CI machine that supports AMX? These UTs seem to be a bit long-running (~1 hour), so perhaps more suited for a trunk CI job? Thanks!

@sanchitintel sanchitintel added the topic: not user facing topic category label Nov 6, 2024
@sanchitintel sanchitintel requested a review from jgong5 November 6, 2024 18:28
@sanchitintel sanchitintel changed the title In AMX GEMM micro-kernel, use same dtype for A & B only when B is dequantized In AMX GEMM micro-kernel, use same dtype for A & B only if B is dequantized Nov 6, 2024
@sanchitintel
Copy link
Collaborator Author

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 7, 2024
@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

atalman pushed a commit to atalman/pytorch that referenced this pull request Nov 11, 2024
…ntized (pytorch#139906)

@frost-intel discovered that some Inductor auto-tuning UTs for CPU are currently broken on machines supporting AMX ISA. That's because in pytorch#136688, I had reverted a change in the AMX GEMM micro-kernel that was introduced in pytorch#131887, but it looks like some other implementations introduced after the aforementioned change rely upon it, so it should not have been reverted.

Added a fix.

Ideally, a CI machine that supports AMX should cover these UTs (test/inductor/test_cpu_select_algorithm.py). We do have at least one CI machines that support AMX.

Pull Request resolved: pytorch#139906
Approved by: https://github.com/leslie-fang-intel, https://github.com/jgong5
zero000064 pushed a commit to zero000064/pytorch that referenced this pull request Nov 14, 2024
…ntized (pytorch#139906)

@frost-intel discovered that some Inductor auto-tuning UTs for CPU are currently broken on machines supporting AMX ISA. That's because in pytorch#136688, I had reverted a change in the AMX GEMM micro-kernel that was introduced in pytorch#131887, but it looks like some other implementations introduced after the aforementioned change rely upon it, so it should not have been reverted.

Added a fix.

Ideally, a CI machine that supports AMX should cover these UTs (test/inductor/test_cpu_select_algorithm.py). We do have at least one CI machines that support AMX.

Pull Request resolved: pytorch#139906
Approved by: https://github.com/leslie-fang-intel, https://github.com/jgong5
Ryo-not-rio pushed a commit to Ryo-not-rio/pytorch that referenced this pull request Dec 2, 2024
…ntized (pytorch#139906)

@frost-intel discovered that some Inductor auto-tuning UTs for CPU are currently broken on machines supporting AMX ISA. That's because in pytorch#136688, I had reverted a change in the AMX GEMM micro-kernel that was introduced in pytorch#131887, but it looks like some other implementations introduced after the aforementioned change rely upon it, so it should not have been reverted.

Added a fix.

Ideally, a CI machine that supports AMX should cover these UTs (test/inductor/test_cpu_select_algorithm.py). We do have at least one CI machines that support AMX.

Pull Request resolved: pytorch#139906
Approved by: https://github.com/leslie-fang-intel, https://github.com/jgong5
pobin6 pushed a commit to pobin6/pytorch that referenced this pull request Dec 5, 2024
…ntized (pytorch#139906)

@frost-intel discovered that some Inductor auto-tuning UTs for CPU are currently broken on machines supporting AMX ISA. That's because in pytorch#136688, I had reverted a change in the AMX GEMM micro-kernel that was introduced in pytorch#131887, but it looks like some other implementations introduced after the aforementioned change rely upon it, so it should not have been reverted.

Added a fix.

Ideally, a CI machine that supports AMX should cover these UTs (test/inductor/test_cpu_select_algorithm.py). We do have at least one CI machines that support AMX.

Pull Request resolved: pytorch#139906
Approved by: https://github.com/leslie-fang-intel, https://github.com/jgong5
@github-actions github-actions bot deleted the sanchitj/patch_amx_gemm_micro_kernel branch December 8, 2024 02:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants