Relax tolerance for test_quick_baddbmm_cpu_complex64 #152424

Flamefire · 2025-04-29T12:35:56Z

On Zen 2 (AMD EPYC) and Intel Sapphire Rapids this fails with small differences when compiled with native targeted optimizations. I.e. it fails with -march=znver2 but succeeds with -march=znver1.

I assume some operator fusing is being used by GCC. Small differences like using vmovdqa can be seen in the minimized code of the baddbmm kernel: https://godbolt.org/z/jsxMa91Wb

The greatest differences are consistent and the same on both CPU architectures:

Greatest absolute difference: 3.43852152582258e-05 at index (1, 2, 1) (up to 1e-05 allowed)
Greatest relative difference: 3.6034286949870875e-06 at index (1, 2, 1) (up to 1.3e-06 allowed)

Hence I assume this is in the expected tolerances especially as complex128 and all other types pass.

pytorch-bot · 2025-04-29T12:36:00Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152424

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit b489969 with merge base 6737e2c ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

torch/testing/_internal/common_methods_invocations.py

Skylion007 · 2025-05-05T14:29:46Z

@pytorchbot merge

pytorchmergebot · 2025-05-05T14:31:41Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2025-05-05T20:13:27Z

Merge failed

Reason: 1 jobs have failed, first few of them are: trunk / linux-focal-rocm-py3.10 / test (default, 1, 2, linux.rocm.gpu.2)

Details for Dev Infra team

Raised by workflow job

Flamefire · 2025-05-06T07:47:05Z

This is the failure:

The job has exceeded the maximum execution time of 5h0m0s

hanging in the ROCM setup step. Can we ignore that for the merge?

Flamefire · 2025-06-02T12:32:58Z

@pytorchbot merge

pytorchmergebot · 2025-06-02T12:34:53Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2025-06-02T12:35:08Z

Merge failed

Reason: 1 jobs have failed, first few of them are: trunk / linux-focal-rocm-py3.10 / test (default, 1, 2, linux.rocm.gpu.2)

Details for Dev Infra team

Raised by workflow job

Flamefire · 2025-06-03T07:19:04Z

@Skylion007 The failure seems to be unrelated and there is no message what exactly failed. What shall we do with that?

github-actions · 2025-08-02T07:35:47Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

Flamefire · 2025-08-04T09:53:32Z

Any updates here?

Flamefire · 2025-09-03T14:12:11Z

@pytorchbot rebase

pytorchmergebot · 2025-09-03T14:13:42Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

On Zen 2 (AMD EPYC) and Intel Sapphire Rapids this fails with small differences when compiled with native targeted optimizations. I.e. it fails with `-march=znver2` but succeeds with `-march=znver1`.

pytorchmergebot · 2025-09-03T14:13:46Z

Successfully rebased decomp-tolerance onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout decomp-tolerance && git pull --rebase)

Flamefire · 2025-09-04T13:18:54Z

@pytorchbot merge

pytorchmergebot · 2025-09-04T13:20:48Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

On Zen 2 (AMD EPYC) and Intel Sapphire Rapids this fails with small differences when compiled with native targeted optimizations. I.e. it fails with `-march=znver2` but succeeds with `-march=znver1`. I assume some operator fusing is being used by GCC. Small differences like using `vmovdqa` can be seen in the minimized code of the baddbmm kernel: https://godbolt.org/z/jsxMa91Wb The greatest differences are consistent and the same on both CPU architectures: ``` Greatest absolute difference: 3.43852152582258e-05 at index (1, 2, 1) (up to 1e-05 allowed) Greatest relative difference: 3.6034286949870875e-06 at index (1, 2, 1) (up to 1.3e-06 allowed) ``` Hence I assume this is in the expected tolerances especially as `complex128` and all other types pass. Pull Request resolved: pytorch#152424 Approved by: https://github.com/malfet

Flamefire requested a review from mruberry as a code owner April 29, 2025 12:35

pytorch-bot bot added the release notes: python_frontend python frontend release notes category label Apr 29, 2025

Flamefire added topic: not user facing topic category and removed release notes: python_frontend python frontend release notes category labels Apr 29, 2025

pytorchbot added the open source label Apr 29, 2025

malfet approved these changes Apr 29, 2025

View reviewed changes

torch/testing/_internal/common_methods_invocations.py Show resolved Hide resolved

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label May 5, 2025

pytorchmergebot added the merging label May 5, 2025

pytorchmergebot removed the merging label May 5, 2025

pytorchmergebot added the merging label Jun 2, 2025

pytorchmergebot removed the merging label Jun 2, 2025

github-actions bot added the Stale label Aug 2, 2025

github-actions bot closed this Sep 3, 2025

Flamefire reopened this Sep 3, 2025

Flamefire added 3 commits September 3, 2025 14:13

Relax tolerance for test_quick_baddbmm_cpu_complex64

a595f5c

On Zen 2 (AMD EPYC) and Intel Sapphire Rapids this fails with small differences when compiled with native targeted optimizations. I.e. it fails with `-march=znver2` but succeeds with `-march=znver1`.

Restrict tolerance override to CPU

a5d42b2

Add comment

b489969

pytorchmergebot force-pushed the decomp-tolerance branch from 19c6b89 to b489969 Compare September 3, 2025 14:13

pytorchmergebot added the merging label Sep 4, 2025

pytorchmergebot added the Merged label Sep 4, 2025

pytorchmergebot closed this in e532c9d Sep 4, 2025

pytorchmergebot removed the merging label Sep 4, 2025

Flamefire deleted the decomp-tolerance branch September 4, 2025 13:29

Relax tolerance for test_quick_baddbmm_cpu_complex64 #152424

Relax tolerance for test_quick_baddbmm_cpu_complex64 #152424

Uh oh!

Conversation

Flamefire commented Apr 29, 2025

Uh oh!

pytorch-bot bot commented Apr 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152424

✅ No Failures

Uh oh!

Uh oh!

Skylion007 commented May 5, 2025

Uh oh!

pytorchmergebot commented May 5, 2025

Merge started

Uh oh!

pytorchmergebot commented May 5, 2025

Merge failed

Uh oh!

Flamefire commented May 6, 2025

Uh oh!

Flamefire commented Jun 2, 2025

Uh oh!

pytorchmergebot commented Jun 2, 2025

Merge started

Uh oh!

pytorchmergebot commented Jun 2, 2025

Merge failed

Uh oh!

Flamefire commented Jun 3, 2025

Uh oh!

github-actions bot commented Aug 2, 2025

Uh oh!

Flamefire commented Aug 4, 2025

Uh oh!

Flamefire commented Sep 3, 2025

Uh oh!

pytorchmergebot commented Sep 3, 2025

Uh oh!

pytorchmergebot commented Sep 3, 2025

Uh oh!

Flamefire commented Sep 4, 2025

Uh oh!

pytorchmergebot commented Sep 4, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

pytorch-bot bot commented Apr 29, 2025 •

edited

Loading