[Test][Easy] Use float16 dtype in test_sort_large #159939

Aidyn-A · 2025-08-06T06:25:46Z

The test fails with:

RuntimeError: var_mean only support floating point and complex dtypes

cc @ptrblck @msaroufim @eqy @jerryzh168
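For readers who want to reproduce the dtype restriction outside the test suite, here is a minimal sketch. It assumes a working PyTorch install and runs on CPU. The helper name `var_mean_or_error` is hypothetical, and `float32` stands in for the `float16` used by the PR only to keep the sketch portable across builds.

```python
import torch


def var_mean_or_error(dtype):
    """Return (var, mean) as Python floats, or the RuntimeError message."""
    x = torch.arange(8).to(dtype)  # small stand-in tensor, not the large test input
    try:
        var, mean = torch.var_mean(x)
    except RuntimeError as err:
        # Integer dtypes are rejected before any reduction happens.
        return str(err)
    return float(var), float(mean)


int_result = var_mean_or_error(torch.int8)       # expected to be an error message
float_result = var_mean_or_error(torch.float32)  # expected to be a (var, mean) pair
print(int_result)
print(float_result)
```

With an integer dtype the call should surface the error quoted above, while a floating point dtype returns a variance and mean, which is the reason for switching the test's dtype.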

pytorch-bot · 2025-08-06T06:25:51Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/159939

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 712d30f with merge base a53d14d:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

eqy

approving assuming that this never ran before and therefore this is not a regression

Aidyn-A · 2025-08-06T06:30:04Z

> approving assuming that this never ran before and therefore this is not a regression

Indeed, it never ran before. The only machine which has that much memory is GB300.

Aidyn-A · 2025-08-07T17:39:42Z

Hmm, those are strange failures

@pytorchbot rebase

pytorchmergebot · 2025-08-07T17:41:12Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

pytorchmergebot · 2025-08-07T17:41:15Z

Successfully rebased test_sort_large_float16 onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout test_sort_large_float16 && git pull --rebase)

Aidyn-A · 2025-08-08T09:54:17Z

@pytorchbot merge

pytorchmergebot · 2025-08-08T09:56:23Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

jithunnair-amd · 2025-08-08T22:04:44Z

>> approving assuming that this never ran before and therefore this is not a regression

> Indeed, it never ran before. The only machine which has that much memory is GB300.

@Aidyn-A Did this updated test run in any of the CI jobs? It failed in ROCm CI (because the MI325 has >200GB memory) with the error Cannot sort dimension of length 8192 (link) (error coming from here), but I can't tell if it passed for CUDA at all.

Aidyn-A · 2025-08-08T22:11:24Z

>>> approving assuming that this never ran before and therefore this is not a regression

>> Indeed, it never ran before. The only machine which has that much memory is GB300.

> @Aidyn-A Did this updated test run in any of the CI jobs? It failed in ROCm CI (because the MI325 has >200GB memory) with the error Cannot sort dimension of length 8192 (link) (error coming from here), but I can't tell if it passed for CUDA at all.

Yes, it is passing on GB300. Was it passing prior to my changes? I believe it should have failed, because var_mean does not support integer dtypes.

The test fails with:
> RuntimeError: var_mean only support floating point and complex dtypes

Pull Request resolved: pytorch#159939
Approved by: https://github.com/eqy

Currently, std::min -> ::min does not work as expected on ROCm when input values are >= 2147483648.
Replace std::min with a ternary statement.
Also, std::min can be replaced by the explicitly typed std::min<int64_t>.

Fixes on ROCm: test_sort_and_select.py::TestSortAndSelectCUDA::test_sort_large_cuda_float16
Error: RuntimeError: Cannot sort dimension of length 8192

Combines upstream PRs:
- pytorch#161054 to fix std::min on ROCm
- pytorch#155546 fix python test
- pytorch#159939 change test dtype from int8 to float16

Fixes: SWDEV-526432
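The threshold 2147483648 in the commit message above is 2**31, one past the largest signed 32-bit integer. The thread does not confirm the mechanism. One plausible reading, stated here purely as an assumption, is that a 32-bit `min` overload was selected and the 64-bit operands were narrowed. The sketch below only illustrates that narrowing effect in Python with `ctypes`. The helper names and operand values are hypothetical and are not taken from the PyTorch sources.

```python
import ctypes

INT32_MAX = 2**31 - 1


def min_via_int32(a, b):
    """Hypothetical: narrow both operands to signed 32-bit, then take the min."""
    return min(ctypes.c_int32(a).value, ctypes.c_int32(b).value)


def min_via_int64(a, b):
    """Hypothetical: keep both operands as signed 64-bit, as std::min<int64_t> would."""
    return min(ctypes.c_int64(a).value, ctypes.c_int64(b).value)


a = 2147483648   # threshold named in the commit message, equal to INT32_MAX + 1
b = 3000000000   # illustrative second operand, also above INT32_MAX

narrow = min_via_int32(a, b)
wide = min_via_int64(a, b)
print(a > INT32_MAX)
print(narrow, wide)
```

Under this assumption the narrowed result wraps to a negative value while the 64-bit result stays correct, which is consistent with the suggestion to use an explicitly typed `std::min<int64_t>` or a ternary.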


Aidyn-A requested a review from eqy August 6, 2025 06:25

Aidyn-A self-assigned this Aug 6, 2025

Aidyn-A added the module: cuda and topic: not user facing labels Aug 6, 2025

eqy approved these changes Aug 6, 2025


pytorchbot added the open source label Aug 6, 2025

use float16 in test_sort_large

712d30f

pytorchmergebot force-pushed the test_sort_large_float16 branch from a321cf0 to 712d30f August 7, 2025 17:41

Aidyn-A added the ciflow/trunk label Aug 7, 2025

pytorchmergebot added the merging label Aug 8, 2025

pytorchmergebot closed this in 556e2a7 Aug 8, 2025

pytorchmergebot added Merged and removed merging labels Aug 8, 2025

jithunnair-amd mentioned this pull request Aug 8, 2025

DISABLED test_sort_large_cuda_float16 (__main__.TestSortAndSelectCUDA) #159426

Closed

dnikolaev-amd mentioned this pull request Aug 21, 2025

[rocm7.1_internal_testing] fix large tensor sort on ROCm ROCm/pytorch#2543

Merged


5 participants
