BUG: Fix float16-sort failures on 32-bit x86 MSVC #29908

seiko2plus · 2025-10-09T08:13:18Z

The failures are triggered when the Intel x86 sort AVX-512 kernels for 16-bit types are enabled at build time and the CPU/OS also supports them. From a quick look, the compiler does not seem to generate correct instructions for `zmm_vector<float16>::ge(reg_t, reg_t)`.

This patch does not fix the underlying bug; instead, it disables these kernels on 32-bit MSVC builds as a stop-gap, since the issue requires further investigation and an upstream fix.

Note: newer NumPy releases may drop AVX-512 support on 32-bit entirely for all compilers and enable at most AVX2, as part of gh-28896.
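For readers unfamiliar with this kind of stop-gap, the following is a minimal sketch of a compile-time guard for 32-bit MSVC. It is not the actual change in this PR: the macro names `EXAMPLE_IS_MSVC_X86_32` and `EXAMPLE_ENABLE_AVX512_QSORT_16BIT` and the helper function are hypothetical and chosen for illustration only. The only real identifiers are the compiler-predefined macros `_MSC_VER` and `_M_IX86`.

```cpp
// Sketch only: the EXAMPLE_* names are hypothetical and do not appear in NumPy.
// _MSC_VER is predefined by MSVC; _M_IX86 is predefined only when the
// compilation target is 32-bit x86.
#if defined(_MSC_VER) && defined(_M_IX86)
    #define EXAMPLE_IS_MSVC_X86_32 1
#else
    #define EXAMPLE_IS_MSVC_X86_32 0
#endif

// Enable the (hypothetical) AVX-512 16-bit sort path everywhere except on
// 32-bit MSVC, where the PR description reports miscompiled comparisons.
#if EXAMPLE_IS_MSVC_X86_32
    #define EXAMPLE_ENABLE_AVX512_QSORT_16BIT 0
#else
    #define EXAMPLE_ENABLE_AVX512_QSORT_16BIT 1
#endif

// Compile-time query that dispatch code could consult before selecting the
// AVX-512 16-bit kernels; otherwise it would fall back to another code path.
constexpr bool example_avx512_qsort_16bit_allowed()
{
    return EXAMPLE_ENABLE_AVX512_QSORT_16BIT != 0;
}
```

A guard of this shape only removes the affected kernels from the build for one compiler/target pair; the runtime CPU feature check still applies on every other configuration.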

closes #29808

seiko2plus · 2025-10-09T08:24:54Z

cc @r-devulap


charris · 2025-10-09T15:39:48Z

Is this only a problem with MSVC?

charris · 2025-10-09T15:40:25Z

Thanks Sayed.

r-devulap · 2025-10-09T16:03:37Z

Thanks @seiko2plus for debugging and fixing this! Disabling on MSVC 32-bit sounds fine to me.



seiko2plus added the 09 - Backport-Candidate and component: SIMD labels Oct 9, 2025

github-actions bot added the 00 - Bug label Oct 9, 2025

seiko2plus force-pushed the disable-msvc-x86-avx512-intel-qsort16bit branch from 62962ea to 34c334b October 9, 2025 08:27

seiko2plus mentioned this pull request Oct 9, 2025

New float16 failures in 32-bit Windows wheel build jobs #29808

Closed

charris merged commit 53b0d99 into numpy:main Oct 9, 2025
77 checks passed

charris mentioned this pull request Oct 9, 2025

BUG: Fix float16-sort failures on 32-bit x86 MSVC (#29908) #29910

Merged

charris removed the 09 - Backport-Candidate label Oct 9, 2025

charris added a commit that referenced this pull request Oct 9, 2025

Merge pull request #29910 from charris/backport-29908

5395b1c

BUG: Fix float16-sort failures on 32-bit x86 MSVC (#29908)
