CPU: workaround avx512 4bit dequantize accuracy issue for large blocksize #1828

matthewdouglas · 2025-12-10T20:14:47Z

This PR fixes 8 failing tests for 4bit dequantization on CPUs with AVX512F support.

There is an accuracy issue for the AVX512 codepath with fp16/fp32 and blocksize 2048 or 4096. This is an unlikely usecase, and as such we can accept to fallback all the way to the slower PyTorch implementation instead of a more complex fallback to a scalar C++ kernel.

…size

github-actions · 2025-12-10T20:20:22Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

CPU: workaround avx512 4bit dequantize accuracy issue for large block…

e0a6888

…size

matthewdouglas added this to the v0.49.0 milestone Dec 10, 2025

matthewdouglas added the x64 CPU label Dec 10, 2025

matthewdouglas merged commit 5ea4afe into main Dec 10, 2025
274 of 280 checks passed

matthewdouglas deleted the cpu-4bit-avx512-workaround branch December 10, 2025 21:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

CPU: workaround avx512 4bit dequantize accuracy issue for large blocksize #1828

CPU: workaround avx512 4bit dequantize accuracy issue for large blocksize #1828

Uh oh!

matthewdouglas commented Dec 10, 2025

Uh oh!

github-actions bot commented Dec 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

CPU: workaround avx512 4bit dequantize accuracy issue for large blocksize #1828

CPU: workaround avx512 4bit dequantize accuracy issue for large blocksize #1828

Uh oh!

Conversation

matthewdouglas commented Dec 10, 2025

Uh oh!

github-actions bot commented Dec 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants