[BUG] Incorrect Triton dequant_kernel for 3-bit GPTQ (INT3) leads to Triton compile error / wrong dequantization #2251 #2258

KingdalfGoodman · 2025-12-12T01:27:18Z

Summary

Fix Triton dequant_kernel for GPTQ INT3 (bits=3):

Remove invalid tensor-valued Python if control flow (Triton compile error).
Align 3-bit dequantization with GPTQ CPU packing format (10-1-10-1-10, 32 values → 3 int32 words).

This makes Triton INT3 inference compile correctly and produces numerically consistent results with the CPU reference.

Related Issue

Fixes / relates to: #2251

Notes

This is a correctness-first fix: it restores functional INT3 Triton inference and matches the CPU reference PackableQuantLinear / pack_block logic for 3-bit GPTQ. Performance has not been fully optimized or benchmarked yet.
Tested on INT3 GPTQ models (e.g. Qwen3-4B) with wikitext; perplexity returns to a reasonable range.

Qubitium · 2025-12-12T01:45:42Z

@KingdalfGoodman Thank you!

fix INT3 BUG

0d83405

KingdalfGoodman mentioned this pull request Dec 12, 2025

[BUG] Incorrect Triton dequant_kernel for 3-bit GPTQ (INT3) leads to Triton compile error / wrong dequantization #2251

Closed

Qubitium merged commit 4443148 into ModelCloud:main Dec 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BUG] Incorrect Triton dequant_kernel for 3-bit GPTQ (INT3) leads to Triton compile error / wrong dequantization #2251 #2258

[BUG] Incorrect Triton dequant_kernel for 3-bit GPTQ (INT3) leads to Triton compile error / wrong dequantization #2251 #2258

Uh oh!

KingdalfGoodman commented Dec 12, 2025

Uh oh!

Qubitium commented Dec 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[BUG] Incorrect Triton dequant_kernel for 3-bit GPTQ (INT3) leads to Triton compile error / wrong dequantization #2251 #2258

[BUG] Incorrect Triton dequant_kernel for 3-bit GPTQ (INT3) leads to Triton compile error / wrong dequantization #2251 #2258

Uh oh!

Conversation

KingdalfGoodman commented Dec 12, 2025

Summary

Related Issue

Notes

Uh oh!

Qubitium commented Dec 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants