Skip to content

GPTQModel v1.3.1

Choose a tag to compare

@Qubitium Qubitium released this 29 Nov 04:10
· 1241 commits to main since this release
e7f1437

What's Changed

⚡ Olmo2 model support.
⚡ Intel XPU acceleration via IPEX.
Sharding compat fix due to api deprecation in HF Transformers.
Removed triton dependency. Triton kernel now optionally dependent on triton pkg.
Fixed Hymba Test (Hymba requires desc_act=False)

Full Changelog: v1.3.0...v1.3.1