GPTQModel v0.9.5

@Qubitium released this 05 Jul 13:48
f0a1ee8

What's Changed

Another large update that adds support for Intel/Qbits quantization and inference on CPU. The CUDA kernels have been fully deprecated in favor of the better-performing Exllama (v1/v2), Marlin, and Triton kernels.

Full Changelog: v0.9.4...v0.9.5