Notable Changes
- FIX JIT Pytorch extension
pack_cpu_extstall by @ZX-ModelCloud in #2248 - Refractor Kernel External Dependency Validation by @LRL2-ModelCloud in #2249
- FIX some models not honoring model.config.use_cache by force pass use_cache=false by @LRL2-ModelCloud in #2246
- FIX Incorrect Triton dequant_kernel for 3-bit GPTQ (INT3) leads to Triton compile error / wrong dequantization #2251 by
- Support llm-awq by @ZX-ModelCloud in #2252
What's Changed
- Update version.py by @Qubitium in #2247
- Update README.md by @davedgd in #2250
- [CI] add torch 2.9.1 by @CSY-ModelCloud in #2254
@KingdalfGoodman in #2258 - Update license declaration in pyproject.toml by @CSY-ModelCloud in #2259
- Modify setup by @Qubitium in #2260
- Add release notes for version 5.6.2 by @Qubitium in #2261
- fix test_quant_formats.py by @LRL2-ModelCloud in #2262
- [CI] mount dateset dir to /monster/data/model/dataset by @CSY-ModelCloud in #2263
- fix parsing args by @CSY-ModelCloud in #2264
New Contributors
- @KingdalfGoodman made their first contribution in #2258
Full Changelog: v5.6.0...v5.6.2