Comparing changes

…_cache=false (#2246) * disable cache * add comments * Update looper_helpers.py * Add comment about use_cache handling in models Added a comment regarding the use_cache property in models. * Fix typo in TODO comment about use_cache property * Fix typo in TODO comment about use_cache property --------- Co-authored-by: Qubitium-ModelCloud <[email protected]>

* add safe_load_cpp_ext() Signed-off-by: ZX-ModelCloud <[email protected]> * fix safe_load_cpp_ext() lock Signed-off-by: ZX-ModelCloud <[email protected]> * Revert "fix safe_load_cpp_ext() lock" This reverts commit 34f6373. * Create build_directory if it does not exist Signed-off-by: ZX-ModelCloud <[email protected]> * format Signed-off-by: ZX-ModelCloud <[email protected]> * Update cpp.py --------- Signed-off-by: ZX-ModelCloud <[email protected]> Co-authored-by: Qubitium-ModelCloud <[email protected]>

Fixed a very minor typo in the recent release notes.

* add validate_once() * add cache_validate_once * bitblas and machete use cache_validate_once * mod exllama kernel * mod exllama2 * fixed machete * fix maxsize * fix method name * fix * cleanup * mod marlin_awq * mod qqq * cleanup * cleanup * cleanup * cleanup * fix import * fix import * cleanup * format code * make validate_one return a tuple of bool and optional[exception]

* Update license declaration in pyproject.toml * Remove license-files from pyproject.toml * Include licenses directory in MANIFEST.in * Downgrade version from 5.7.0 to 5.6.2

* device-smi/logbar depend udpate * log when we cannot auto-download pre-compiled whls * pypcre depend update

* Add release notes for version 5.6.2 Updated latest news section with recent release notes. * Update latest news section in README.md

* fix parsing args * fix parsing args * [CI] build eora for test

* add TestInferenceOnly::test_inference_quantized_by_llm_awq Signed-off-by: ZX-ModelCloud <[email protected]> * AwqGEMVFastQuantLinear adapted for llm-awq Signed-off-by: ZX-ModelCloud <[email protected]> * check AWQ_PACKING_BACKEND_FIELD Signed-off-by: ZX-ModelCloud <[email protected]> * AwqGEMVFastQuantLinear supports two zeros name Signed-off-by: ZX-ModelCloud <[email protected]> * add BACKEND.LLM_AWQ and FORMAT.LLM_AWQ Signed-off-by: ZX-ModelCloud <[email protected]> * removed BACKEND.LLM_AWQ Signed-off-by: ZX-ModelCloud <[email protected]> * cleanup Signed-off-by: ZX-ModelCloud <[email protected]> --------- Signed-off-by: ZX-ModelCloud <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comparing changes

Open a pull request

Commits on Dec 10, 2025

Commits on Dec 11, 2025

Commits on Dec 12, 2025

This comparison is taking too long to generate.

Uh oh!