Comparing changes
base repository: ModelCloud/GPTQModel
base: v5.6.0
head repository: ModelCloud/GPTQModel
compare: v5.6.2
- 14 commits
- 31 files changed
- 6 contributors
Commits on Dec 10, 2025
- 95063e2 FIX some models not honoring model.config.use_cache by force pass use_cache=false (#2246)
  * disable cache
  * add comments
  * Update looper_helpers.py
  * Add comment about use_cache handling in models: Added a comment regarding the use_cache property in models.
  * Fix typo in TODO comment about use_cache property
  * Fix typo in TODO comment about use_cache property
  Co-authored-by: Qubitium-ModelCloud
- 23458a7
- 35b228f [FIX] JIT Pytorch extension pack_cpu_ext stall (#2248)
  * add safe_load_cpp_ext()
  * fix safe_load_cpp_ext() lock
  * Revert "fix safe_load_cpp_ext() lock" (this reverts commit 34f6373)
  * Create build_directory if it does not exist
  * format
  * Update cpp.py
  Signed-off-by: ZX-ModelCloud
  Co-authored-by: Qubitium-ModelCloud
- 5bf5c1a Fixed a very minor typo in the recent release notes.
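The use_cache fix above (#2246) says that some models do not honor model.config.use_cache, so use_cache=false is passed explicitly. The following is only a minimal sketch of that idea, not the project's code: `call_with_cache_disabled` and `ToyLayer` are hypothetical names, and the toy layer merely stands in for a module whose forward pass trusts its keyword argument rather than a config flag.

```python
from typing import Any, Callable, Dict


def call_with_cache_disabled(forward: Callable[..., Any], *args: Any, **kwargs: Any) -> Any:
    """Call `forward` with use_cache=False, overriding any caller-supplied value.

    Hypothetical helper for illustration only.
    """
    forced: Dict[str, Any] = dict(kwargs)
    forced["use_cache"] = False  # the explicit kwarg wins over any config default
    return forward(*args, **forced)


class ToyLayer:
    """Stand-in layer that ignores its config flag and only looks at the kwarg."""

    def __init__(self, config_use_cache: bool) -> None:
        self.config_use_cache = config_use_cache  # set, but never consulted below

    def forward(self, hidden: list, use_cache: bool = True) -> dict:
        out = {"hidden": [h * 2 for h in hidden]}
        if use_cache:
            # Simulates the extra key/value state a caching layer would keep.
            out["kv_cache"] = list(hidden)
        return out


layer = ToyLayer(config_use_cache=False)

# The config says "no cache", yet the default kwarg still produces one.
leaky = layer.forward([1, 2, 3])

# Forcing the kwarg removes the cache regardless of what the caller passed.
clean = call_with_cache_disabled(layer.forward, [1, 2, 3], use_cache=True)
```

Overriding the keyword at the call site, rather than relying on the config attribute, is one way to make the behavior uniform across model implementations that read the flag from different places.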
Commits on Dec 11, 2025
- f26c09d Refractor Kernel External Dependency Validation (#2249)
  * add validate_once()
  * add cache_validate_once
  * bitblas and machete use cache_validate_once
  * mod exllama kernel
  * mod exllama2
  * fixed machete
  * fix maxsize
  * fix method name
  * fix
  * cleanup
  * mod marlin_awq
  * mod qqq
  * cleanup
  * cleanup
  * cleanup
  * cleanup
  * fix import
  * fix import
  * cleanup
  * format code
  * make validate_one return a tuple of bool and optional[exception]
- 8dddd6a
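The dependency-validation refactor above (#2249) mentions a cached validate-once step that returns a tuple of bool and optional exception. The sketch below shows one way such a pattern can look, assuming `functools.lru_cache` as the cache and an import attempt as the check. `validate_dependency_once` is a hypothetical name and the commit's actual signatures are not shown in this comparison.

```python
import importlib
from functools import lru_cache
from typing import Optional, Tuple


@lru_cache(maxsize=None)
def validate_dependency_once(module_name: str) -> Tuple[bool, Optional[Exception]]:
    """Try to import `module_name` once and cache the outcome.

    Returns (True, None) on success, or (False, exc) with the captured
    exception on failure. Hypothetical helper for illustration only.
    """
    try:
        importlib.import_module(module_name)
    except Exception as exc:  # keep the error so callers can report it later
        return False, exc
    return True, None


# A standard-library module is always importable.
ok, err = validate_dependency_once("json")

# A module name that is assumed not to exist on any system.
missing_ok, missing_err = validate_dependency_once("surely_not_an_installed_module_0xdead")

# The second lookup for "json" is served from the cache, not re-validated.
again = validate_dependency_once("json")
```

Returning the exception instead of raising it lets each kernel decide whether a missing dependency is fatal or just means that kernel is skipped, while the cache keeps repeated checks cheap.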
Commits on Dec 12, 2025
- 4443148
- a4dad1a Update license declaration in pyproject.toml (#2259)
  * Update license declaration in pyproject.toml
  * Remove license-files from pyproject.toml
  * Include licenses directory in MANIFEST.in
  * Downgrade version from 5.7.0 to 5.6.2
- 4ec0370
  * device-smi/logbar depend udpate
  * log when we cannot auto-download pre-compiled whls
  * pypcre depend update
- e1bbaae Add release notes for version 5.6.2 (#2261)
  * Add release notes for version 5.6.2: Updated latest news section with recent release notes.
  * Update latest news section in README.md
- e534536
- a72941c
- 48c557e
  * fix parsing args
  * fix parsing args
  * [CI] build eora for test
- d97478f
  * add TestInferenceOnly::test_inference_quantized_by_llm_awq
  * AwqGEMVFastQuantLinear adapted for llm-awq
  * check AWQ_PACKING_BACKEND_FIELD
  * AwqGEMVFastQuantLinear supports two zeros name
  * add BACKEND.LLM_AWQ and FORMAT.LLM_AWQ
  * removed BACKEND.LLM_AWQ
  * cleanup
  Signed-off-by: ZX-ModelCloud
You can try running this command locally to see the comparison on your machine:
git diff v5.6.0...v5.6.2