-
Notifications
You must be signed in to change notification settings - Fork 140
Comparing changes
Open a pull request
base repository: ModelCloud/GPTQModel
base: v1.5.0
head repository: ModelCloud/GPTQModel
compare: v1.5.1
- 19 commits
- 22 files changed
- 6 contributors
Commits on Dec 24, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 5712c49 - Browse repository at this point
Copy the full SHA 5712c49View commit details -
Configuration menu - View commit details
-
Copy full SHA for a235687 - Browse repository at this point
Copy the full SHA a235687View commit details -
Add QuantizeConfig.device and use. (#950)
* normalize device + device_map * normalize device+device+map+dtype in from_pretrained() * disallow passing of device/device_map in pretrained(). add `device` to QuantizeConfig. * if user pass device+device_map and quantizeconfig.device is none, use...else quantizeconfig.device, fall back is auto select * auto-device logic should not be here * reduce reliance on accelerate * remove bad device override * fix dev not define * cleanup * already check device when select_quant_linear * fix marlin post_init --------- Co-authored-by: LRL-ModelCloud <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 55f9d72 - Browse repository at this point
Copy the full SHA 55f9d72View commit details -
fix hf_select_quant_linear (#966)
LRL-ModelCloud authoredDec 24, 2024 Configuration menu - View commit details
-
Copy full SHA for da26575 - Browse repository at this point
Copy the full SHA da26575View commit details -
Configuration menu - View commit details
-
Copy full SHA for 00fde4b - Browse repository at this point
Copy the full SHA 00fde4bView commit details
Commits on Dec 25, 2024
-
fix cuda:0 not a enum device (#968)
* fix cuda:0 not a enum device * use normalize_device
Configuration menu - View commit details
-
Copy full SHA for ebf169a - Browse repository at this point
Copy the full SHA ebf169aView commit details
Commits on Dec 26, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 762311e - Browse repository at this point
Copy the full SHA 762311eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 394324a - Browse repository at this point
Copy the full SHA 394324aView commit details -
hf select quant_linear with pack (#969)
* hf select quant_linear with pack * mark pack Optional * pack default True
LRL-ModelCloud authoredDec 26, 2024 Configuration menu - View commit details
-
Copy full SHA for c597668 - Browse repository at this point
Copy the full SHA c597668View commit details
Commits on Dec 27, 2024
-
[CI] use new ci docker images (#974)
* [CI] all use new v4 docker * [CI] disable init_unit_tests * [CI] disable init_unit_tests * [CI] disable init_unit_tests * [CI] remove login arg * [CI] print logs * [CI] source /opt/pyenv.sh * [CI] add login arg * [CI] remove source * [CI] show list * [CI] remove torch 2.4 * [CI] max 10 * [CI] clean cache * [CI] add models/ tests * update logs * add cache clean * [CI] show vram * [CI] clean cache at earlier step * [CI] fix env * [CI] fix env * [CI] show pip list * [CI] use v5 * [CI] fix xpu env * [CI] use 10.0.13.31 * Update release.yml * fix runs on * decrease delta to -20% * install transformers for test_cohere2 * decrease delta to -20%
Configuration menu - View commit details
-
Copy full SHA for bfb75f0 - Browse repository at this point
Copy the full SHA bfb75f0View commit details -
Configuration menu - View commit details
-
Copy full SHA for c296123 - Browse repository at this point
Copy the full SHA c296123View commit details -
Configuration menu - View commit details
-
Copy full SHA for 98540eb - Browse repository at this point
Copy the full SHA 98540ebView commit details
Commits on Dec 30, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 0bd8417 - Browse repository at this point
Copy the full SHA 0bd8417View commit details
Commits on Dec 31, 2024
-
use new ci docker images (#980)
* [CI] all use new v4 docker * [CI] disable init_unit_tests * [CI] disable init_unit_tests * [CI] disable init_unit_tests * [CI] remove login arg * [CI] print logs * [CI] source /opt/pyenv.sh * [CI] add login arg * [CI] remove source * [CI] show list * [CI] remove torch 2.4 * [CI] max 10 * [CI] clean cache * [CI] add models/ tests * update logs * add cache clean * [CI] show vram * [CI] clean cache at earlier step * [CI] fix env * [CI] fix env * [CI] show pip list * [CI] use v5 * [CI] fix xpu env * [CI] use 10.0.13.31 * Update release.yml * fix runs on * decrease delta to -20% * install transformers for test_cohere2 * decrease delta to -20%
Configuration menu - View commit details
-
Copy full SHA for ac040bd - Browse repository at this point
Copy the full SHA ac040bdView commit details -
fix flash attntion was auto loaded on cpu for pretrained model (#981)
* set _attn_implementation_autoset to fix auto loading flash attention on CPU * add comments
Configuration menu - View commit details
-
Copy full SHA for ce2a96e - Browse repository at this point
Copy the full SHA ce2a96eView commit details -
fix old transformer doesn't have _attn_implementation_autoset (#982)
* fix old transformer doesn't have _attn_implementation_autoset * use another func to parse Version
Configuration menu - View commit details
-
Copy full SHA for ace914b - Browse repository at this point
Copy the full SHA ace914bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5052421 - Browse repository at this point
Copy the full SHA 5052421View commit details
Commits on Jan 1, 2025
-
prepare for 2025 1.5.1 release (#984)
* prepare for 2025 1.5.1 release * Update README.md
Configuration menu - View commit details
-
Copy full SHA for 244f17b - Browse repository at this point
Copy the full SHA 244f17bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4f18747 - Browse repository at this point
Copy the full SHA 4f18747View commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff v1.5.0...v1.5.1