[API] Replace from_pretrained and from_quantized with unified load() #535

ZYC-ModelCloud · 2024-11-06T05:41:06Z

Add load() to unify from_pretrained and from_quantized.

Remove overriding/passing of quantize_config in from_quantized.

Qubitium

Remove passing and overriding of quantize_config in the load() stage. The quant config should always be loaded from the saved model config.

I cannot think of a plausible scenario where a quantized model is saved without its config and needs an override. This legacy code from AutoGPTQ makes loading much more complex than necessary.
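
To make the proposal concrete, here is a minimal, self-contained sketch of how a unified load() could dispatch between the two paths. It is not the code from this PR: the `load` signature, the returned dict, the `quantize_config.json` file name, and the choice to raise on an attempted override are all assumptions made for illustration.

```python
import json
import os
import tempfile
from typing import Optional

# Assumed name of the config file written next to a quantized checkpoint.
QUANT_CONFIG_FILE = "quantize_config.json"


def load(model_path: str, quantize_config: Optional[dict] = None) -> dict:
    """Hypothetical unified loader: pick the code path from what is on disk."""
    config_path = os.path.join(model_path, QUANT_CONFIG_FILE)
    if os.path.isfile(config_path):
        # Already quantized: the saved config is the single source of truth.
        if quantize_config is not None:
            # Assumption: reject overrides instead of silently ignoring them.
            raise ValueError("quantize_config cannot be overridden for a quantized model")
        with open(config_path) as f:
            saved = json.load(f)
        return {"mode": "quantized", "quantize_config": saved}
    # No saved quant config: treat as a base model that is about to be quantized.
    return {"mode": "pretrained", "quantize_config": quantize_config}


with tempfile.TemporaryDirectory() as model_dir:
    # Empty directory stands in for a base model: takes the pretrained branch.
    print(load(model_dir, {"bits": 4})["mode"])
    with open(os.path.join(model_dir, QUANT_CONFIG_FILE), "w") as f:
        json.dump({"bits": 4, "group_size": 128}, f)
    # Saved config is now present: takes the quantized branch.
    print(load(model_dir)["mode"])
```

With this shape the caller never has to say which kind of checkpoint is being loaded, and a quantized model cannot end up running with a config that differs from the one it was saved with.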

gptqmodel/models/auto.py

ZYC-ModelCloud added 2 commits November 6, 2024 05:34

add auto_config_from function

ffd868b

remove unused import

a1aaca2

ZYC-ModelCloud changed the title ~~Add from api~~ [MISC]Add _from api Nov 6, 2024

ZYC-ModelCloud and others added 6 commits November 6, 2024 06:01

change args order

b748a66

add args format

b9c5ec2

modify method name

46e043d

add save() method

86c53d5

modify unit test file call load and save

7dc30d2

mod api

062db63

Qubitium reviewed Nov 7, 2024

gptqmodel/models/auto.py (outdated, resolved)

gptqmodel/models/auto.py (outdated, resolved)

Qubitium changed the title ~~[MISC]Add _from api~~ [API] Replace from_pretrained and from_quantized with unified load() Nov 7, 2024

Qubitium mentioned this pull request Nov 7, 2024

[REFRACTOR] Model loading #537

Closed

ZYC-ModelCloud and others added 18 commits November 7, 2024 01:36

remove quantize_config in from_quantized method

3446631

remove format arg

738c0b5

remove unused import

523f0ce

mod model_name_or_path to model_id_or_path

08a075a

checkout quantize_config.file exist

fbd825b

remove max_memory args

072881e

Resolve merge conflict

eda7283

modify use new api

70df2ce

Merge branch 'main' into add-from-api

eb87edc

format code

2216f60

modify checkout file exist code

3904c46

remove unused args

da5325f

pass trust_remote_code

f3aae3f

fix wrong call

aafa508

add args **cached_file_kwargs, **kwargs

7df2a5e

reduce data length

3e6757b

remove unused unit test codes

e71e40d

remove unused unit test codes

7b016bf

Qubitium merged commit 8dea559 into ModelCloud:main Nov 8, 2024
1 check passed

3 participants
