Skip to content

Conversation

@ZYC-ModelCloud
Copy link
Contributor

@ZYC-ModelCloud ZYC-ModelCloud commented Nov 6, 2024

Ad load() to unify from_pretrained and from_quantized .

Remove overrding/passing quantize_config in from_quantized.

@ZYC-ModelCloud ZYC-ModelCloud changed the title Add from api [MISC]Add _from api Nov 6, 2024
Copy link
Collaborator

@Qubitium Qubitium left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Reemove passing and overriding of quantize_config in load() stage. Quant config should always be loaded via saved model config.

I cannot think of a plausible scenario where a quant model is saved without config and needs an override. This legacy code from autogptq make is much complex than necessary.

@Qubitium Qubitium changed the title [MISC]Add _from api [API] Replace from_pretrained and from_quantized with unified load() Nov 7, 2024
@Qubitium Qubitium merged commit 8dea559 into ModelCloud:main Nov 8, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants