Conversation

@ZX-ModelCloud
Collaborator

No description provided.

@ZX-ModelCloud ZX-ModelCloud marked this pull request as draft August 21, 2025 05:56
@Qubitium Qubitium changed the title support autoawq [QUANTIZATION[ Add AWQ support Aug 21, 2025
@Qubitium Qubitium changed the title [QUANTIZATION[ Add AWQ support [QUANTIZATION] Add AWQ support Aug 21, 2025
@Qubitium
Collaborator

Qubitium commented Aug 21, 2025

TODO:

  1. Use GPTQModel QuantConfig format. Only the new zero_point property is unique to AWQ; version already maps perfectly to our format. Add zero_point as an awq_zero_point property. Non-AWQ quants will have this awq_-prefixed property stripped on quant save. Every quant format thus gets a unique {format_name}_ prefix for storing its own properties, so they do not pollute or collide with shared properties.
  2. Merge the kernels and QuantLinear into GPTQModel structure.
  3. Add AWQ process pipelines.
  4. Add CI tests.
  5. Finally, add AWQ config to GPTQConfig runtime conversion so we can also execute AutoAWQ-quantized models. Future models will use our config.
  6. Upstream changes to hf/sglang/vllm so they can recognize our config format (minimal code change).
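
To make items 1 and 5 concrete, here is a rough sketch of the prefix scheme and the runtime conversion. It is not the code in this PR: the helper names (`convert_autoawq_config`, `strip_foreign_properties`), the `KNOWN_METHODS` tuple, the target keys (`quant_method`, `bits`, `group_size`, `format`), and the AutoAWQ source keys (`w_bit`, `q_group_size`, `version`) are all assumptions made for illustration. Only `zero_point` and `awq_zero_point` come from the TODO above.

```python
from typing import Any, Dict

# Hypothetical list of quant methods; each owns the "{method}_" key prefix.
KNOWN_METHODS = ("gptq", "awq")


def convert_autoawq_config(awq: Dict[str, Any]) -> Dict[str, Any]:
    """Map an AutoAWQ-style config dict onto a shared config layout.

    The source and target key names here are assumptions, not a confirmed API.
    """
    return {
        "quant_method": "awq",
        "bits": awq["w_bit"],
        "group_size": awq["q_group_size"],
        # Assumes AWQ "version" maps onto the shared "format" property.
        "format": awq["version"].lower(),
        # The AWQ-only property lives under the awq_ prefix.
        "awq_zero_point": awq["zero_point"],
    }


def strip_foreign_properties(config: Dict[str, Any], method: str) -> Dict[str, Any]:
    """Drop keys prefixed for any quant method other than `method` (done on save)."""
    foreign = tuple(f"{m}_" for m in KNOWN_METHODS if m != method)
    return {k: v for k, v in config.items() if not k.startswith(foreign)}


if __name__ == "__main__":
    autoawq = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}
    cfg = convert_autoawq_config(autoawq)
    print(strip_foreign_properties(cfg, "awq"))   # keeps awq_zero_point
    print(strip_foreign_properties(cfg, "gptq"))  # awq_zero_point removed
```

Keeping method-specific keys behind a prefix means the save path only needs a prefix filter, and shared keys never need per-method special cases.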

Signed-off-by: ZX-ModelCloud <[email protected]>
@Qubitium Qubitium marked this pull request as ready for review September 17, 2025 08:28
@Qubitium Qubitium merged commit 2d8f35f into main Sep 18, 2025
4 checks passed
@Qubitium Qubitium deleted the zx_add_autoawq branch November 18, 2025 14:25
4 participants