Skip to content

Conversation

@Qubitium
Copy link
Collaborator

@Qubitium Qubitium commented Nov 22, 2025

  1. Add HF compatible hf_select_quant_linear_v2 api
  2. Separate AWQ GEMM kernel into GEMM_TORCH GEMM_CUDA GEMM_TRITON

Signed-off-by: ZX-ModelCloud <[email protected]>
@Qubitium Qubitium marked this pull request as draft November 24, 2025 06:39
Signed-off-by: ZX-ModelCloud <[email protected]>
@Qubitium Qubitium marked this pull request as ready for review November 27, 2025 03:08
ZX-ModelCloud and others added 2 commits November 27, 2025 15:00
Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com>
Signed-off-by: ZX-ModelCloud <[email protected]>
@Qubitium Qubitium merged commit 88db594 into main Nov 27, 2025
6 checks passed
@Qubitium Qubitium deleted the cleanup-awq branch November 27, 2025 09:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants