Skip to content

Conversation

@ZX-ModelCloud
Copy link
Collaborator

No description provided.

ZX-ModelCloud and others added 16 commits September 22, 2025 09:23
Signed-off-by: ZX-ModelCloud <[email protected]>
Signed-off-by: ZX-ModelCloud <[email protected]>
Signed-off-by: Qubitium <[email protected]>
Signed-off-by: ZX-ModelCloud <[email protected]>
Signed-off-by: ZX-ModelCloud <[email protected]>
Signed-off-by: ZX-ModelCloud <[email protected]>
Signed-off-by: ZX-ModelCloud <[email protected]>
…_ES1_RKS3_S1_S5_S5_S5_S1_RKllllbbbb

Signed-off-by: ZX-ModelCloud <[email protected]>
Signed-off-by: ZX-ModelCloud <[email protected]>
@Qubitium Qubitium marked this pull request as ready for review September 22, 2025 10:03
@Qubitium Qubitium merged commit 1f6dd5c into main Sep 22, 2025
5 checks passed
@Qubitium Qubitium deleted the zx_update_marlin_kernel branch September 22, 2025 10:03
Qubitium added a commit that referenced this pull request Sep 23, 2025
* update marlin kernel code

Signed-off-by: ZX-ModelCloud <[email protected]>

* remove TORCH_LIBRARY_IMPL_EXPAND

Signed-off-by: ZX-ModelCloud <[email protected]>

* update setup.py

Signed-off-by: ZX-ModelCloud <[email protected]>

* ignore warnings

Signed-off-by: Qubitium <[email protected]>

* fix include error

Signed-off-by: ZX-ModelCloud <[email protected]>

* update gptq marlin quant linear

Signed-off-by: ZX-ModelCloud <[email protected]>

* remove -diag-suppress=20281

Signed-off-by: ZX-ModelCloud <[email protected]>

* remove -diag-suppress=20281

Signed-off-by: ZX-ModelCloud <[email protected]>

* append "gptqmodel_ext/marlin/kernel_*.cu"

Signed-off-by: ZX-ModelCloud <[email protected]>

* fix undefined symbol: _Z16gptq_marlin_gemmRN2at6TensorESt8optionalIS0_ES1_RKS3_S1_S5_S5_S5_S1_RKllllbbbb

Signed-off-by: ZX-ModelCloud <[email protected]>

* fix wrong usage with _transform_param()

Signed-off-by: ZX-ModelCloud <[email protected]>

* fix KeyError: "attribute 'g_idx' already exists"

Signed-off-by: ZX-ModelCloud <[email protected]>

* fix marlin forward

Signed-off-by: ZX-ModelCloud <[email protected]>

* default to enable atomic add for small size_n

Signed-off-by: Qubitium <[email protected]>

* default to enable atomic add for small size_n

Signed-off-by: Qubitium <[email protected]>

---------

Signed-off-by: ZX-ModelCloud <[email protected]>
Signed-off-by: Qubitium <[email protected]>
Co-authored-by: Qubitium <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants