Skip to content

Conversation

@ZX-ModelCloud
Copy link
Collaborator

No description provided.

Signed-off-by: ZX-ModelCloud <[email protected]>
Signed-off-by: ZX-ModelCloud <[email protected]>
@Qubitium Qubitium changed the title add quantization/rotation dir Add QQQ Rotation Mar 12, 2025
@Qubitium Qubitium marked this pull request as ready for review March 12, 2025 05:52
@Qubitium
Copy link
Collaborator

@HandH1998 We had added rotation. But the roatation is causing degradation issues on llama 3.2 models with latest transformers and older lllama have no issue. We are checking if the changes to rotary_scaling in model config is causing issue in later transformers.

@Qubitium Qubitium merged commit a5e2b94 into main Mar 12, 2025
4 checks passed
@Qubitium Qubitium deleted the zx_add_rotation branch March 12, 2025 09:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants