model: add glm-asr support #17901
…build_stack for padding and review
02f6e66 to 86339b0
ngxson
left a comment
The mtmd changes look good; waiting for final approval from @CISC.
You probably need to rebase to fix server CIs.
CISC
left a comment
EditorConfig error is unrelated.
Hi, I think this PR is ready to be merged. It seems the failures in the ggml-ci-x64-cpu-low-perf and ggml-ci-x64-cpu-high-perf CI jobs are unrelated to my changes.
CISC
left a comment
Tested conversion, it was not a pleasant experience. :)
This works far more smoothly...
This PR adds support for the GLM-ASR architecture, specifically validating with the zai-org/GLM-ASR-Nano-2512 model.
Key Changes:
convert_hf_to_gguf.py now handles dynamic configuration keys (GLM-ASR uses "lm_config" instead of "text_config"). It identifies the config section by checking:
llm_config_key = "lm_config" if "lm_config" in self.hparams else "text_config"
Result
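For reference, the config-key check described under Key Changes can be sketched in isolation as follows. This is a minimal illustration, not the code from the PR: the helper names select_llm_config_key and get_llm_config are hypothetical, hparams is assumed to be a plain dict loaded from the model's config.json, and the hidden_size values are placeholders.

```python
from typing import Any


def select_llm_config_key(hparams: dict[str, Any]) -> str:
    """Pick the name of the nested language-model config section.

    Hypothetical helper: prefers "lm_config" when present, otherwise
    falls back to "text_config", mirroring the one-line check quoted
    in the PR description.
    """
    return "lm_config" if "lm_config" in hparams else "text_config"


def get_llm_config(hparams: dict[str, Any]) -> dict[str, Any]:
    """Return the nested language-model config, or an empty dict if absent."""
    return hparams.get(select_llm_config_key(hparams), {})


# Placeholder configs for illustration only; the values are not taken
# from any real model.
glm_asr_like = {"lm_config": {"hidden_size": 1}}
other_model = {"text_config": {"hidden_size": 2}}

print(select_llm_config_key(glm_asr_like))
print(select_llm_config_key(other_model))
```

Falling back to "text_config" keeps the behavior unchanged for models that do not define "lm_config".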