[Feature] Support zai-org/GLM-4.5-Air BF16 model #3928
Conversation
Thanks for your contribution!

YuanRisheng left a comment:
Unit tests need to be added.
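A unit test along the lines the reviewer asks for might compare the new rotary-embedding tables against an independent reference. The sketch below is hypothetical: the class name `GlmRotaryEmbedding` comes from the diff further down, and the reference formula is the standard RoPE definition, not FastDeploy's actual test code.

```python
import math

def reference_rope(head_dim, position, base=10000.0):
    # Standard RoPE reference: angle_i = position * base^(-2i/d)
    inv_freq = [base ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]
    angles = [position * f for f in inv_freq]
    return [math.cos(a) for a in angles], [math.sin(a) for a in angles]

def test_glm_rotary_matches_reference():
    # In the real test this would import GlmRotaryEmbedding from
    # fastdeploy and compare its cos/sin tables to this reference.
    cos, sin = reference_rope(head_dim=8, position=3)
    assert len(cos) == len(sin) == 4
    assert math.isclose(cos[0], math.cos(3.0))
```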
return rot_emb
...
class GlmRotaryEmbedding:
Could you find a way to turn ErnieRotaryEmbedding into a base class and inherit from it here? There is too much duplicated code.
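The refactor the reviewer suggests could look roughly like the sketch below: the shared frequency and cos/sin construction moves into a base class, and `GlmRotaryEmbedding` keeps only what differs. This is a minimal illustration, not FastDeploy's actual implementation; the GLM-specific partial-rotary factor shown here is an assumption.

```python
import math

class RotaryEmbedding:
    """Hypothetical base class holding the logic currently duplicated
    between ErnieRotaryEmbedding and GlmRotaryEmbedding."""

    def __init__(self, rotary_dim: int, base: float = 10000.0):
        self.rotary_dim = rotary_dim
        self.base = base

    def inv_freq(self):
        # Shared inverse-frequency table: base^(-2i/d) for i in [0, d/2)
        return [self.base ** (-2.0 * i / self.rotary_dim)
                for i in range(self.rotary_dim // 2)]

    def __call__(self, position: int):
        # Shared cos/sin construction; subclasses override only the
        # model-specific pieces.
        angles = [position * f for f in self.inv_freq()]
        return ([math.cos(a) for a in angles],
                [math.sin(a) for a in angles])

class GlmRotaryEmbedding(RotaryEmbedding):
    """Subclass keeps only the GLM-specific difference, sketched here
    as a partial-rotary factor (an assumption for illustration)."""

    def __init__(self, head_dim: int, base: float = 10000.0,
                 partial_rotary_factor: float = 0.5):
        super().__init__(int(head_dim * partial_rotary_factor), base)
```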
* support glm45_air
self.model.clear_grpah_opt_backend(fd_config=self.fd_config)
...
class Glm4MoePretrainedModel(PretrainedModel):
This class can be deleted now.
* support glm45_air
* [Feature] Support zai-org/GLM-4.5-Air BF16 model (#3928)
* support glm45_air
* [Feature] GLM-45-AIR Support Mix Quantization (Dense wfp8afp8 and wint8 triton_moe_backend) (#4051)
* check
* fix v1 load for mix and wint8
* check --quantizations 'None'
* check
* support RL rollout
* check v1 loader
* check glm rollout_model, change wfp8afp8 per_token_cast_to_fp8 to native impl
* check rollout moe gate begin layer_id
* check rollout e_score_correction_bias
* delete infer_to_train_mapping={}
* code check
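One of the commits above replaces the wfp8afp8 `per_token_cast_to_fp8` helper with a native implementation. A per-token FP8 cast typically derives a scale per row (token) from that row's absolute maximum, then clamps to the FP8 range. The sketch below emulates this in plain Python with the float8_e4m3 max of 448.0; the function shape is an assumption for illustration, not FastDeploy's actual code.

```python
FP8_E4M3_MAX = 448.0  # largest finite value in the float8_e4m3 format

def per_token_cast_to_fp8(x):
    """Quantize each row (token) of x with its own scale.

    Returns (quantized, scales); dequantization of element [i][j]
    is quantized[i][j] * scales[i].
    """
    quantized, scales = [], []
    for row in x:
        amax = max(abs(v) for v in row) or 1.0  # avoid divide-by-zero
        scale = amax / FP8_E4M3_MAX
        quantized.append(
            [max(-FP8_E4M3_MAX, min(FP8_E4M3_MAX, v / scale)) for v in row]
        )
        scales.append(scale)
    return quantized, scales
```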
Support loading the PyTorch zai-org/GLM-4.5-Air model and running inference.
model_path=./torch_models/GLM-4.5-Air
python -m fastdeploy.entrypoints.openai.api_server \
    --model ${model_path} \
    --max-model-len 32768 \
    --max-num-seqs 18 \
    --tensor-parallel-size 4 \
    --port 8112 \
    --graph-optimization-config '{"use_cudagraph":true, "graph_opt_level":0}' \
    --load_choices "default_v1"
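Since the launch command starts an OpenAI-compatible server on port 8112, a request can be sent to the standard chat-completions route. The sketch below only builds the request with the stdlib; the served model name is an assumption taken from the model path above.

```python
import json
import urllib.request

# Payload for the OpenAI-compatible /v1/chat/completions endpoint
# exposed by the api_server launch command above (port 8112).
payload = {
    "model": "GLM-4.5-Air",  # served model name (assumption)
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 64,
}

req = urllib.request.Request(
    "http://localhost:8112/v1/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
# resp = urllib.request.urlopen(req)  # uncomment with the server running
```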