
Conversation

@lizexu123
Collaborator

Fix the inference logic to use num_running_requests instead of max_num_seqs; this change brought clear gains on smaller models.
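A minimal sketch of the idea behind the change (the helper name and input layout are illustrative, not the PR's actual code): buffers are padded to max_num_seqs, but the forward pass only needs the first num_running_requests rows, so per-request tensors can be sliced down to the real batch size.

```python
import numpy as np

def slice_to_running_batch(model_inputs, num_running_requests):
    # Hypothetical helper: keep only the rows for requests that are
    # actually running, instead of the full max_num_seqs padding.
    return {k: v[:num_running_requests] for k, v in model_inputs.items()}

max_num_seqs = 8
inputs = {"seq_lens_this_time": np.zeros(max_num_seqs, dtype=np.int32)}
inputs["seq_lens_this_time"][:3] = [5, 7, 2]  # only 3 requests are running
sliced = slice_to_running_batch(inputs, 3)
print(sliced["seq_lens_this_time"].tolist())  # [5, 7, 2]
```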

@paddle-bot

paddle-bot bot commented Jul 31, 2025

Thanks for your contribution!

@lizexu123 lizexu123 closed this Jul 31, 2025
@lizexu123 lizexu123 reopened this Jul 31, 2025
@iosmers
Collaborator

iosmers commented Aug 5, 2025

LGTM

) -> Optional[ModelRunnerOutput]:
"""Run the model on the given forward batch."""
output = self.model_runner.execute_model(model_forward_batch)
if not is_dummy_run:
Collaborator

Why do we need to distinguish whether this is is_dummy_run?

Collaborator Author

The condition was inverted here; fixed. On XPU, dummy_run goes through execute_model, and num_running_requests can never be empty for a real run, but dummy_run has no way to pass it in. So the check is done here: no slicing is performed during dummy_run.
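The control flow described above can be sketched like this (names and the list-based "batch" are hypothetical stand-ins, not the PR's exact code): a dummy warm-up run cannot pass num_running_requests in, so slicing must be skipped for it.

```python
def execute_model(batch, num_running_requests=None, is_dummy_run=False):
    if not is_dummy_run:
        # Real inference: restrict work to the requests actually running.
        batch = batch[:num_running_requests]
    return len(batch)  # stand-in for the real forward pass

full_batch = list(range(8))  # padded to max_num_seqs = 8
real = execute_model(full_batch, num_running_requests=3)  # processes 3
warmup = execute_model(full_batch, is_dummy_run=True)     # processes all 8
```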

self.model_inputs["block_tables"][idx : idx + 1, :block_num] = np.arange(
idx * block_num, (idx + 1) * block_num, 1
)
self.model_inputs["seq_lens_this_time"] = self.seq_lens_this_time_buffer
Collaborator

No slicing here?

Collaborator Author

The dummy run doesn't need it.

)

def insert_prefill_inputs(self, req_dicts: List[Request]):
def insert_prefill_inputs(self, req_dicts: List[Request], num_running_requests):
Collaborator

Please add a type annotation for the input.

Collaborator Author

done
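The requested annotation might look like the sketch below. The Optional[int] type and the default value are assumptions for illustration; the merged code may differ, and the Request class here is only a placeholder for FastDeploy's actual Request type.

```python
from typing import List, Optional

class Request:
    """Placeholder standing in for FastDeploy's Request type."""

def insert_prefill_inputs(req_dicts: List[Request],
                          num_running_requests: Optional[int] = None) -> None:
    # Illustrative body only: the real method fills model input buffers.
    assert num_running_requests is None or num_running_requests <= len(req_dicts)

result = insert_prefill_inputs([Request(), Request()], num_running_requests=2)
```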

Collaborator
@carryyu carryyu left a comment

LGTM

@Jiang-Jia-Jun Jiang-Jia-Jun merged commit b01cfd6 into PaddlePaddle:develop Aug 5, 2025
10 of 14 checks passed
lizexu123 added a commit to lizexu123/FastDeploy that referenced this pull request Aug 5, 2025
* support real bsz

* fix

* fix xpu_model_runner.py,gpu_model_runner.py,gcu_model_runner.py,iluvatar_model_runner.py

* add event_loop_ep

* fix

* Add comments

* fix

* support mtp real_batch_size

* fix

* self.tmp_seq_lens_this_time->self.seq_lens_this_time_buffer

* fix

* fix VL real_seq_lens_this_time

* fix

* fix mtp

* fix

* fix mtp

* fix xpu

* fix
Jiang-Jia-Jun pushed a commit that referenced this pull request Aug 6, 2025
(same commit message list as above)
iosmers added a commit that referenced this pull request Aug 8, 2025