[DataProcessor] add reasoning_tokens into usage info #4520

kxz2002 · 2025-10-21T09:28:48Z

Motivation

Requirements for the evaluation system: Model outputs include reasoning, tool calls, responses, etc., and should be calculated separately.

Modifications

A new completion_tokens_details field has been added to the returned usage field. This field contains a reasoning_tokens field indicating the number of tokens the model inferred in its output.

Usage or Command

Accuracy Tests

Checklist

Add at least a tag in the PR title.
- Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
- You can add new tags based on the PR content, but the semantics must be clear.
Format your code, run pre-commit before commit.
Add unit tests. Please write the reason in this PR if no unit tests.
Provide accuracy results.
If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

paddle-bot · 2025-10-21T09:28:54Z

Thanks for your contribution!

LiqinruiG

LGTM

* add reasoning_tokens into usage info initial commit * add unit tests * modify unit test * modify and add unit tests * fix unit test * move steam usage to processor * modify processor * modify test_logprobs * modify test_logprobs.py * modify stream reasoning tokens accumulation * fix unit test

* support mm prefix caching * update code * fix mm_hashes * support encoder cache * add encoder cache * update code * update encoder cache * fix features bug * fix worker bug * support processor cache, need to optimize yet * refactor multimodal data cache * update code * update code * update v1 scheduler * update code * update code * update codestyle * support turn off processor cache and encoder cache * update pre-commit * fix code * solve review * update code * update code * update test case * set processor cache in GiB * update test case * support mm prefix caching for qwen model * fix code style check * update pre-commit * fix unit test * fix unit test * add ci test case * fix rescheduled bug * change text_after_process to prompt_tokens * fix unit test * fix chat template * change model path * [EP] fix adapter bugs (#4572) * Update expert_service.py * Update common_engine.py * Update expert_service.py * fix v1 hang bug (#4573) * fix import image_ops error on some platforms (#4559) * [CLI]Update parameters in bench latecy cli tool and fix collect-env cli tool (#4558) * add collect-env * del files * [Graph Optimization] Add dy_runnable and introduce cudagraph_switch_threshold for cudagraph mode switching (#4578) * add new branch for sot * reorder * fix batch bug * [XPU]Moe uses a new operator (#4585) * [XPU]Moe uses a new operator * [XPU]Moe uses a new operator * update response * [Feature] Support Paddle-OCR (#4396) * init * update code * fix code style & disable thinking * adapt for common_engine.update_mm_requests_chunk_size * use 3d rope * use flash_attn_unpadded * opt siglip * update to be compatible with the latest codebase * fix typo * optim OCR performance * fix bug * fix bug * fix bug * fix bug * normlize name * modify xpu rope * revert logger * fix bug * fix bug * fix bug * support default_v1 * optim performance * fix bug --------- Co-authored-by: root <[email protected]> Co-authored-by: zhangyue66 <[email protected]> * [DataProcessor] add reasoning_tokens into usage info (#4520) * add reasoning_tokens into usage info initial commit * add unit tests * modify unit test * modify and add unit tests * fix unit test * move steam usage to processor * modify processor * modify test_logprobs * modify test_logprobs.py * modify stream reasoning tokens accumulation * fix unit test * perf: Optimize task queue communication from engine to worker (#4531) * perf: Optimize task queue communication from engine to worker * perf: get_tasks to numpy * perf: get_tasks remove to_numpy * fix: request & replace ENV * remove test_e2w_perf.py * fix code style --------- Co-authored-by: Jiang-Jia-Jun <[email protected]> * Clean up ports after processing results (#4587) * [CI] Add /re-run command in PR comments to restart failed CI workflows (#4593) * [Others] api server exits when worker process is dead (#3271) * [fix] fix terminal hangs when worker process is dead * [chore] change sleep time of monitor * [chore] remove redundant comments * update docs --------- Co-authored-by: ApplEOFDiscord <[email protected]> Co-authored-by: ApplEOFDiscord <[email protected]> Co-authored-by: ltd0924 <[email protected]> Co-authored-by: yinwei <[email protected]> Co-authored-by: JYChen <[email protected]> Co-authored-by: qwes5s5 <[email protected]> Co-authored-by: Ryan <[email protected]> Co-authored-by: yyssys <[email protected]> Co-authored-by: ming1753 <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: zhangyue66 <[email protected]> Co-authored-by: kxz2002 <[email protected]> Co-authored-by: SunLei <[email protected]> Co-authored-by: Jiang-Jia-Jun <[email protected]> Co-authored-by: Zhang Yulong <[email protected]> Co-authored-by: YuBaoku <[email protected]> Co-authored-by: 李泳桦 <[email protected]>

kxz2002 added 2 commits October 21, 2025 14:11

add reasoning_tokens into usage info initial commit

fba4c55

add unit tests

6435c34

paddle-bot bot added the contributor External developers label Oct 21, 2025

kxz2002 and others added 10 commits October 21, 2025 19:55

modify unit test

c94e189

modify and add unit tests

6a0a7cc

fix unit test

da27c94

Merge branch 'develop' into reasoning_usage

49d4d83

move steam usage to processor

4a24fef

modify processor

67f9682

modify test_logprobs

60ae0bd

modify test_logprobs.py

a53990c

modify stream reasoning tokens accumulation

b55cee2

fix unit test

c8d5433

LiqinruiG approved these changes Oct 25, 2025

View reviewed changes

LiqinruiG merged commit 327fa4c into PaddlePaddle:develop Oct 25, 2025
26 of 27 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[DataProcessor] add reasoning_tokens into usage info #4520

[DataProcessor] add reasoning_tokens into usage info #4520

Uh oh!

kxz2002 commented Oct 21, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Oct 21, 2025

Uh oh!

LiqinruiG left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[DataProcessor] add reasoning_tokens into usage info #4520

[DataProcessor] add reasoning_tokens into usage info #4520

Uh oh!

Conversation

kxz2002 commented Oct 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Usage or Command

Accuracy Tests

Checklist

Uh oh!

paddle-bot bot commented Oct 21, 2025

Uh oh!

LiqinruiG left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kxz2002 commented Oct 21, 2025 •

edited

Loading