
fix(aiCore): normalize model ID before looking up thinking token limits (#13843)

Merged
DeJeune merged 1 commit into main from fix/thinking-budget on Mar 27, 2026

Conversation


@EurFelux EurFelux commented Mar 27, 2026

What this PR does

Before this PR:

findTokenLimit() in getReasoningEffort() was called with the raw model.id, which may contain provider prefixes or mixed casing (e.g., openai/qwen3.5-397b-a17b). This caused token limit lookups to fail, so effort was never correctly converted to thinking_budget for generic (OpenAI-compatible) providers.

After this PR:

The model ID is normalized via getLowerBaseModelName() before being passed to findTokenLimit(), ensuring correct token limit resolution and proper effort → thinking_budget conversion.
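A minimal sketch of the before/after behavior. The helper names `getLowerBaseModelName` and `findTokenLimit` come from the PR; their bodies, the table contents, and the limit values below are illustrative assumptions, not the project's actual code or data:

```typescript
// Illustrative sketch only: getLowerBaseModelName and findTokenLimit exist in
// the codebase, but these bodies and the limit values are assumptions.

type TokenLimit = { min: number; max: number };

// Strip an optional provider prefix (e.g. "openai/") and lowercase the rest.
function getLowerBaseModelName(modelId: string): string {
  const base = modelId.split("/").pop() ?? modelId;
  return base.toLowerCase();
}

// Lookup table keyed by normalized (lowercase, base-name) model IDs.
const THINKING_TOKEN_LIMITS: Record<string, TokenLimit> = {
  "qwen3.5-397b-a17b": { min: 1024, max: 38912 }, // values are made up
};

function findTokenLimit(modelId: string): TokenLimit | undefined {
  return THINKING_TOKEN_LIMITS[modelId];
}

// Before the fix: the raw, prefixed ID misses the table.
findTokenLimit("openai/Qwen3.5-397B-A17B"); // undefined

// After the fix: normalize first, then look up.
findTokenLimit(getLowerBaseModelName("openai/Qwen3.5-397B-A17B")); // hit
```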

Fixes #13831

Why we need it and why it was done in this way

The following tradeoffs were made:

The normalization is applied early in getReasoningEffort() and reuses the existing getLowerBaseModelName() utility, which is already used elsewhere in the same file. This is the minimal, consistent fix.

The following alternatives were considered:

  • Modifying findTokenLimit() itself to normalize internally — rejected because it would change the contract for all callers, some of which may already pass normalized IDs.
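For context, the conversion that the lookup feeds into can be pictured as below. The effort levels, ratios, and clamping are illustrative assumptions about how an effort level might scale into a `thinking_budget`, not Cherry Studio's actual mapping:

```typescript
// Hypothetical effort -> thinking_budget mapping; ratios are illustrative.
type Effort = "low" | "medium" | "high";
type TokenLimit = { min: number; max: number };

const EFFORT_RATIO: Record<Effort, number> = { low: 0.2, medium: 0.5, high: 0.8 };

// Scale the model's max thinking budget by the effort ratio,
// clamped up to the model's minimum budget.
function effortToThinkingBudget(effort: Effort, limit: TokenLimit): number {
  return Math.max(limit.min, Math.floor(limit.max * EFFORT_RATIO[effort]));
}
```

With a broken lookup, `findTokenLimit()` returns nothing and this step is skipped entirely, which is why `effort` never became a `thinking_budget` for generic providers.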

Breaking changes

None.

Special notes for your reviewer

While this fix correctly resolves the token limit lookup, there is a known follow-up concern: for some models (e.g., Qwen3.5), even a correctly computed low thinking_budget can paradoxically produce more thinking output than omitting the parameter entirely. See the linked issue discussion for details, and #13844 for tracking improvements to the effort mapping strategy.

Checklist

Release note

NONE

findTokenLimit() was called with the raw model.id instead of the
normalized (lowercased, base-name) variant, causing token limit lookups
to miss for models whose IDs contain provider prefixes or mixed casing.

Fixes #13831

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
Signed-off-by: icarus <[email protected]>
@DeJeune DeJeune merged commit a9d3a3f into main Mar 27, 2026
11 checks passed
@DeJeune DeJeune deleted the fix/thinking-budget branch March 27, 2026 04:53


Development

Successfully merging this pull request may close these issues.

[Bug]: Qwen3.5's chain of thought is too long, cannot be hidden, and cannot be closed
