
fix: upgrade ollama provider to 3.3.1 #13085

Merged
DeJeune merged 6 commits into main from refactor-ollama on Feb 27, 2026

Conversation

Collaborator

@Pleasurecruise Pleasurecruise commented Feb 26, 2026

What this PR does

Before this PR:

After this PR:

Continues #12526.

Fix #11612 Fix #12642 Fix #13083

Why we need it and why it was done in this way

The following tradeoffs were made:

  • Bump ollama-ai-provider-v2 from 1.5.5 to 3.3.1 and update its patch filename.
  • Update pnpm lock entries for the new provider version and related provider deps.
  • Adjust compiled dist files to expose ollama provider option types and add support for the expanded "think" option (boolean | 'low' | 'medium' | 'high').
  • Add a new ollamaReasoningOrderMiddleware to reorder reasoning stream parts ahead of text/tool parts and wire it into AiSdkMiddlewareBuilder when reasoning is enabled.
  • Update the options builder to use OllamaProviderOptions and map assistant reasoning_effort (including explicitly disabling thinking when reasoning is off).
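The expanded option shape described above can be sketched in TypeScript (a simplified illustration; the real OllamaProviderOptions type in ollama-ai-provider-v2 likely carries more fields than shown here):

```typescript
// Sketch of the expanded "think" option after the upgrade.
type ThinkOption = boolean | 'low' | 'medium' | 'high'

// Simplified stand-in for the provider's options type.
interface OllamaProviderOptionsSketch {
  think?: ThinkOption
}

// Explicitly disable thinking when reasoning is off:
const offOptions: OllamaProviderOptionsSketch = { think: false }

// gpt-oss models accept an effort level instead of a boolean:
const gptOssOptions: OllamaProviderOptionsSketch = { think: 'high' }
```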

The following alternatives were considered:

Links to places where the discussion took place:

Breaking changes


Special notes for your reviewer

Checklist


Release note


  • Bump ollama-ai-provider-v2 from 1.5.5 to 3.3.1 and update its patch filename.
  • Update pnpm lock entries for the new provider version and related provider deps.
  • Adjust compiled dist files to expose ollama provider option types and add support for the expanded "think" option (boolean | 'low' | 'medium' | 'high').
  • Add a new ollamaReasoningOrderMiddleware to reorder reasoning stream parts ahead of text/tool parts and wire it into AiSdkMiddlewareBuilder when reasoning is enabled.
  • Update the options builder to use OllamaProviderOptions and map assistant reasoning_effort (including explicitly disabling thinking when reasoning is off).
Copilot AI review requested due to automatic review settings February 26, 2026 18:34
Contributor

Copilot AI left a comment


Pull request overview

Upgrades the Ollama AI SDK provider integration to [email protected], updates the local patch for the provider, and adjusts Cherry Studio’s Ollama option mapping and middleware to better support “thinking”/reasoning behavior and display order.

Changes:

  • Bump ollama-ai-provider-v2 to 3.3.1, update pnpm-lock.yaml, and replace the provider patch file.
  • Update Ollama provider option building to use OllamaProviderOptions and map think to support boolean or 'low' | 'medium' | 'high' (gpt-oss), including explicitly disabling thinking when reasoning is off.
  • Add and wire ollamaReasoningOrderMiddleware to reorder reasoning stream parts ahead of text/tool parts when reasoning is enabled.

Reviewed changes

Copilot reviewed 5 out of 6 changed files in this pull request and generated 3 comments.

File Description
src/renderer/src/aiCore/utils/options.ts Updates Ollama provider option typing and think mapping logic.
src/renderer/src/aiCore/middleware/ollamaReasoningOrderMiddleware.ts Introduces middleware to reorder Ollama reasoning/text stream parts.
src/renderer/src/aiCore/middleware/AiSdkMiddlewareBuilder.ts Wires the new Ollama middleware when reasoning is enabled.
package.json Bumps ollama-ai-provider-v2 and updates pnpm.patchedDependencies entry.
pnpm-lock.yaml Updates lockfile entries for the new provider version and transitive deps.
patches/[email protected] Updates the provider’s patched dist outputs/types/behavior for think and related exports.
Files not reviewed (1)
  • pnpm-lock.yaml: Language not supported
Comments suppressed due to low confidence (1)

patches/[email protected]:92

  • This patch adds new runtime exports (ollamaProviderOptions, ollamaCompletionProviderOptions) in dist/index.js, but the corresponding .d.ts/.d.mts export lists in the same patch still don’t export those values. That creates a runtime/type mismatch (and may differ from the ESM build) for consumers. Either export the values in the declaration files too (and in index.mjs if applicable), or avoid adding the extra runtime exports here.


Comment on lines +83 to +87
if (chunk.type === 'text-delta' || chunk.type === 'text-end') {
  if (!hasReasoning) {
    bufferedText.push(chunk)
    return
  }

Copilot AI Feb 26, 2026


In streaming mode this buffers all text-* chunks until a reasoning chunk appears. If the model never emits reasoning (or thinking was disabled but enableReasoning is still true upstream), the UI will receive no text until finish/error, effectively breaking streaming. Consider limiting buffering (e.g., only buffer initial text-start/a small amount with a timeout) and then pass through text normally when reasoning doesn’t appear early.
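A bounded-buffer variant of Copilot's suggestion might look like the following sketch (makeGate, MAX_BUFFERED, and the cap value are illustrative names, not code from the PR):

```typescript
// Sketch of bounded buffering: hold early text chunks only up to a cap,
// then give up on reordering and pass text straight through so the UI
// is never starved when no reasoning chunk ever arrives.
const MAX_BUFFERED = 8 // cap value is illustrative

function makeGate() {
  const buffered: string[] = []
  let passthrough = false
  return {
    // Returns the chunks that should be emitted downstream right now.
    onText(chunk: string): string[] {
      if (passthrough) return [chunk]
      buffered.push(chunk)
      if (buffered.length >= MAX_BUFFERED) {
        passthrough = true
        return buffered.splice(0) // flush everything held so far
      }
      return [] // keep holding, hoping reasoning shows up first
    }
  }
}
```

A time-based cutoff could replace the chunk-count cap; either way the point is that buffering must have an exit path that does not depend on reasoning parts appearing.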

Collaborator Author

@Pleasurecruise Pleasurecruise left a comment


Overall Assessment

This PR correctly upgrades ollama-ai-provider-v2 from 1.5.5 to 3.3.1, updates the patch file to expose the expanded think: boolean | 'low' | 'medium' | 'high' option, and adds an ollamaReasoningOrderMiddleware to address stream ordering issues. The dependency bump, lock file updates, and options.ts logic are all sound. However, the core buffering logic in wrapStream has a significant bug that would produce the same incorrect interleaved output it was intended to prevent.

Critical / Significant

Bug: flushBufferedText called at wrong points in wrapStream

File: ollamaReasoningOrderMiddleware.ts, lines 55 and 62

When text chunks arrive before reasoning-start (the exact scenario this middleware targets), they are buffered in bufferedText. The intent is to hold them until reasoning is complete and then emit them. However, the current code calls flushBufferedText inside the reasoning-start and reasoning-delta handlers, which emits the buffered text mid-reasoning:

Stream in: text-start → text-delta → reasoning-start → reasoning-delta → reasoning-end

Stream out (current): reasoning-start → text-start → text-delta → reasoning-delta → reasoning-end

The correct fix is to remove flushBufferedText from reasoning-start and reasoning-delta, and instead call it in the reasoning-end handler (and in endActiveReasoning for the synthetic end path triggered by text-start). See inline comments for details.
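The suggested fix can be sketched as a pure reordering function (illustrative only: reorder and Part are invented names, and the real middleware operates on a live stream rather than an in-memory array):

```typescript
// Illustrative stream-part shape; the real parts carry more fields.
type Part = {
  type:
    | 'reasoning-start'
    | 'reasoning-delta'
    | 'reasoning-end'
    | 'text-start'
    | 'text-delta'
    | 'text-end'
  text?: string
}

// Buffer text parts until the reasoning block ends, then flush them,
// so reasoning is always presented before text.
function reorder(parts: Part[]): Part[] {
  const out: Part[] = []
  const bufferedText: Part[] = []
  let reasoningDone = false
  for (const part of parts) {
    if (part.type.startsWith('text-') && !reasoningDone) {
      bufferedText.push(part) // hold text until reasoning completes
      continue
    }
    out.push(part)
    if (part.type === 'reasoning-end') {
      reasoningDone = true
      out.push(...bufferedText) // flush only after reasoning-end
      bufferedText.length = 0
    }
  }
  out.push(...bufferedText) // no reasoning ever appeared: emit at end
  return out
}
```

The key point is that the flush happens at reasoning-end, never inside the reasoning-start or reasoning-delta handlers.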

Minor / Nit

  • Missing tests for the new middleware. The buffering logic is non-trivial and the bug above demonstrates how easy it is to get it wrong. Tests for at least the 4 key scenarios (normal order, inverted order, no reasoning, incomplete reasoning) would give confidence in any future changes.

Positives

  • wrapGenerate (non-streaming) reordering is correct and clean.
  • Explicit think: false when reasoning is disabled (fixing issue #11612) is the right fix — previously the option was simply omitted, which may leave model-side thinking state ambiguous.
  • Switching from OllamaCompletionProviderOptions to OllamaProviderOptions is the correct type to use for chat-model provider options.
  • The gpt-oss model branch correctly maps 'low'/'medium'/'high' effort values to the new string-union think parameter.
  • Dependency upgrade, patch rename, and lock file entries are consistent and clean.

Collaborator

@EurFelux EurFelux Feb 27, 2026


Note

This comment was translated by Claude.

#10545 did something similar. It's worth investigating deeply why this issue has appeared again, as this issue may occur not only on ollama.



Collaborator Author

@Pleasurecruise Pleasurecruise Feb 27, 2026


Note

This comment was translated by Claude.

The patch mainly fixes the thinking depth configuration bug

  • When enableReasoning was on, the original code only performed a simple check, think = !['none', undefined].includes(reasoningEffort), for non-gpt-oss models; this made think false when reasoning_effort was undefined, so thinking was never actually enabled
  • For gpt-oss models, the old code did not handle reasoningEffort === 'none' or other unexpected values
  • When enableReasoning was off, the old code did not explicitly pass think: false, so Ollama kept its default thinking behavior (fixes "Bug: ollama cannot disable qwen3 thinking mode" #11612)
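The corrected mapping described above might be condensed as follows (mapThink is an invented helper, and the fallback for unexpected gpt-oss values is an assumption; the actual options.ts code may be structured differently):

```typescript
type ThinkOption = boolean | 'low' | 'medium' | 'high'

// Hypothetical helper condensing the mapping described above.
function mapThink(
  enableReasoning: boolean,
  reasoningEffort: string | undefined,
  isGptOss: boolean
): ThinkOption {
  // Reasoning off: pass think: false explicitly so Ollama does not fall
  // back to its default thinking behavior (issue #11612).
  if (!enableReasoning) return false
  if (isGptOss) {
    if (reasoningEffort === 'low' || reasoningEffort === 'medium' || reasoningEffort === 'high') {
      return reasoningEffort // string-union effort for gpt-oss models
    }
    // Handling of 'none'/unexpected values here is an assumption.
    return false
  }
  // Other models: enable thinking even when reasoning_effort is
  // undefined (the old check wrongly produced false here).
  return reasoningEffort !== 'none'
}
```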

The middleware mainly fixes the issue of incorrect order of reasoning content and text content
https://github.com/nordwestt/ollama-ai-provider-v2/blob/063ac6f8f84e3a691f1b8dd9ca1665a6ece4cff7/src/responses/ollama-responses-stream-processor.ts#L170-L177

processDelta(delta, controller) {
  this.processTextContent(delta, controller);  // ← Process text first
  this.processThinking(delta, controller);      // ← Then process reasoning
  this.processToolCalls(delta, controller);
}

[email protected], when processing each streaming delta, always calls processTextContent (emits text-start/text-delta) first, then calls processThinking (emits reasoning-start/reasoning-delta)




DeJeune and others added 3 commits February 27, 2026 20:11
Resolve merge conflicts:
- package.json: merge patched dependencies from both branches
- AiSdkMiddlewareBuilder.ts: accept deletion (refactored to plugin architecture)
- ollamaReasoningOrderMiddleware: convert to plugin pattern as ollamaReasoningOrderPlugin
- pnpm-lock.yaml: regenerate after dependency merge
- Restore @openrouter/ai-sdk-provider patch

Co-Authored-By: Claude Opus 4.6 <[email protected]>
Replace LanguageModelV2StreamPart with LanguageModelV3StreamPart and
use specificationVersion 'v3' to match the upgraded ai-sdk.

Co-Authored-By: Claude Opus 4.6 <[email protected]>
Swap processThinking/processTextContent order in ollama provider patch
so reasoning chunks are emitted before text chunks (Issue #12642).
This is a cleaner fix at the source level, removing the need for the
ollamaReasoningOrderPlugin middleware wrapper.

Co-Authored-By: Claude Opus 4.6 <[email protected]>
@DeJeune DeJeune requested a review from EurFelux February 27, 2026 12:34
Collaborator

@EurFelux EurFelux left a comment


Overall this looks good to me.

I verified the provider upgrade flow and the related option-mapping changes. The reasoning toggle behavior for Ollama is now explicit when reasoning is disabled, and the new type migration to OllamaProviderOptions is consistent with the dependency bump. Patch and lockfile updates are aligned with the version upgrade.

No blocking issues found from this review.

Use a truthy check (if (delta?.content)) in processTextContent instead of an explicit != null comparison so empty strings are not treated as valid content. Includes corresponding update to the compiled dist/index.mjs and refreshes the pnpm-lock.yaml patch hash to match the patched changes.
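The difference between the two checks in that commit can be sketched as (hasContentOld and hasContentNew are illustrative helpers, not the actual patched code):

```typescript
interface Delta {
  content?: string | null
}

// Before: emitted a text part even for an empty string.
const hasContentOld = (delta?: Delta) => delta?.content != null

// After: truthy check also skips '' (and null/undefined).
const hasContentNew = (delta?: Delta) => Boolean(delta?.content)
```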
@DeJeune DeJeune merged commit a965667 into main Feb 27, 2026
7 checks passed
@DeJeune DeJeune deleted the refactor-ollama branch February 27, 2026 14:52
