
feat: add model fallback support with TTFT-based timeout#13189

Open
keakon wants to merge 1 commit into anomalyco:dev from keakon:feat/model-fallback

Conversation


@keakon keakon commented Feb 11, 2026

What does this PR do?

This PR adds model fallback support for agents. When a primary model fails or is too slow to respond, the system automatically cycles through configured fallback models.
Related feature requests: #7602, #9575

Key changes:

  • New config options: `fallback_models` (list of fallback models in priority order) and `first_token_timeout` (base timeout in ms for first-token detection, scaled by input size: `base + inputChars * 0.5`)
  • Fallback cycling logic (SessionFallback): builds a deduplicated model list, cycles through all models starting from the last successful one, and remembers which model worked per session/agent pair
  • TTFT timeout: measures time-to-first-token from after LLM.stream() returns (excluding framework init overhead), so the timeout reflects actual network/model latency
  • Short retry before switching: for retryable API errors, one retry attempt is made on the same model before falling back to the next
  • Error handling: context overflow and user abort errors skip fallback (they are not model-specific); FirstTokenTimeoutError is unwrapped from AbortError so it triggers fallback instead of being treated as a user cancellation
  • Toast notification: when a fallback switch occurs, a toast is shown indicating the model change
  • Edge case fixes: clamps out-of-bounds fallbackStartIndex to prevent infinite loops when models are removed from config; resets both indices when the remembered model fails to resolve
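A minimal sketch of the cycling helpers described above, assuming the behavior stated in this summary. The function names follow the PR description (`buildModelList`, `nextIndex`), but the signatures and details are assumptions, not the actual implementation in src/session/fallback.ts:

```typescript
// Hypothetical sketch of the SessionFallback cycling logic.
function buildModelList(primary: string, fallbacks: string[]): string[] {
  // Deduplicate while preserving priority order, primary model first.
  return [...new Set([primary, ...fallbacks])]
}

function clampStartIndex(start: number, length: number): number {
  // Clamp a remembered index that may be out of bounds after models are
  // removed from config, preventing the infinite-loop edge case above.
  return start >= 0 && start < length ? start : 0
}

function nextIndex(current: number, length: number): number {
  // Cycle through all models with wraparound.
  return (current + 1) % length
}
```

Starting the cycle from the last successful model (rather than always from the primary) avoids repeatedly timing out on a known-slow primary within the same session.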

Files changed:

  • src/session/fallback.ts — new module for fallback state management and model cycling
  • src/session/processor.ts — main integration: TTFT timer, fallback loop, error classification
  • src/agent/agent.ts / src/config/config.ts — schema additions for fallback_models and first_token_timeout
  • src/session/prompt.ts / src/session/message-v2.ts — pass fallback config to processor; handle FirstTokenTimeoutError
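The error classification in src/session/processor.ts is described but not shown. A hedged sketch of how it might look, with error names taken from the PR description; the `cause`-unwrapping convention and the decision type are illustrative assumptions:

```typescript
// Hypothetical error classifier for the fallback loop.
type FallbackDecision = "fallback" | "retry-same" | "abort"

function classifyError(err: Error): FallbackDecision {
  if (err.name === "FirstTokenTimeoutError") return "fallback"
  if (err.name === "AbortError") {
    // A FirstTokenTimeoutError may arrive wrapped in an AbortError; unwrap it
    // so a slow model triggers fallback instead of looking like a user cancel.
    const cause = (err as { cause?: unknown }).cause
    if (cause instanceof Error && cause.name === "FirstTokenTimeoutError") {
      return "fallback"
    }
    return "abort" // genuine user cancellation: skip fallback entirely
  }
  if (err.name === "ContextOverflowError") return "abort" // not model-specific
  return "retry-same" // retryable API error: one retry before switching models
}
```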

How did you verify your code works?

  • Added unit tests for SessionFallback (buildModelList, nextIndex cycling, recordSuccess/getStartIndex, full cycle simulations including remembered-model and out-of-bounds edge cases) — see test/session/fallback.test.ts
  • Added unit tests for TTFT computation (estimateInputChars, computeTtftTimeout with various input sizes) and FirstTokenTimeoutError serialization — see test/session/processor-ttft.test.ts
  • Manual testing with multiple model configurations to verify fallback cycling, TTFT timeout triggering, and toast notifications
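The TTFT computation exercised by those tests might look roughly like this. The `base + inputChars * 0.5` scaling rule comes from the PR description; the race helper and the error class shape are illustrative assumptions:

```typescript
// Hypothetical sketch of the TTFT timeout mechanism.
class FirstTokenTimeoutError extends Error {
  constructor(readonly timeoutMs: number) {
    super(`no first token within ${timeoutMs}ms`)
    this.name = "FirstTokenTimeoutError"
  }
}

function computeTtftTimeout(baseMs: number, inputChars: number): number {
  // Larger prompts get proportionally more time before the timeout fires.
  return baseMs + inputChars * 0.5
}

async function raceFirstToken<T>(firstToken: Promise<T>, timeoutMs: number): Promise<T> {
  // Called only after the stream call has returned, so framework init
  // overhead is excluded and the timeout reflects network/model latency.
  let timer: ReturnType<typeof setTimeout>
  const timeout = new Promise<never>((_, reject) => {
    timer = setTimeout(() => reject(new FirstTokenTimeoutError(timeoutMs)), timeoutMs)
  })
  try {
    return await Promise.race([firstToken, timeout])
  } finally {
    clearTimeout(timer!)
  }
}
```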

When a primary model fails or is too slow to respond, the system
automatically cycles through configured fallback models.

- Add `fallback_models` and `first_token_timeout` config options
- Implement SessionFallback module for model cycling and state tracking
- Measure TTFT from HTTP request initiation (excluding framework init)
- Add short retry before switching models for retryable API errors
- Clamp out-of-bounds fallbackStartIndex to prevent infinite loops
- Handle FirstTokenTimeoutError separately from user abort
- Show toast notification on fallback model switch
- Add unit tests for fallback cycling and TTFT computation
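For illustration, a config fragment using the two new options might look like this; the key names come from the PR, but the surrounding structure and model identifiers are hypothetical:

```json
{
  "agent": {
    "model": "provider/primary-model",
    "fallback_models": ["provider/backup-model-1", "provider/backup-model-2"],
    "first_token_timeout": 3000
  }
}
```

With a 3000 ms base and the `base + inputChars * 0.5` scaling, a 10,000-character prompt would wait up to 8000 ms for the first token before falling back.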
@github-actions
Contributor

Thanks for your contribution!

This PR doesn't have a linked issue. All PRs must reference an existing issue.

Please:

  1. Open an issue describing the bug/feature (if one doesn't exist)
  2. Add Fixes #<number> or Closes #<number> to this PR description

See CONTRIBUTING.md for details.

@github-actions
Contributor

The following comment was generated by an LLM and may be inaccurate:

Based on my search, I found three potentially related PRs:

  1. PR #13172 - fix(copilot): add gpt-5.3-codex fallback model

  2. PR #11739 - feat: add runtime model fallback on retry exhaustion

  3. PR #8669 - fix(opencode): correct model fallback index tracking and config parsing

These PRs may represent earlier attempts at implementing fallback functionality or related features. PR #13189 appears to be a more comprehensive implementation with TTFT-based timeout support.

@aaryan-rampal

Seems useful. Any updates on getting this merged?
Otherwise, I wonder whether this adds latency when deciding to switch models. Is it always running a token-speed test, or only on delegation? I would also like multiple options for selecting a model: simple sequential (with wraparound), round-robin, etc. Perhaps users could even define their own logic. For my purposes, I want this functionality with a simple sequential ordering.

