feat: add TPM throttling error handling with 1-minute retry delay by wenshao · Pull Request #1791 · QwenLM/qwen-code

wenshao · 2026-02-10T15:51:54Z

Add support for detecting and handling TPM (Tokens Per Minute) throttling errors. When a TPM throttling error is detected (e.g., 'Throttling: TPM(10680324/10000000)'), the system now waits 1 minute before retrying instead of using exponential backoff.

Changes:

- Add isTPMThrottlingError() function to detect TPM throttling errors
- Modify retryWithBackoff() to use fixed 1-minute delay for TPM errors
- Add unit tests for TPM throttling detection and retry behavior
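
The behavior described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the PR's code: only the names `isTPMThrottlingError` and `retryWithBackoff`, the example message, and the 1-minute delay come from the description. The regex, the `RetryOptions` shape, `nextDelayMs`, and the injectable `sleep` are hypothetical.

```typescript
// Hypothetical sketch: function bodies and option names are assumptions.

const TPM_RETRY_DELAY_MS = 60_000; // fixed 1-minute wait for TPM throttling

// Matches messages such as 'Throttling: TPM(10680324/10000000)'.
const TPM_PATTERN = /Throttling:\s*TPM\(\d+\/\d+\)/;

function isTPMThrottlingError(error: unknown): boolean {
  const message =
    error instanceof Error ? error.message :
    typeof error === 'string' ? error : '';
  return TPM_PATTERN.test(message);
}

interface RetryOptions {
  maxAttempts: number;
  initialDelayMs: number;
  maxDelayMs: number;
  // Injectable so tests can avoid real waiting.
  sleep?: (ms: number) => Promise<void>;
}

const defaultSleep = (ms: number): Promise<void> =>
  new Promise((resolve) => setTimeout(resolve, ms));

// Delay before the next attempt: fixed for TPM errors, exponential otherwise.
function nextDelayMs(error: unknown, attempt: number, opts: RetryOptions): number {
  if (isTPMThrottlingError(error)) return TPM_RETRY_DELAY_MS;
  return Math.min(opts.initialDelayMs * 2 ** (attempt - 1), opts.maxDelayMs);
}

async function retryWithBackoff<T>(fn: () => Promise<T>, opts: RetryOptions): Promise<T> {
  const sleep = opts.sleep ?? defaultSleep;
  for (let attempt = 1; ; attempt++) {
    try {
      return await fn();
    } catch (error) {
      // Out of attempts: surface the last error to the caller.
      if (attempt >= opts.maxAttempts) throw error;
      await sleep(nextDelayMs(error, attempt, opts));
    }
  }
}
```

Keeping the delay computation in a pure function makes the TPM branch easy to unit-test without timers.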

TLDR

Dive Deeper

Reviewer Test Plan

Testing Matrix

	🍏	🪟	🐧
npm run	❓	❓	❓
npx	❓	❓	❓
Docker	❓	❓	❓
Podman	❓	-	-
Seatbelt	❓	-	-

Linked issues / bugs

Add support for detecting and handling TPM (Tokens Per Minute) throttling errors. When a TPM throttling error is detected (e.g., 'Throttling: TPM(10680324/10000000)'), the system now waits 1 minute before retrying instead of using exponential backoff.

Changes:

- Add isTPMThrottlingError() function to detect TPM throttling errors
- Modify retryWithBackoff() to use fixed 1-minute delay for TPM errors
- Add unit tests for TPM throttling detection and retry behavior

Co-authored-by: Qwen-Coder

packages/core/src/utils/retry.ts

Co-authored-by: 易良

- Remove redundant error checking logic in isTPMThrottlingError function
- Reuse isStructuredError and isApiError utilities from quotaErrorDetection module
- Clean up duplicate import statements

- Move TPM throttling check before shouldRetryOnError to ensure TPM errors without standard HTTP status codes are still retried
- Add comprehensive unit tests for edge cases:
  - TPM error without status property
  - Nested TPM error object without top-level status
  - Consecutive TPM throttling errors
  - Max attempts exhaustion for TPM errors
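
The ordering matters because a status-based predicate rejects errors that carry no HTTP status. A minimal sketch of the idea, assuming a hypothetical `decideRetry` helper, a hypothetical `ErrorLike` shape, and a stand-in `shouldRetryOnError` that retries only on 429 and 5xx (the real predicate may differ):

```typescript
// Hypothetical sketch: error shapes and helper names are assumptions.

type RetryDecision = { retry: false } | { retry: true; delayMs: number };

interface ErrorLike {
  status?: number;
  message?: string;
  error?: { message?: string };
}

// Reads a message from a string, a top-level field, or a nested error object.
function messageOf(error: unknown): string {
  if (typeof error === 'string') return error;
  if (typeof error === 'object' && error !== null) {
    const e = error as ErrorLike;
    return e.message ?? e.error?.message ?? '';
  }
  return '';
}

function isTPMThrottlingError(error: unknown): boolean {
  return /Throttling:\s*TPM\(\d+\/\d+\)/.test(messageOf(error));
}

// Stand-in for the status-based predicate: retry only on 429 and 5xx.
function shouldRetryOnError(error: unknown): boolean {
  const status = (error as ErrorLike | null)?.status;
  return typeof status === 'number' && (status === 429 || (status >= 500 && status < 600));
}

function decideRetry(error: unknown, backoffMs: number): RetryDecision {
  // Checked first, so a TPM error with no HTTP status is still retried.
  if (isTPMThrottlingError(error)) return { retry: true, delayMs: 60_000 };
  if (!shouldRetryOnError(error)) return { retry: false };
  return { retry: true, delayMs: backoffMs };
}
```

If the two checks were swapped, the first two edge cases in the list (no `status` property, nested error without a top-level status) would fall through to `{ retry: false }`.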

- Change 'as' to 'as unknown as' for proper type casting

Add a .catch() handler to the promise before advancing timers, so that Node.js does not report an unhandled rejection when maxAttempts is exhausted during the TPM throttling retry test.

This reverts commit 9b882b4.

yiliang114 · 2026-02-12T05:27:14Z

During local simulation of a throttling event (TPM 12231856/10000000, HTTP 429), the error is gracefully handled in the background. End users in the TUI will not experience immediate disruption or error notifications. With debug logging enabled, these throttling events are recorded in the log files for operational visibility and diagnostics.

The left side shows the local proxy tool simulating a TPM throttling error. The right side shows the output from the local Qwen Code CLI. Below is the debug log file.

packages/core/src/utils/retry.ts

- Refactor retry utility to support GLM rate limit errors (code 1302) and TPM throttling
- Add getRateLimitRetryInfo() for unified rate-limit error detection
- Add exponential backoff for non-TPM rate limit errors
- Extend StreamEventType.RETRY with RetryInfo payload for UI feedback
- Add RetryCountdownMessage component for visual retry countdown
- Update useGeminiStream hook to handle retry events with countdown timer
- Add i18n support for rate limit messages (en/zh)

- Use fixed 60s delay matching DashScope per-minute quota window
- Increase max retries from 3 to 10 to align with Claude Code behavior
- Remove unused isTPMThrottlingError, isGLMRateLimitError, isRateLimitThrottlingError functions
- Simplify getRateLimitRetryInfo to only extract reason, delay is now caller's responsibility

Co-authored-by: Qwen-Coder
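
The split of responsibility in this commit (detection reports only a reason, the caller owns the delay and retry budget) might look roughly like the sketch below. The 60s delay and the limit of 10 retries come from the commit message. The constant names, `RateLimitDetector`, `RetryNotice`, and `retryOnRateLimit` are hypothetical, and the detector is injected here rather than reproducing getRateLimitRetryInfo.

```typescript
// Hypothetical sketch: all names below are assumptions for illustration.

const RATE_LIMIT_DELAY_MS = 60_000; // one per-minute quota window
const MAX_RATE_LIMIT_RETRIES = 10;

// Detection only explains why the request was limited; it carries no delay.
type RateLimitDetector = (error: unknown) => { reason: string } | null;

interface RetryNotice {
  reason: string;
  attempt: number;
  delayMs: number;
}

async function retryOnRateLimit<T>(
  fn: () => Promise<T>,
  detect: RateLimitDetector,
  sleep: (ms: number) => Promise<void>,
  onRetry: (notice: RetryNotice) => void = () => {},
): Promise<T> {
  for (let attempt = 1; ; attempt++) {
    try {
      return await fn();
    } catch (error) {
      const info = detect(error);
      // Not a rate limit, or retry budget spent: surface the error.
      if (info === null || attempt > MAX_RATE_LIMIT_RETRIES) throw error;
      // The caller, not the detector, decides how long to wait.
      onRetry({ reason: info.reason, attempt, delayMs: RATE_LIMIT_DELAY_MS });
      await sleep(RATE_LIMIT_DELAY_MS);
    }
  }
}
```

With these values a request that stays throttled waits at most 10 × 60s before the error is surfaced.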

pomelo-nwu · 2026-02-12T10:30:02Z

packages/core/src/utils/retry.ts

+  }
+
+  // Try to extract code from JSON embedded in error message string
+  const message = getErrorMessage(error);


Avoid overhandling errorMessage here.
There should be a dedicated component in the CLI package to display messages for different error types — this part should only handle retry logic.

- Extract rate-limit detection into dedicated rateLimit.ts module
- Support detection from ApiError, StructuredError, HttpError, and JSON strings
- Handle common rate-limit codes: 429, 503, 1302 (GLM)
- Simplify retry.ts by removing duplicated detection logic
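
A detection module of this kind could be sketched as below. The codes 429, 503, and 1302 come from the commit message. The object shapes are assumptions standing in for ApiError, StructuredError, and HttpError, whose real definitions are not shown here, and `extractCodes` and `isRateLimitError` are hypothetical names.

```typescript
// Hypothetical sketch: error shapes and function names are assumptions.

const RATE_LIMIT_CODES = new Set<number>([429, 503, 1302]);

function isRecord(v: unknown): v is Record<string, unknown> {
  return typeof v === 'object' && v !== null;
}

// Accepts numeric codes and numeric strings such as "1302".
function asCode(value: unknown): number | null {
  const n = typeof value === 'string' ? Number(value) : value;
  return typeof n === 'number' && Number.isInteger(n) ? n : null;
}

// Collect candidate codes from the shapes an error may arrive in.
function extractCodes(error: unknown): number[] {
  const found: number[] = [];
  const push = (v: unknown): void => {
    const c = asCode(v);
    if (c !== null) found.push(c);
  };
  if (isRecord(error)) {
    push(error.status); // HTTP-style error: { status: 429 }
    push(error.code); // top-level code
    if (isRecord(error.error)) push(error.error.code); // envelope: { error: { code } }
  }
  // JSON embedded in a message string, e.g. 'failed: {"error":{"code":1302}}'.
  const message =
    typeof error === 'string' ? error :
    isRecord(error) && typeof error.message === 'string' ? error.message : '';
  const start = message.indexOf('{');
  if (start >= 0) {
    try {
      const parsed: unknown = JSON.parse(message.slice(start));
      if (isRecord(parsed)) {
        push(parsed.code);
        if (isRecord(parsed.error)) push(parsed.error.code);
      }
    } catch {
      // The message did not contain parseable JSON; ignore it.
    }
  }
  return found;
}

function isRateLimitError(error: unknown): boolean {
  return extractCodes(error).some((c) => RATE_LIMIT_CODES.has(c));
}
```

Keeping this in its own module leaves the retry loop responsible only for deciding whether and when to try again.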

pomelo-nwu · 2026-02-13T09:27:14Z

packages/core/src/core/geminiChat.ts

 export type StreamEvent =
  | { type: StreamEventType.CHUNK; value: GenerateContentResponse }
-  | { type: StreamEventType.RETRY };
+  | { type: StreamEventType.RETRY; retryInfo?: RetryInfo };


should be retryInfo: RetryInfo
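
A small sketch of why the required field is easier on consumers. The `RetryInfo` fields, `ChunkPayload` (a stand-in for GenerateContentResponse), and `renderEvent` are hypothetical, and string literals stand in for the StreamEventType enum members to keep the example self-contained.

```typescript
// Hypothetical sketch: field names and the consumer function are assumptions.

interface RetryInfo {
  reason: string;
  attempt: number;
  delayMs: number;
}

// Stand-in for GenerateContentResponse.
interface ChunkPayload {
  text: string;
}

type StreamEvent =
  | { type: 'chunk'; value: ChunkPayload }
  | { type: 'retry'; retryInfo: RetryInfo };

function renderEvent(event: StreamEvent): string {
  switch (event.type) {
    case 'chunk':
      return event.value.text;
    case 'retry':
      // No undefined check is needed because retryInfo is required.
      return `retrying in ${event.retryInfo.delayMs / 1000}s (${event.retryInfo.reason})`;
  }
}
```

With `retryInfo?: RetryInfo`, every consumer of a RETRY event would need its own fallback for the missing case.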

pomelo-nwu

LGTM!

pomelo-nwu · 2026-02-13T09:32:45Z

@wenshao @yiliang114 Thanks for your contribution!

wenshao requested review from DennisYu07, LaZzyMan, Mingholy, gwinthis, pomelo-nwu and tanzhenxin as code owners February 10, 2026 15:51

Copilot AI mentioned this pull request Feb 10, 2026

Fix retry logic: Respect Retry-After headers for TPM throttling errors wenshao/qwen-code#2

Closed

pomelo-nwu assigned yiliang114 Feb 11, 2026

yiliang114 reviewed Feb 11, 2026

packages/core/src/utils/retry.ts (outdated)

wenshao and others added 5 commits February 11, 2026 16:56

Update packages/core/src/utils/retry.ts

c573c6a

Handle TPM throttling in stream retries

93a131d

refactor(core): simplify TPM throttling error detection logic

e9d2ead

test(core): fix type assertion in pipeline test for error_finish chunk

aef2921

yiliang114 approved these changes Feb 12, 2026

yiliang114 added 2 commits February 12, 2026 12:55

wip

9b882b4

test(core): add rejection handler to prevent unhandled rejection in TPM throttling test

1c38455

yiliang114 force-pushed the feat/tpm-throttling-retry branch from f8d914b to 1c38455 February 12, 2026 05:10

Revert "wip"

2394d73

pomelo-nwu reviewed Feb 12, 2026

packages/core/src/utils/retry.ts (outdated)

yiliang114 and others added 4 commits February 12, 2026 16:21

Merge branch 'main' into feat/tpm-throttling-retry-wenshao

3153ff5

fix(openai): tool call cleanup order when fixing streaming errors

0d2e394

pomelo-nwu reviewed Feb 12, 2026

pomelo-nwu reviewed Feb 13, 2026

pomelo-nwu approved these changes Feb 13, 2026

pomelo-nwu merged commit 001d010 into QwenLM:main Feb 13, 2026
24 of 25 checks passed

pomelo-nwu merged 14 commits into QwenLM:main from wenshao:feat/tpm-throttling-retry
