fix: prevent false positive billing error detection in sanitizeUserFacingText by jpaine · Pull Request #12777 · openclaw/openclaw

jpaine · 2026-02-09T17:14:01Z

Summary

sanitizeUserFacingText() pattern-matches /\b402\b/ via isBillingErrorMessage() on all assistant text before channel delivery. This causes any response containing the literal number "402" in normal context (e.g., "$402.55" in a cost report, street addresses, item counts) to be silently replaced with a billing error warning on the channel surface.

Fixes #12711.

lobster-biscuit

Repro Steps

1. Have the assistant generate a response containing "402" in any non-error context (e.g., "$402.55 MTD spend")
2. Response is delivered to the channel as: ⚠️ API provider returned a billing error...
3. Dashboard view shows the original, correct response

Root Cause

sanitizeUserFacingText() calls isBillingErrorMessage(trimmed) unconditionally on all user-facing text. The billing error patterns include /\b402\b/, which matches "402" at any word boundary — including after $ (since $ is not a word character). Normal dollar amounts like "$402.55", addresses like "402 Main Street", and counts like "402 items" all trigger the false positive.
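
The word-boundary behavior can be checked in isolation. The sketch below uses only the bare pattern quoted above; the sample strings and the `BARE_402` and `matches402` names are illustrative and are not taken from the OpenClaw codebase.

```typescript
// The bare pattern quoted in the PR description. `\b` matches between a
// word character ([A-Za-z0-9_]) and a non-word character, so "$" followed
// by "4" counts as a boundary.
const BARE_402 = /\b402\b/;

// Hypothetical helper name, used only for this demonstration.
function matches402(text: string): boolean {
  return BARE_402.test(text);
}

const samples: string[] = [
  "$402.55 MTD spend", // dollar amount
  "402 Main Street", // street address
  "402 items", // item count
  "402 Payment Required", // genuine HTTP status line
  "order 1402", // "1" and "4" are both word characters, so no boundary
];

for (const sample of samples) {
  console.log(`${JSON.stringify(sample)} -> ${matches402(sample)}`);
}
```

The first four strings match and only the last one does not, so the pattern alone cannot tell a cost report apart from an HTTP status line.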

The context overflow check in the same function already gates itself behind error-like indicators (isRawApiErrorPayload, isLikelyHttpErrorText, ERROR_PREFIX_RE), but the billing check had no such guard.

Behavior Changes

sanitizeUserFacingText() now only rewrites billing errors when the text also looks like a leaked error payload (raw API JSON, HTTP status line, or error-prefixed text)
Normal assistant text containing "402" passes through unchanged
isBillingErrorMessage() itself is unchanged — it remains sensitive for use in formatAssistantErrorText() and classifyFailoverReason() where the input is already known to be an error message
Added "payment" to HTTP_ERROR_HINTS so "402 Payment Required" is properly recognized as an HTTP error
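
The gating described above can be sketched roughly as follows. This is a minimal illustration, not the OpenClaw implementation: the function and constant names mirror identifiers mentioned in this PR, but every body, the `BILLING_WARNING` placeholder text, and all `HTTP_ERROR_HINTS` entries other than "payment" are assumptions made for the example.

```typescript
// Hypothetical stand-ins: names follow the PR text, bodies are assumed.
const ERROR_PREFIX_RE = /^\s*error\b\s*:/i;
const HTTP_ERROR_HINTS: string[] = ["unauthorized", "forbidden", "payment"];
const BILLING_WARNING = "[billing warning placeholder]";

// Deliberately sensitive, as in the PR: a bare 402 or a billing keyword.
function isBillingErrorMessage(text: string): boolean {
  return /\b402\b/.test(text) || /insufficient credits|billing/i.test(text);
}

// Assumed shape: a JSON object carrying an "error" key.
function isRawApiErrorPayload(text: string): boolean {
  try {
    const parsed: unknown = JSON.parse(text);
    return typeof parsed === "object" && parsed !== null && "error" in parsed;
  } catch {
    return false;
  }
}

// Assumed shape: a three-digit status followed by a known error hint.
function isLikelyHttpErrorText(text: string): boolean {
  const match = /^\s*\d{3}\s+(.+)$/.exec(text);
  if (!match) return false;
  const rest = (match[1] ?? "").toLowerCase();
  return HTTP_ERROR_HINTS.some((hint) => rest.includes(hint));
}

function sanitizeUserFacingText(text: string): string {
  const trimmed = text.trim();
  // Structural guard: only treat the text as a billing error when it
  // also looks like a leaked error payload.
  const looksLikeError =
    isRawApiErrorPayload(trimmed) ||
    isLikelyHttpErrorText(trimmed) ||
    ERROR_PREFIX_RE.test(trimmed);
  if (looksLikeError && isBillingErrorMessage(trimmed)) {
    return BILLING_WARNING;
  }
  return text;
}

console.log(sanitizeUserFacingText("$402.55 MTD spend"));
console.log(sanitizeUserFacingText("402 Payment Required"));
```

In this sketch the "payment" hint is what lets a plain "402 Payment Required" line through the structural guard; without it, the status line would pass through unrewritten.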

Codebase and GitHub Search

Searched all callers of sanitizeUserFacingText (normalize-reply, agent-runner-execution, sessions-helpers, pi-embedded-utils)
Searched all callers of isBillingErrorMessage to confirm the function itself should not change (used in formatAssistantErrorText and classifyFailoverReason where input is known-error)
Reviewed the parallel pattern in shouldRewriteContextOverflowText which already gates context overflow detection behind error indicators

Tests

All pass (pnpm check + pnpm test), apart from one pre-existing, unrelated failure listed last:

New: "does not rewrite normal text containing '402'" — dollar amounts ($402.55), street addresses, item counts, historical years
New: "still rewrites leaked billing error payloads" — HTTP status ("402 Payment Required"), error-prefixed ("Error: 402 Payment Required"), JSON API error payloads with billing keywords
Existing isBillingErrorMessage tests unchanged and passing
Pre-existing failure in isCompactionFailureError test is unrelated (present on main)

Sign-Off

Models used: Claude claude-4.6-opus-high-thinking (Cursor agent)
Submitter effort: guided/reviewed by human

Made with Cursor

Greptile Overview

Greptile Summary

This PR adjusts sanitizeUserFacingText() to avoid rewriting normal assistant output into the billing warning when the text merely contains the number 402 (e.g. $402.55). It does this by gating the billing rewrite behind additional “looks like an error payload” checks (isRawApiErrorPayload, isLikelyHttpErrorText, or ERROR_PREFIX_RE). It also adds "payment" to HTTP_ERROR_HINTS so plain HTTP status lines like 402 Payment Required are recognized as HTTP-ish error text, and adds tests covering both the non-rewrite and rewrite cases.

Confidence Score: 5/5

This PR is safe to merge with minimal risk.
The change is small, localized to sanitizeUserFacingText(), and the new gating logic directly addresses the reported false positive without changing isBillingErrorMessage() behavior for known-error contexts. Added tests cover the new behavior, and the HTTP_ERROR_HINTS tweak aligns with existing isLikelyHttpErrorText() semantics. No additional call sites or invariants appear to be broken by this change.
No files require special attention


…12889 #12309 #3594 #7483 #10094 #10368 #11317 #11359 #11649 #12022 #12432 #12676 #12711; PRs #7567 #10220 #10601 #10620 #10760 #11680 #11685 #12052 #12226 #12433 #12702 #12720 #12726 #12777)

Takhoffman · 2026-02-10T01:53:20Z

Fixed in #12988.

This will go out in the next OpenClaw release.

If you still see this after updating to the first release that includes #12988, please open a new issue with:

your OpenClaw version
channel (Telegram/Slack/etc)
the exact prompt/response that got rewritten
whether Web UI showed the full text vs the channel being rewritten
relevant logs around send/normalize (if available)

Link back here for context.

dominicnunez · 2026-02-14T20:06:25Z

Great approach — gating isBillingErrorMessage behind structural error signals (isRawApiErrorPayload || isLikelyHttpErrorText || ERROR_PREFIX_RE) is the right pattern and matches how shouldRewriteContextOverflowText already works in the same file. The "payment" addition to HTTP_ERROR_HINTS is also correct and necessary.

Relationship to #12988

#12988 (merged to main, not yet released) added the errorContext flag so error-pattern rewrites only run when the caller knows the text is an error payload. That fixes the broadest class of false positives — normal assistant text mentioning "402" or billing keywords no longer gets rewritten. The billing regex was also tightened (from bare \b402\b to patterns requiring context like status: 402, http 402, error code 402, etc.), so $402.55 and 402 Main Street no longer match.
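
To illustrate what "patterns requiring context" could mean, here is a hypothetical approximation of a context-requiring 402 pattern. The exact expressions merged in #12988 are not quoted in this thread, so `CONTEXTUAL_402` and `hasContextual402` are assumptions built only from the examples named above (status: 402, http 402, error code 402).

```typescript
// Hypothetical pattern: 402 must follow a status/HTTP/error-code cue,
// optionally separated by ":" or "=". Not the regex from #12988.
const CONTEXTUAL_402 =
  /\b(?:status(?:\s+code)?|http|error\s+code)\s*[:=]?\s*402\b/i;

// Illustrative helper name.
function hasContextual402(text: string): boolean {
  return CONTEXTUAL_402.test(text);
}

const cases: string[] = [
  "status: 402",
  "HTTP 402 Payment Required",
  "error code 402",
  "$402.55 MTD spend",
  "402 Main Street",
];

for (const c of cases) {
  console.log(`${JSON.stringify(c)} -> ${hasContextual402(c)}`);
}
```

Only the first three strings match here. The sketch covers numeric patterns only; keyword-based billing detection (for example "payment required") would be handled separately.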

This PR is still needed because errorContext: true doesn't mean the text is a billing error — it just means it's some error. Error payloads from non-LLM sources can contain billing keywords in a non-LLM-billing context. For example, a tool error that says "Got a 402 from the upstream API" or an error containing "payment required" for a non-billing reason would match isBillingErrorMessage and get replaced with the billing warning, even though the user's LLM API key is fine. The structural guard this PR adds ensures billing rewrites only fire when the text also looks like a proper error payload (JSON, HTTP status line, or error-prefixed), closing that gap.

What needs updating

The PR needs a rebase onto current main (post-#12988). Two things to fix:

1. The diff targets stale code.

The hunk header shows sanitizeUserFacingText(text: string): string, but current main has:

export function sanitizeUserFacingText(text: string, opts?: { errorContext?: boolean }): string {

The billing check moved from line ~424 (unconditional) to line 526 (inside if (errorContext)). The structural guard needs to be applied there.

2. Tests need { errorContext: true } and updated inputs.

On current main, without errorContext, billing rewrites never run. The $402.55 test input also no longer matches the tightened billing regex, so it passes trivially even without this fix.

Tests should use inputs that actually trigger isBillingErrorMessage on current main, and pass errorContext: true:

// False positive — tool/upstream 402, not an LLM billing error.
// Should pass through even in error context.
expect(sanitizeUserFacingText("Got a 402 from the upstream API", { errorContext: true }))
  .not.toContain("billing");

// Real billing error payload — should still rewrite
expect(sanitizeUserFacingText("402 Payment Required", { errorContext: true }))
  .toContain("billing");

// JSON billing error — should still rewrite
expect(
  sanitizeUserFacingText(
    '{"type":"error","error":{"message":"insufficient credits","type":"billing_error"}}',
    { errorContext: true },
  ),
).toContain("billing");
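
For orientation, the rebased guard from item 1 could be shaped roughly like this. It is a sketch under stated assumptions: the `opts` signature follows the line quoted above (without `export`), while `SanitizeOpts`, `looksLikeErrorPayload`, the simplified `isBillingErrorMessage`, and the `BILLING_WARNING` placeholder are hypothetical stand-ins rather than code from main.

```typescript
type SanitizeOpts = { errorContext?: boolean };

// Placeholder for the real user-facing warning text.
const BILLING_WARNING = "[billing warning placeholder]";

// Hypothetical, simplified structural check combining the three signals
// named in this PR (raw API JSON, HTTP status line, error prefix).
const looksLikeErrorPayload = (t: string): boolean =>
  /^\{[\s\S]*"error"[\s\S]*\}$/.test(t) ||
  /^\d{3}\s+payment required\b/i.test(t) ||
  /^error\b\s*:/i.test(t);

// Hypothetical, intentionally broad billing matcher.
const isBillingErrorMessage = (t: string): boolean =>
  /\b402\b|payment required|insufficient credits/i.test(t);

function sanitizeUserFacingText(text: string, opts?: SanitizeOpts): string {
  const trimmed = text.trim();
  if (opts?.errorContext) {
    // errorContext says "this is some error"; the structural guard
    // narrows that to "this looks like a leaked billing payload".
    if (looksLikeErrorPayload(trimmed) && isBillingErrorMessage(trimmed)) {
      return BILLING_WARNING;
    }
  }
  return text;
}

console.log(
  sanitizeUserFacingText("Got a 402 from the upstream API", { errorContext: true }),
);
console.log(sanitizeUserFacingText("402 Payment Required", { errorContext: true }));
```

With these stand-ins, the upstream tool error passes through unchanged even in error context, while the status line is still rewritten.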


Takhoffman · 2026-02-19T14:00:50Z

Closing as superseded by the merged sanitize/error-context work:

#12988 "Agents: scope sanitizeUserFacingText rewrites to errorContext" (scope sanitizeUserFacingText rewrites behind errorContext)
#12946 "fix(telegram): reduce false positives billing error detection in conversation text" (billing false-positive reduction)
#13827 "fix(agents): narrow billing error 402 regex to avoid false positives on issue IDs" (402 pattern narrowing)

This PR’s intent appears covered by those merged changes and current mainline tests.

openclaw-barnacle bot added the "agents" label (Agent runtime and tooling) on Feb 9, 2026

Takhoffman self-assigned this Feb 10, 2026

Takhoffman mentioned this pull request on Feb 10, 2026 in #12988 "Agents: scope sanitizeUserFacingText rewrites to errorContext" (Merged)

dominicnunez mentioned this pull request on Feb 14, 2026 in #16521 "Refactor: thread structured error classification through sanitizer pipeline" (Open)

Commit 79e3ea2: fix: prevent false positive billing error detection in sanitizeUserFacingText (openclaw#12711)

jpaine force-pushed the fix/sanitize-billing-false-positive branch from fb52c02 to 79e3ea2 on February 15, 2026 02:53

openclaw-barnacle bot added the size: XS label Feb 15, 2026

thewilloftheshadow force-pushed the main branch from bfc1ccb to f92900f on February 15, 2026 18:46

Takhoffman closed this Feb 19, 2026

jpaine wants to merge 1 commit into openclaw:main from jpaine:fix/sanitize-billing-false-positive
