fix(sanitizer): thread errorKind through pipeline to prevent false positives by dominicnunez · Pull Request #29031 · openclaw/openclaw

dominicnunez · 2026-02-27T18:08:47Z

Summary

Adds structured ErrorKind type and deriveErrorKind() to classify errors at the source (runner payload layer) instead of regex-matching text content in sanitizeUserFacingText
Threads errorKind through the sanitizer pipeline so sanitizeUserFacingText uses a switch on the caller-provided kind rather than regex matching the text body
Prevents false positives where assistant text mentioning billing errors, context overflow, rate limits, etc. was being incorrectly rewritten to user-facing error messages

Problem

sanitizeUserFacingText used regex patterns to detect error types in text content. This caused false positives: if an assistant reply mentioned a billing error (e.g. "check whether we need to re-enable billing") or context overflow, it would be silently rewritten to a canned error message like "billing error: please upgrade your plan".

Solution

The caller now classifies the error once via deriveErrorKind() at the point where the raw error message is available, and passes the errorKind through to sanitizeUserFacingText. The sanitizer uses a structured switch on errorKind instead of regex matching, eliminating false positives entirely.

Test plan

Added sanitizeuserfacingtext-errorkind.test.ts — tests structured errorKind dispatch in sanitizeUserFacingText
Added payloads.errorkind.test.ts — tests deriveErrorKind derivation in buildEmbeddedRunPayloads
Updated existing sanitizeuserfacingtext.test.ts to pass errorKind where needed
All 74 tests pass (vitest run)
Type-checks clean (no new errors introduced; 1 pre-existing error in extraparams.test.ts unrelated to this PR)
Lightly tested with local OpenClaw instance

🤖 AI-assisted (Claude Code). I understand the code and have verified the changes are correct.

greptile-apps · 2026-02-27T18:14:25Z

Greptile Summary

This PR successfully addresses the false positive issue where assistant text mentioning errors was being incorrectly rewritten. The solution threads a structured ErrorKind type through the sanitizer pipeline, replacing regex-based classification with explicit error kind classification at the source.

Key Changes

Added ErrorKind type with structured error categories (billing, rate_limit, timeout, auth, context_overflow, overloaded, format, compaction_failure, role_ordering, image_size, unknown)
Added deriveErrorKind() function to classify errors at the runner payload layer based on the raw error message
Updated sanitizeUserFacingText() to use a switch statement on the caller-provided errorKind instead of regex matching the text body
Threaded errorKind through the pipeline via ReplyPayload, EmbeddedPiRunResult, and related types
Comprehensive test coverage with new test files for errorKind derivation and sanitization

Architecture

The refactor correctly eliminates false positives by:

Classifying errors once at the source (buildEmbeddedRunPayloads, extractAssistantText, sessions-helpers)
Passing the errorKind alongside the error text through the pipeline
Using structured classification in sanitizeUserFacingText instead of regex pattern matching

Issue Found

deriveErrorKind() doesn't detect role ordering errors ("400 Incorrect role information", "roles must alternate"), returning "unknown" instead of "role_ordering". This affects sessions-helpers.ts where role ordering errors won't get the friendly "Message ordering conflict" message. The pattern exists in formatAssistantErrorText() but wasn't added to deriveErrorKind().

Most code paths work correctly because:

run.ts catches role ordering errors early (line 856) before deriveErrorKind is called
formatAssistantErrorText() formats the error text before deriveErrorKind runs in buildEmbeddedRunPayloads

But sessions-helpers.ts calls deriveErrorKind directly and relies on the result.

Confidence Score: 3/5

This PR is mostly safe to merge with one logical issue that should be fixed
Score reflects a well-architected solution that successfully solves the false positive problem, but has one missing error detection case in deriveErrorKind for role ordering errors. The issue is not critical because most code paths handle role ordering errors at other levels (in run.ts or via formatAssistantErrorText), but it creates an inconsistency where sessions-helpers.ts won't format role ordering errors with the friendly message. The test suite is comprehensive for covered cases but doesn't test role ordering error derivation, which would have caught this gap. The fix is straightforward: add the existing role ordering pattern check to deriveErrorKind.
Pay close attention to src/agents/pi-embedded-helpers/errors.ts - the deriveErrorKind function needs to handle role ordering errors

_{Last reviewed commit: 427404e}

greptile-apps

_{13 files reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

greptile-apps · 2026-02-27T18:14:29Z

src/agents/pi-embedded-helpers/errors.ts

  return false;
 }

+export function deriveErrorKind(rawErrorMessage: string): ErrorKind {
+  if (isCompactionFailureError(rawErrorMessage)) {
+    return "compaction_failure";
+  }
+  if (isLikelyContextOverflowError(rawErrorMessage)) {
+    return "context_overflow";
+  }
+  // Check overloaded before classifyFailoverReason, which conflates it with rate_limit.
+  if (isOverloadedErrorMessage(rawErrorMessage)) {
+    return "overloaded";
+  }
+  if (isImageDimensionErrorMessage(rawErrorMessage) || isImageSizeError(rawErrorMessage)) {
+    return "image_size";
+  }
+  const failoverReason = classifyFailoverReason(rawErrorMessage);
+  const errorKindValues: ReadonlySet<string> = new Set<ErrorKind>([
+    "billing",
+    "rate_limit",
+    "timeout",
+    "auth",
+    "context_overflow",


deriveErrorKind doesn't detect role ordering errors

The function checks for compaction_failure, context_overflow, overloaded, and image_size errors, but doesn't check for role ordering errors (e.g., "400 Incorrect role information" or "roles must alternate"). This means it returns "unknown" for these errors.

Impact: In sessions-helpers.ts (line 173), when deriveErrorKind is called on a role ordering error message, it returns "unknown". Then sanitizeUserFacingText is called with errorKind: "unknown", which doesn't match the "role_ordering" case in the switch statement. The error gets formatted as a generic HTTP error instead of the friendly "Message ordering conflict" message.

Suggested change

return false;

}

export function deriveErrorKind(rawErrorMessage: string): ErrorKind {

if (isCompactionFailureError(rawErrorMessage)) {

return "compaction_failure";

}

if (isLikelyContextOverflowError(rawErrorMessage)) {

return "context_overflow";

}

// Check overloaded before classifyFailoverReason, which conflates it with rate_limit.

if (isOverloadedErrorMessage(rawErrorMessage)) {

return "overloaded";

}

if (isImageDimensionErrorMessage(rawErrorMessage) || isImageSizeError(rawErrorMessage)) {

return "image_size";

}

const failoverReason = classifyFailoverReason(rawErrorMessage);

const errorKindValues: ReadonlySet<string> = new Set<ErrorKind>([

"billing",

"rate_limit",

"timeout",

"auth",

"context_overflow",

export function deriveErrorKind(rawErrorMessage: string): ErrorKind {

if (isCompactionFailureError(rawErrorMessage)) {

return "compaction_failure";

}

if (isLikelyContextOverflowError(rawErrorMessage)) {

return "context_overflow";

}

// Check overloaded before classifyFailoverReason, which conflates it with rate_limit.

if (isOverloadedErrorMessage(rawErrorMessage)) {

return "overloaded";

}

// Check role ordering errors

if (/incorrect role information|roles must alternate|400.*role|"message".*role.*information/i.test(rawErrorMessage)) {

return "role_ordering";

}

if (isImageDimensionErrorMessage(rawErrorMessage) || isImageSizeError(rawErrorMessage)) {

return "image_size";

}

const failoverReason = classifyFailoverReason(rawErrorMessage);

const errorKindValues: ReadonlySet<string> = new Set<ErrorKind>([

"billing", "rate_limit", "timeout", "auth", "context_overflow",

"overloaded", "format", "compaction_failure", "role_ordering", "image_size", "unknown",

]);

if (failoverReason && failoverReason !== "unknown" && errorKindValues.has(failoverReason)) {

return failoverReason as ErrorKind;

}

return "unknown";

}

Prompt To Fix With AI

This is a comment left during a code review. Path: src/agents/pi-embedded-helpers/errors.ts Line: 879-902 Comment: `deriveErrorKind` doesn't detect role ordering errors The function checks for compaction_failure, context_overflow, overloaded, and image_size errors, but doesn't check for role ordering errors (e.g., "400 Incorrect role information" or "roles must alternate"). This means it returns "unknown" for these errors. Impact: In `sessions-helpers.ts` (line 173), when `deriveErrorKind` is called on a role ordering error message, it returns "unknown". Then `sanitizeUserFacingText` is called with `errorKind: "unknown"`, which doesn't match the "role_ordering" case in the switch statement. The error gets formatted as a generic HTTP error instead of the friendly "Message ordering conflict" message. ```suggestion export function deriveErrorKind(rawErrorMessage: string): ErrorKind { if (isCompactionFailureError(rawErrorMessage)) { return "compaction_failure"; } if (isLikelyContextOverflowError(rawErrorMessage)) { return "context_overflow"; } // Check overloaded before classifyFailoverReason, which conflates it with rate_limit. if (isOverloadedErrorMessage(rawErrorMessage)) { return "overloaded"; } // Check role ordering errors if (/incorrect role information|roles must alternate|400.*role|"message".*role.*information/i.test(rawErrorMessage)) { return "role_ordering"; } if (isImageDimensionErrorMessage(rawErrorMessage) || isImageSizeError(rawErrorMessage)) { return "image_size"; } const failoverReason = classifyFailoverReason(rawErrorMessage); const errorKindValues: ReadonlySet<string> = new Set<ErrorKind>([ "billing", "rate_limit", "timeout", "auth", "context_overflow", "overloaded", "format", "compaction_failure", "role_ordering", "image_size", "unknown", ]); if (failoverReason && failoverReason !== "unknown" && errorKindValues.has(failoverReason)) { return failoverReason as ErrorKind; } return "unknown"; } ``` How can I resolve this? If you propose a fix, please make it concise.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: bf5c23f864

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-02-27T18:14:40Z

src/agents/pi-embedded-helpers/errors.ts

+  if (isImageDimensionErrorMessage(rawErrorMessage) || isImageSizeError(rawErrorMessage)) {
+    return "image_size";
+  }
+  const failoverReason = classifyFailoverReason(rawErrorMessage);


Classify role-ordering errors before fallback mapping

deriveErrorKind never emits "role_ordering": it only checks compaction/context/overloaded/image-size, then falls back to classifyFailoverReason(rawErrorMessage), and that helper has no role-ordering branch. Because this commit also removed regex reclassification from sanitizeUserFacingText, an assistant error like "400 Incorrect role information" now resolves to errorKind: "unknown" and skips the friendly "Message ordering conflict" rewrite, so raw provider text can leak through in paths that rely on deriveErrorKind + sanitizeUserFacingText (for example extractAssistantText).

Useful? React with 👍 / 👎.

dominicnunez · 2026-02-27T18:21:00Z

Addressed the review feedback from both Greptile and Codex — deriveErrorKind now detects role ordering errors ("400 Incorrect role information", "roles must alternate", etc.) and returns "role_ordering" instead of falling through to "unknown". Added a test case covering this path. See a428ac4.

The remaining CI failure is a pre-existing oxfmt formatting issue in CONTRIBUTING.md — not related to this PR.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: a428ac4a2d

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-02-27T18:23:11Z

src/agents/pi-embedded-helpers/errors.ts

+  if (isImageDimensionErrorMessage(rawErrorMessage) || isImageSizeError(rawErrorMessage)) {
+    return "image_size";
+  }
+  const failoverReason = classifyFailoverReason(rawErrorMessage);


Distinguish transient HTTP failures from timeouts

Using classifyFailoverReason(rawErrorMessage) here causes every transient HTTP 5xx/529 error to be classified as "timeout" (because that helper intentionally collapses transient transport failures into the failover bucket). After this commit, that errorKind is threaded into sanitizeUserFacingText, so messages like 500 Internal Server Error are now rewritten to "LLM request timed out." instead of preserving the HTTP/unavailable context, which regresses user-facing diagnostics in paths like assistant text extraction and reply normalization.

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 332394706c

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-02-27T18:46:25Z

src/auto-reply/reply/agent-runner-execution.ts

      defaultRuntime.error(`Embedded agent failed before reply: ${message}`);
      const safeMessage = isTransientHttp
-        ? sanitizeUserFacingText(message, { errorContext: true })
+        ? sanitizeUserFacingText(message, { errorContext: true, errorKind: "timeout" })


Preserve transient HTTP details in pre-reply fallback

This path now hard-codes errorKind: "timeout" for every transient HTTP failure before a reply is produced, so errors like 500 Internal Server Error and 529 are rewritten to LLM request timed out. instead of preserving the actual HTTP status/context. In the ⚠️ Agent failed before reply: ... fallback path, that regression makes diagnostics misleading and hides actionable upstream failure details from operators.

Useful? React with 👍 / 👎.

dominicnunez · 2026-02-27T19:05:25Z

Addressed the second Codex review (transient HTTP 5xx misclassification) in 448b29a → 3323947.

deriveErrorKind no longer delegates to classifyFailoverReason, which collapses 5xx/529 into "timeout" for failover retry logic. Instead it calls each is* helper directly — isTimeoutErrorMessage, isRateLimitErrorMessage, isBillingErrorMessage, etc. — so the two classification paths (failover routing vs user-facing diagnostics) are fully decoupled.

Transient HTTP errors like 500 Internal Server Error now stay errorKind: "unknown" and get the existing raw-error formatting ("HTTP 500: Internal Server Error") instead of being misclassified as "LLM request timed out.". Added a test case confirming this.

Note: the check CI failure is a pre-existing oxfmt issue in CONTRIBUTING.md introduced by 2916152 — fix submitted in #29056.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 64c64e6e09

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-02-27T19:30:05Z

src/agents/pi-embedded-helpers/errors.ts

+  if (isOverloadedErrorMessage(rawErrorMessage)) {
+    return "overloaded";
+  }


Keep transient HTTP statuses out of overloaded mapping

deriveErrorKind now classifies "503 Service Unavailable" as "overloaded" because it checks isOverloadedErrorMessage before any transient-HTTP guard, and sanitizeUserFacingText then rewrites that to the generic overloaded copy. This is a regression from the previous behavior where HTTP-prefixed errors were preserved as HTTP 503: ..., so operators lose the actionable status code/context for transient upstream failures.

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b6a070cd6c

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-02-27T20:48:44Z

src/agents/pi-embedded-utils.ts

+  const errorKind =
+    errorContext && msg.errorMessage ? deriveErrorKind(msg.errorMessage) : undefined;


Derive kind from content when errorMessage is missing

errorContext is set whenever stopReason === "error", but errorKind is only derived from msg.errorMessage; with this commit’s regex removal in sanitizeUserFacingText, error replies that only carry the provider error in content (including legacy transcript entries with missing errorMessage) now skip role/context/billing rewrites and surface raw provider text instead. For example, a content-only "400 Incorrect role information" no longer maps to the friendly ordering-conflict message on this path.

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 3369559144

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-02-27T20:54:38Z

src/agents/tools/sessions-helpers.ts

+  const errorKind =
+    errorContext && typeof errorMessage === "string" ? deriveErrorKind(errorMessage) : undefined;


Derive error kind when assistant errorMessage is missing

This path only calls deriveErrorKind when errorMessage is a string, but errorContext is also true for stopReason === "error"; after this commit removed regex reclassification from sanitizeUserFacingText, error turns that carry provider text only in content (e.g. legacy/history entries like "400 Incorrect role information") now bypass friendly rewrites and surface raw provider text in readLatestAssistantReply/sessions flows. Please derive errorKind from available error text when stopReason is error even if errorMessage is absent.

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 74c04f75a2

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-02-27T21:35:19Z

src/agents/pi-embedded-helpers/errors.ts

+  if (isTransientHttpError(rawErrorMessage)) {
+    return "unknown";
+  }
+  if (isOverloadedErrorMessage(rawErrorMessage)) {
+    return "overloaded";


Treat HTTP 5xx: strings as transient before overloaded mapping

deriveErrorKind now relies on isTransientHttpError to keep transient 5xx diagnostics raw, but that helper only recognizes 503 ... (space form) and misses formatted strings like HTTP 503: Service Unavailable; those then fall through to isOverloadedErrorMessage and get rewritten to the generic overloaded copy. This shows up in the new content-fallback classification paths (extractAssistantText in src/agents/pi-embedded-utils.ts and src/agents/tools/sessions-helpers.ts) when legacy/error-history entries only contain formatted HTTP text, causing loss of actionable status details that the new guard was intended to preserve.

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: c57b4a92df

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-02-28T17:14:07Z

src/agents/pi-embedded-utils.ts

+  const errorKind =
+    errorContext && msg.errorMessage ? deriveErrorKind(msg.errorMessage) : undefined;


Restrict errorKind override to real error turns

extractAssistantText now derives errorKind whenever errorMessage is present, even if the assistant turn was not an actual error (stopReason !== "error"). In that mixed-state case, sanitizeUserFacingText will replace normal assistant content with canned billing/rate-limit/etc text because the new errorKind switch ignores the extracted body once a kind is set. This regresses reply/session reads by hiding valid assistant output when providers attach stale error metadata; only derive/apply errorKind for true error turns (or after confirming the extracted text is itself an error payload).

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 20fa26d10a

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-02-28T17:27:27Z

src/agents/pi-embedded-helpers/errors.ts

+    failoverReason !== "auth_permanent" &&
+    failoverReason !== "model_not_found"
+  ) {
+    return failoverReason;
+  }


Preserve API internal-error details instead of forcing timeout

deriveErrorKind now forwards most classifyFailoverReason outputs directly, including "timeout"; because classifyFailoverReason treats JSON API internal-server payloads (for example {"type":"error","error":{"type":"api_error","message":"Internal server error"}}) as timeout, downstream sanitization rewrites those errors to LLM request timed out. and drops the provider/type details that formatAssistantErrorText already extracted. This regresses diagnostics in the payload normalization paths that now consume errorKind.

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 55157676ef

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-02-28T17:37:57Z

src/auto-reply/reply/agent-runner-execution.ts

-        ? sanitizeUserFacingText(message, { errorContext: true })
-        : message;
-      const trimmedMessage = safeMessage.replace(/\.\s*$/, "");
+      const trimmedMessage = message.replace(/\.\s*$/, "");


Sanitize transient HTTP fallback text before returning to users

This fallback now interpolates the raw exception string directly into ⚠️ Agent failed before reply: ..., so transient provider failures that include full HTML/error payloads (for example 521 <!doctype html>...) will be sent verbatim to end users after the retry path is exhausted. Before this change, transient errors were sanitized/formatted first, which avoided leaking noisy multi-line payloads and kept replies channel-safe; restoring formatted handling here would preserve HTTP context without dumping raw bodies.

Useful? React with 👍 / 👎.

bmendonca3

Blocking issues before merge:

errorKind derivation is now too narrow for provider paths that populate errorMessage without stopReason === "error".
the transient HTTP fallback path regresses and now surfaces raw provider or network failure text again.

The sanitizer threading is moving in the right direction, but these two reply-surface regressions need to be fixed first.

bmendonca3 · 2026-02-28T17:58:46Z

src/agents/pi-embedded-utils.ts

-  return sanitizeUserFacingText(extracted, { errorContext });
+  const isErrorTurn = msg.stopReason === "error";
+  const errorContext = isErrorTurn || Boolean(msg.errorMessage?.trim());
+  const errorKind =


This only derives errorKind when stopReason === "error", even though errorContext is also true when the provider filled errorMessage. That regresses the provider path where we have an error string but no stopReason, because those turns lose the specific rewrite and fall back to generic sanitization. I would derive the kind whenever errorContext is true, using errorMessage when present and extracted as the fallback.

bmendonca3 · 2026-02-28T17:58:46Z

src/auto-reply/reply/agent-runner-execution.ts

-        ? sanitizeUserFacingText(message, { errorContext: true })
-        : message;
-      const trimmedMessage = safeMessage.replace(/\.\s*$/, "");
+      const trimmedMessage = message.replace(/\.\s*$/, "");


This drops the previous transient-HTTP sanitization and now echoes the raw provider or network failure text back to the user after retry exhaustion. That is exactly the class of message we usually scrub, because it tends to contain transport noise or provider-internal phrasing. Please keep the old sanitizeUserFacingText(message, { errorContext: true }) path for transient HTTP failures before building the fallback text.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 40df5b0647

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-18T17:39:55Z

src/agents/pi-embedded-helpers/errors.ts

+      case "billing":
+        return BILLING_ERROR_USER_MESSAGE;


Preserve provider/model context in billing error payloads

When buildEmbeddedRunPayloads() has already formatted a billing failure with formatBillingErrorMessage(provider, model), the later sanitizer pass still sees errorKind: "billing" and unconditionally replaces that text with the generic BILLING_ERROR_USER_MESSAGE. On the normal reply-delivery paths (normalizeReplyPayload and the streaming normalization in agent-runner-execution.ts), a payload like ⚠️ Anthropic (claude-3-5-sonnet) returned a billing error… is therefore downgraded to provider-agnostic copy, so users/operators lose the specific provider/model that actually ran out of credits.

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: a2179c780b

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-18T17:50:07Z

src/agents/pi-embedded-runner/run/payloads.ts

+  const effectiveErrorSource = lastAssistantErrored
+    ? params.lastAssistant?.errorMessage?.trim() || fallbackRawAnswerText.trim() || undefined
+    : undefined;


Inspect content when errorMessage is only a wrapper

If an error turn carries a generic wrapper like Request failed in errorMessage but the detailed provider payload lives in content, effectiveErrorSource never inspects that content. deriveErrorKind() then stays unknown, formatAssistantErrorText() formats only the wrapper, and buildEmbeddedRunPayloads() can emit Request failed plus raw provider text instead of the friendly rewrite the old content-based sanitizer produced (for example 400 Incorrect role information no longer becomes the ordering-conflict message). This affects any provider or legacy transcript shape that stores the actionable error only in the assistant body.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-18T17:50:07Z

src/agents/pi-embedded-utils.ts

+  const trimmedErrorMessage = msg.errorMessage?.trim();
+  const errorSource = errorContext ? trimmedErrorMessage || extracted.trim() : undefined;
+  const errorKind = errorSource ? deriveErrorKind(errorSource) : undefined;


Inspect error content before trusting transcript errorMessage

This new trimmedErrorMessage || extracted.trim() precedence regresses transcript/session reads when stopReason === "error", errorMessage is generic, and the real provider error is only in content. In that case errorKind becomes unknown, so sanitizeUserFacingText() no longer applies the structured role-ordering/context/billing rewrites and commands that read the latest assistant reply now surface raw provider text; the same pattern is repeated in sessions-helpers.ts, so session-history reads regress too.

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: e3c357ae3b

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-18T18:09:23Z

src/agents/pi-embedded-helpers/errors.ts

+      case "auth":
+      case "auth_permanent":
+        return AUTH_ERROR_USER_MESSAGE;


Preserve actionable scope details for structured auth errors

Because extractAssistantText, sessions-helpers, and normalizeReplyPayload now thread errorKind, an auth payload like 401 {"error":{"message":"Missing scopes: model.request"}} reaches this branch and is collapsed to the generic authentication copy. Before this commit, the same text flowed through formatRawAssistantErrorForUi, which preserved the missing-scope detail (see the existing TUI expectations for Missing scopes: model.request), so users/operators now lose the exact permission they need to add.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-18T18:09:23Z

src/agents/pi-embedded-helpers/errors.ts

+      case "image_size":
+        return IMAGE_SIZE_USER_MESSAGE;
+      case "model_not_found":
+        return MODEL_NOT_FOUND_USER_MESSAGE;


Keep model-specific guidance in model_not_found replies

This generic rewrite now discards the concrete model identifier and any recovery hint carried by the original error text. For example, provider-catalog errors like Unknown model: openai/gpt-5.3-codex-spark ... Use openai-codex/gpt-5.3-codex-spark. will now be reduced to The selected model is not available, even though the original message told the user exactly which replacement to use. Since the new pipeline derives errorKind before sanitizing assistant/session replies, that guidance is no longer surfaced on those paths.

Useful? React with 👍 / 👎.

openclaw-barnacle bot added agents Agent runtime and tooling size: M labels Feb 27, 2026

greptile-apps bot reviewed Feb 27, 2026

View reviewed changes

chatgpt-codex-connector bot reviewed Feb 27, 2026

View reviewed changes

dominicnunez mentioned this pull request Feb 27, 2026

ci: CONTRIBUTING.md fails oxfmt --check, blocking all PR CI #29039

Closed

dominicnunez force-pushed the fix-errorkind-sanitizer-pipeline branch from 448b29a to 3323947 Compare February 27, 2026 18:40

chatgpt-codex-connector bot reviewed Feb 27, 2026

View reviewed changes

dominicnunez mentioned this pull request Feb 27, 2026

style: fix oxfmt formatting in CONTRIBUTING.md #29056

Closed

1 task

dominicnunez mentioned this pull request Feb 27, 2026

Refactor: thread structured error classification through sanitizer pipeline #16521

Closed

dominicnunez force-pushed the fix-errorkind-sanitizer-pipeline branch from 3323947 to 64c64e6 Compare February 27, 2026 19:22

chatgpt-codex-connector bot reviewed Feb 27, 2026

View reviewed changes

openclaw-barnacle bot added channel: feishu Channel integration: feishu size: L and removed size: M labels Feb 27, 2026

chatgpt-codex-connector bot reviewed Feb 27, 2026

View reviewed changes

dominicnunez force-pushed the fix-errorkind-sanitizer-pipeline branch from df4b377 to 3e973c8 Compare February 28, 2026 17:05

openclaw-barnacle bot added size: M and removed channel: feishu Channel integration: feishu size: L labels Feb 28, 2026

chatgpt-codex-connector bot reviewed Feb 28, 2026

View reviewed changes

dominicnunez force-pushed the fix-errorkind-sanitizer-pipeline branch from 5515767 to 5dbed9b Compare February 28, 2026 17:40

bmendonca3 suggested changes Feb 28, 2026

View reviewed changes

dominicnunez force-pushed the fix-errorkind-sanitizer-pipeline branch from abcf03b to 9d39e5e Compare March 11, 2026 16:02

dominicnunez force-pushed the fix-errorkind-sanitizer-pipeline branch 4 times, most recently from 744def4 to 40df5b0 Compare March 18, 2026 17:31

chatgpt-codex-connector bot reviewed Mar 18, 2026

View reviewed changes

dominicnunez force-pushed the fix-errorkind-sanitizer-pipeline branch from 40df5b0 to a2179c7 Compare March 18, 2026 17:40

chatgpt-codex-connector bot reviewed Mar 18, 2026

View reviewed changes

fix(agents): harden embedded error sanitization paths

e3c357a

dominicnunez force-pushed the fix-errorkind-sanitizer-pipeline branch from a2179c7 to e3c357a Compare March 18, 2026 17:58

chatgpt-codex-connector bot reviewed Mar 18, 2026

View reviewed changes

dominicnunez closed this Mar 23, 2026

		const errorKind =
		errorContext && msg.errorMessage ? deriveErrorKind(msg.errorMessage) : undefined;

		const errorKind =
		errorContext && typeof errorMessage === "string" ? deriveErrorKind(errorMessage) : undefined;

Uh oh!

Conversation

dominicnunez commented Feb 27, 2026

Summary

Problem

Solution

Test plan

Uh oh!

greptile-apps bot commented Feb 27, 2026

Greptile Summary

Key Changes

Architecture

Issue Found

Confidence Score: 3/5

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

dominicnunez commented Feb 27, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

dominicnunez commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Feb 27, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Feb 28, 2026

dominicnunez commented Feb 27, 2026 •

edited

Loading