fix(agents): recover sandbox edit after post-write failure by 5Funingyuan · Pull Request #45964 · openclaw/openclaw

5Funingyuan · 2026-03-14T10:03:37Z

Summary

Problem: sandboxed edit could report a false "failed" result even when the file write had already succeeded, because sandboxed edit did not use the same post-write recovery path that host edit already had.
Why it matters: mutating tool failures are surfaced to users, so a false failure is misleading and can make a successful edit look unsafe or incomplete.
What changed: generalized wrapHostEditToolWithPostWriteRecovery(...) to accept injected path resolution and read-back behavior, then wired sandbox edit to verify through the sandbox fs bridge and return success when newText is present and oldText is no longer present after an upstream post-write throw.
What did NOT change (scope boundary): this does not change generic tool-runner error classification, does not change host edit behavior beyond keeping its existing default recovery path, and does not add any new permissions, config, or network behavior.

Change Type (select all)

Scope (select all touched areas)

Linked Issue/PR

Closes [Bug] edit tool reports 'failed' but file is actually modified #45770
Related #

User-visible / Behavior Changes

For sandboxed agent runs, a successful edit no longer emits a false failure when the upstream edit tool throws after the write has already landed (for example, during post-write diff/result formatting).

No config or default behavior changes outside this recovery path.

Security Impact (required)

New permissions/capabilities? (No)
Secrets/tokens handling changed? (No)
New/changed network calls? (No)
Command/tool execution surface changed? (No)
Data access scope changed? (No)
If any Yes, explain risk + mitigation:

Repro + Verification

Environment

OS: macOS
Runtime/container: Node 22.16.0, Vitest; sandbox path covered through a bridge-backed unit test
Model/provider: N/A
Integration/channel (if any): N/A
Relevant config (redacted): None

Steps

Create a sandboxed edit tool using a sandbox fs bridge.
Simulate an upstream post-write throw while bridge read-back shows newText present and oldText absent.
Execute the edit tool and observe whether it returns success or surfaces a failure.

Expected

If the write already landed, the edit should return success instead of surfacing a false failure.
Pre-write failures should still be reported as failures.

Actual

Before this change, sandboxed edit could surface failure because it did not use post-write recovery.
After this change, sandboxed edit recovers in the same class of post-write failure that host edit already handled.

Evidence

Attach at least one:

Failing test/log before + passing after
Trace/log snippets
Screenshot/recording
Perf numbers (if relevant)

$ volta run --node 22.16.0 pnpm test -- src/agents/pi-tools.read.host-edit-recovery.test.ts

RUN  v4.1.0 /Users/a1-6/openclaw

Test Files  1 passed (1)
Tests       4 passed (4)

greptile-apps · 2026-03-14T10:06:27Z

Greptile Summary

This PR fixes a false-failure bug in sandboxed edit: when the upstream edit library throws after successfully writing the file (e.g. during diff/result formatting), sandboxed edits now recover in the same way the host edit tool already did. The fix generalizes wrapHostEditToolWithPostWriteRecovery to accept injected resolvePath and readFile hooks, then wires the sandbox tool to verify through the SandboxFsBridge before deciding whether to recover or rethrow.

pi-tools.host-edit.ts — adds PostWriteRecoveryOptions so callers can inject path resolution and read-back behavior without changing the host default path.
pi-tools.read.ts — createSandboxedEditTool now wraps the base tool with post-write recovery using a pass-through path resolver and the bridge's readFile.
Test file adds a sandbox success test with a bridge mock, but is missing parity tests for the two rethrow conditions (newText absent; oldText still present), which leaves pre-write failure detection for sandbox unverified.
The default host readFile lambda drops the signal parameter, so an in-flight recovery read won't be aborted when a cancellation signal fires in host mode.

Confidence Score: 4/5

Safe to merge — the change is well-scoped, host behavior is unchanged, and the core recovery logic is correct.
The logic is sound: the generalization is minimal, the sandbox wiring correctly delegates path resolution and reads to the bridge, and the recovery condition (hasNew && !stillHasOld) mirrors the already-reviewed host implementation. The score is 4 rather than 5 because the default host readFile silently drops the AbortSignal, and the test suite is missing the two sandbox rethrow cases that would confirm pre-write failure detection works end-to-end for sandbox mode.
src/agents/pi-tools.host-edit.ts (signal handling in default readFile) and src/agents/pi-tools.read.host-edit-recovery.test.ts (missing sandbox rethrow tests).

Prompt To Fix All With AI

This is a comment left during a code review.
Path: src/agents/pi-tools.host-edit.ts
Line: 33-35

Comment:
**Default `readFile` silently drops `signal`**

The default `readFile` lambda captures only `filePath` and ignores the `signal` argument passed on line 68. If an `AbortSignal` is triggered while the recovery read is in flight (host mode, no custom `readFile` provided), the read will not be cancelled, potentially delaying or masking the abort.

```suggestion
  const readFile =
    options?.readFile ??
    ((filePath: string, signal?: AbortSignal) => fs.readFile(filePath, { encoding: "utf-8", signal }));
```

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: src/agents/pi-tools.read.host-edit-recovery.test.ts
Line: 91-123

Comment:
**Missing sandbox "should rethrow" test cases**

The host tool has three tests covering both recovery and the two rethrow conditions (newText absent, oldText still present). The new sandbox test only covers the success/recovery path. Without parity tests, a regression in the sandbox rethrow logic (e.g. a bad bridge mock returning the wrong content) would go undetected.

Consider adding at minimum:

```ts
it("rethrows for sandbox edit when bridge readFile shows newText is absent", async () => {
  const bridge: SandboxFsBridge = {
    ...
    readFile: vi.fn(async () => Buffer.from("unchanged content", "utf-8")),
  };
  const tool = createSandboxedEditTool({ root: "/workspace", bridge });
  await expect(
    tool.execute("call-1", { path: "/workspace/MEMORY.md", oldText: "x", newText: "never-written" }, undefined),
  ).rejects.toThrow("Simulated post-write failure");
});

it("rethrows for sandbox edit when bridge readFile shows oldText still present (pre-write failure)", async () => {
  const oldText = "replace me";
  const newText = "new content";
  const bridge: SandboxFsBridge = {
    ...
    readFile: vi.fn(async () => Buffer.from(`before ${oldText} after ${newText}`, "utf-8")),
  };
  const tool = createSandboxedEditTool({ root: "/workspace", bridge });
  await expect(
    tool.execute("call-1", { path: "/workspace/MEMORY.md", oldText, newText }, undefined),
  ).rejects.toThrow("Simulated post-write failure");
});
```

How can I resolve this? If you propose a fix, please make it concise.

_{Last reviewed commit: 361c450}

greptile-apps · 2026-03-14T10:06:30Z

src/agents/pi-tools.host-edit.ts

+  const readFile =
+    options?.readFile ??
+    ((filePath: string) => fs.readFile(filePath, "utf-8"));


Default readFile silently drops signal

The default readFile lambda captures only filePath and ignores the signal argument passed on line 68. If an AbortSignal is triggered while the recovery read is in flight (host mode, no custom readFile provided), the read will not be cancelled, potentially delaying or masking the abort.

Suggested change

const readFile =

options?.readFile ??

((filePath: string) => fs.readFile(filePath, "utf-8"));

const readFile =

options?.readFile ??

((filePath: string, signal?: AbortSignal) => fs.readFile(filePath, { encoding: "utf-8", signal }));

Prompt To Fix With AI

This is a comment left during a code review. Path: src/agents/pi-tools.host-edit.ts Line: 33-35 Comment: **Default `readFile` silently drops `signal`** The default `readFile` lambda captures only `filePath` and ignores the `signal` argument passed on line 68. If an `AbortSignal` is triggered while the recovery read is in flight (host mode, no custom `readFile` provided), the read will not be cancelled, potentially delaying or masking the abort. ```suggestion const readFile = options?.readFile ?? ((filePath: string, signal?: AbortSignal) => fs.readFile(filePath, { encoding: "utf-8", signal })); ``` How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-14T10:06:31Z

src/agents/pi-tools.read.host-edit-recovery.test.ts

+  it("returns success for sandbox edit when upstream throws after writing", async () => {
+    const filePath = "/workspace/MEMORY.md";
+    const oldText = "# Memory";
+    const newText = "Blog Writing";
+    const bridge: SandboxFsBridge = {
+      resolvePath: () => ({
+        hostPath: "/tmp/MEMORY.md",
+        relativePath: "MEMORY.md",
+        containerPath: filePath,
+      }),
+      readFile: vi.fn(async () => Buffer.from(`\n\n${newText}\n`, "utf-8")),
+      writeFile: vi.fn(async () => {}),
+      mkdirp: vi.fn(async () => {}),
+      remove: vi.fn(async () => {}),
+      rename: vi.fn(async () => {}),
+      stat: vi.fn(async () => ({ type: "file", size: newText.length, mtimeMs: Date.now() })),
+    };
+
+    const tool = createSandboxedEditTool({ root: "/workspace", bridge });
+    const result = await tool.execute("call-1", { path: filePath, oldText, newText }, undefined);
+
+    expect(result).toBeDefined();
+    const content = Array.isArray((result as { content?: unknown }).content)
+      ? (result as { content: Array<{ type?: string; text?: string }> }).content
+      : [];
+    const textBlock = content.find((b) => b?.type === "text" && typeof b.text === "string");
+    expect(textBlock?.text).toContain("Successfully replaced text");
+    expect(bridge.readFile).toHaveBeenCalledWith({
+      filePath,
+      cwd: "/workspace",
+      signal: undefined,
+    });
+  });


Missing sandbox "should rethrow" test cases

The host tool has three tests covering both recovery and the two rethrow conditions (newText absent, oldText still present). The new sandbox test only covers the success/recovery path. Without parity tests, a regression in the sandbox rethrow logic (e.g. a bad bridge mock returning the wrong content) would go undetected.

Consider adding at minimum:

it("rethrows for sandbox edit when bridge readFile shows newText is absent", async () => { const bridge: SandboxFsBridge = { ... readFile: vi.fn(async () => Buffer.from("unchanged content", "utf-8")), }; const tool = createSandboxedEditTool({ root: "/workspace", bridge }); await expect( tool.execute("call-1", { path: "/workspace/MEMORY.md", oldText: "x", newText: "never-written" }, undefined), ).rejects.toThrow("Simulated post-write failure"); }); it("rethrows for sandbox edit when bridge readFile shows oldText still present (pre-write failure)", async () => { const oldText = "replace me"; const newText = "new content"; const bridge: SandboxFsBridge = { ... readFile: vi.fn(async () => Buffer.from(`before ${oldText} after ${newText}`, "utf-8")), }; const tool = createSandboxedEditTool({ root: "/workspace", bridge }); await expect( tool.execute("call-1", { path: "/workspace/MEMORY.md", oldText, newText }, undefined), ).rejects.toThrow("Simulated post-write failure"); });

Prompt To Fix With AI

This is a comment left during a code review. Path: src/agents/pi-tools.read.host-edit-recovery.test.ts Line: 91-123 Comment: **Missing sandbox "should rethrow" test cases** The host tool has three tests covering both recovery and the two rethrow conditions (newText absent, oldText still present). The new sandbox test only covers the success/recovery path. Without parity tests, a regression in the sandbox rethrow logic (e.g. a bad bridge mock returning the wrong content) would go undetected. Consider adding at minimum: ```ts it("rethrows for sandbox edit when bridge readFile shows newText is absent", async () => { const bridge: SandboxFsBridge = { ... readFile: vi.fn(async () => Buffer.from("unchanged content", "utf-8")), }; const tool = createSandboxedEditTool({ root: "/workspace", bridge }); await expect( tool.execute("call-1", { path: "/workspace/MEMORY.md", oldText: "x", newText: "never-written" }, undefined), ).rejects.toThrow("Simulated post-write failure"); }); it("rethrows for sandbox edit when bridge readFile shows oldText still present (pre-write failure)", async () => { const oldText = "replace me"; const newText = "new content"; const bridge: SandboxFsBridge = { ... readFile: vi.fn(async () => Buffer.from(`before ${oldText} after ${newText}`, "utf-8")), }; const tool = createSandboxedEditTool({ root: "/workspace", bridge }); await expect( tool.execute("call-1", { path: "/workspace/MEMORY.md", oldText, newText }, undefined), ).rejects.toThrow("Simulated post-write failure"); }); ``` How can I resolve this? If you propose a fix, please make it concise.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f528009bfd

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-14T10:27:44Z

src/agents/pi-tools.read.ts

    operations: createSandboxEditOperations(params),
  }) as unknown as AnyAgentTool;
-  return wrapToolParamNormalization(base, CLAUDE_PARAM_GROUPS.edit);
+  const withRecovery = wrapHostEditToolWithPostWriteRecovery(base, params.root, {


Avoid marking sandbox edits successful on pre-write failures

This new sandbox wiring unconditionally applies wrapHostEditToolWithPostWriteRecovery, whose heuristic returns success whenever the file currently contains newText and not oldText; that also matches pre-write failures like "oldText not found" when the file already had the target text before this call. In that case the edit did not actually apply any replacement, but callers now get a success result in sandbox mode, which can hide real edit failures and lead the agent to continue with incorrect assumptions.

Useful? React with 👍 / 👎.

5Funingyuan · 2026-03-14T10:52:10Z

I took another pass through the latest failing checks after fixing the sandbox edit recovery test typing issue.

At this point, the remaining failures appear to be in unrelated extension and Windows test shards rather than in the sandbox edit recovery path changed in this PR. The current failures are showing up in areas like extensions/telegram, extensions/discord, extensions/whatsapp, extensions/slack, extensions/signal, and src/infra/heartbeat-runner.*, not in the src/agents/pi-tools.* changes here.

The targeted sandbox recovery test for this PR passes locally, and the PR-specific typing issue in src/agents/pi-tools.read.host-edit-recovery.test.ts has already been fixed in a follow-up commit.

fix(agents): recover sandbox edit after post-write failure

361c450

openclaw-barnacle bot added agents Agent runtime and tooling size: S labels Mar 14, 2026

greptile-apps bot reviewed Mar 14, 2026

View reviewed changes

5Funingyuan added 2 commits March 14, 2026 18:12

style(agents): format sandbox edit recovery changes

4d077ef

test(agents): tighten sandbox fs stat mock type

f528009

chatgpt-codex-connector bot reviewed Mar 14, 2026

View reviewed changes

5Funingyuan mentioned this pull request Mar 14, 2026

[Bug] edit tool reports 'failed' but file is actually modified #45770

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(agents): recover sandbox edit after post-write failure#45964

fix(agents): recover sandbox edit after post-write failure#45964
5Funingyuan wants to merge 3 commits intoopenclaw:mainfrom
5Funingyuan:fix-sandbox-edit-post-write-recovery

5Funingyuan commented Mar 14, 2026

Uh oh!

greptile-apps bot commented Mar 14, 2026

Uh oh!

greptile-apps bot Mar 14, 2026

Uh oh!

greptile-apps bot Mar 14, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Mar 14, 2026

Uh oh!

5Funingyuan commented Mar 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

5Funingyuan commented Mar 14, 2026

Summary

Change Type (select all)

Scope (select all touched areas)

Linked Issue/PR

User-visible / Behavior Changes

Security Impact (required)

Repro + Verification

Environment

Steps

Expected

Actual

Evidence

Uh oh!

greptile-apps bot commented Mar 14, 2026

Greptile Summary

Confidence Score: 4/5

Uh oh!

greptile-apps bot Mar 14, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Mar 14, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 14, 2026

Choose a reason for hiding this comment

Uh oh!

5Funingyuan commented Mar 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant