Skip to content

fix(cron): consolidate announce delivery, fire-and-forget trigger, and minimal prompt mode#40204

Merged
tyler6204 merged 2 commits intomainfrom
fix/cron-announce-and-trigger
Mar 8, 2026
Merged

fix(cron): consolidate announce delivery, fire-and-forget trigger, and minimal prompt mode#40204
tyler6204 merged 2 commits intomainfrom
fix/cron-announce-and-trigger

Conversation

@tyler6204
Copy link
Copy Markdown
Member

@tyler6204 tyler6204 commented Mar 8, 2026

Summary

  • keep subagent announces spawned from isolated cron runs internal so cron announce delivery stays the single external delivery path
  • consolidate descendant fallback output into the cron announce payload when the parent reply is still an interim placeholder
  • add a detached manual run path for gateway cron.run so the API returns immediately after enqueueing background work
  • use MINIMAL prompt mode for cron isolated agent turn sessions (they were wastefully loading the full chat prompt with SOUL.md, TOOLS.md, etc. when they only need tool access and a focused task instruction)

Testing

  • pnpm exec vitest run src/agents/pi-embedded-runner/run/attempt.test.ts src/agents/subagent-announce.timeout.test.ts src/cron/isolated-agent/delivery-dispatch.double-announce.test.ts src/gateway/server.cron.test.ts
  • pnpm test
    • unrelated existing failures reproduced locally in src/media/store.redirect.test.ts and src/process/supervisor/adapters/child.test.ts

Closes #40178
Closes #40192

@aisle-research-bot
Copy link
Copy Markdown

aisle-research-bot bot commented Mar 8, 2026

🔒 Aisle Security Analysis

We found 2 potential security issue(s) in this PR:

# Severity Title
1 🟠 High Unbounded in-memory cron command queue enables resource exhaustion via repeated cron.run enqueues
2 🟡 Medium Spoofable cron session keys treated as internal suppress subagent announcement delivery

1. 🟠 Unbounded in-memory cron command queue enables resource exhaustion via repeated cron.run enqueues

Property Value
Severity High
CWE CWE-400
Location src/cron/service/ops.ts:527-562

Description

The new cron.run behavior enqueues a background task (context.cron.enqueueRun) rather than executing immediately. The enqueue path has no queue length cap, backpressure, or per-caller throttling, which allows an authorized caller to rapidly create an arbitrarily large in-process backlog.

Key observations:

  • enqueueRun() always schedules a new queued task when the job is runnable, and returns success immediately.
  • The underlying queue implementation (enqueueCommandInLane) stores each task closure and Promise capability in memory (state.queue.push(...)) with no maximum size, so repeated requests can grow memory usage until OOM.
  • Because inspectManualRunDisposition() runs before the queued task sets runningAtMs, a burst of requests can enqueue many mode:"force" runs before the first queued run marks the job as running. With forced mode, each queued task will then execute sequentially, sustaining CPU/network load.
  • Log amplification: for each queued item that waits >5s, onWait emits a warning, enabling log flooding proportional to the queue size.

Vulnerable code (enqueue without bounds):

// src/cron/service/ops.ts
void enqueueCommandInLane(
  CommandLane.Cron,
  async () => run(state, id, mode),
  { warnAfterMs: 5_000, onWait: (...) => log.warn(...) },
);
return { ok: true, enqueued: true, runId };

Underlying queue has no size limit:

// src/process/command-queue.ts
state.queue.push({ task, resolve, reject, ... });

Impact:

  • Remote denial of service of the gateway process via memory exhaustion and/or sustained background job execution, by spamming cron.run (admin-scoped method).
  • Operational impact from excessive warning/error logs when the queue grows.

Recommendation

Implement backpressure and abuse controls for queued manual runs:

  1. Cap the queue depth for CommandLane.Cron (and/or globally) and fail fast when exceeded.
  2. Rate limit cron.run per authenticated client/device and/or per job ID.
  3. Coalesce/dedupe queued manual runs per job (e.g., allow at most one pending manual run per job unless explicitly requested).
  4. Throttle onWait logging (sample logs or log once per job/runId).

Example (queue cap in enqueueCommandInLane):

// src/process/command-queue.ts
const MAX_QUEUE_DEPTH_BY_LANE: Record<string, number> = {
  [CommandLane.Cron]: 100,
  [CommandLane.Main]: 500,
};

export function enqueueCommandInLane<T>(lane: string, task: () => Promise<T>, opts?: ...) {
  ...
  const maxDepth = MAX_QUEUE_DEPTH_BY_LANE[cleaned] ?? 500;
  const depth = state.queue.length + state.activeTaskIds.size;
  if (depth >= maxDepth) {
    return Promise.reject(new Error(`lane queue full: lane=${cleaned} depth=${depth}`));
  }
  ...
}

And in enqueueRun, consider rejecting or coalescing when there is already a pending manual run for that jobId (store a per-job pending flag under locked(...)).


2. 🟡 Spoofable cron session keys treated as internal suppress subagent announcement delivery

Property Value
Severity Medium
CWE CWE-807
Location src/agents/subagent-announce.ts:82-84

Description

The subagent announce flow now treats any session key that looks like a cron session as an internal requester:

  • isInternalAnnounceRequesterSession() returns true when isCronSessionKey(sessionKey) is true.
  • isCronSessionKey() is purely string/pattern based (any agent:<id>:cron:<...>), and does not verify the session was created by the cron subsystem.
  • Gateway RPC callers can supply arbitrary sessionKey values in the agent method (validated only for malformed agent: prefix), so a caller can spoof a cron-looking key.
  • When the requester is considered internal, subagent completion announcements are sent with deliver: false and with channel/to/accountId omitted, suppressing user-visible delivery and changing routing behavior.

Security impact if an untrusted/less-trusted client can call gateway.agent:

  • Delivery suppression / availability: attacker can prevent subagent completion messages from being delivered externally by choosing a cron-looking sessionKey.
  • Routing integrity: announcement routing decisions are based on an untrusted string rather than authenticated/authorized cron execution context.

Vulnerable logic (classification):

function isInternalAnnounceRequesterSession(sessionKey: string | undefined): boolean {
  return getSubagentDepthFromSessionStore(sessionKey) >= 1 || isCronSessionKey(sessionKey);
}

Recommendation

Do not rely on string parsing of a client-controllable session key to decide whether a requester is “internal”. Instead, base this on authenticated server-side state.

Recommended approaches:

  1. Mark cron sessions in the session store with non-spoofable metadata (set only by the cron service), then check that metadata:
// When creating cron sessions (cron subsystem only)
await sessions.patch({
  key: cronSessionKey,
  origin: { kind: "cron", jobId },
});// When deciding internal-ness
function isInternalAnnounceRequesterSession(sessionKey?: string): boolean {
  if (getSubagentDepthFromSessionStore(sessionKey) >= 1) return true;
  const entry = sessionKey ? loadSessionEntryByKey(sessionKey) : undefined;
  return entry?.origin?.kind === "cron";
}
  1. Gate cron-shaped session keys at the gateway boundary: reject or ignore agent:<id>:cron:* session keys from non-admin clients (or from any client except the cron subsystem), e.g. require senderIsOwner / ADMIN scope.

  2. If the intent is to cover only ephemeral cron run sessions, consider using the stricter isCronRunSessionKey() and still pair it with server-side verification.

Any of these ensures cron-only routing behavior cannot be triggered by an untrusted caller choosing a special-looking sessionKey.


Analyzed PR: #40204 at commit 219276e

Last updated on: 2026-03-08T22:44:48Z

@openclaw-barnacle openclaw-barnacle bot added gateway Gateway runtime agents Agent runtime and tooling size: S maintainer Maintainer-authored PR labels Mar 8, 2026
@greptile-apps
Copy link
Copy Markdown
Contributor

greptile-apps bot commented Mar 8, 2026

Greptile Summary

This PR makes three coordinated improvements to the cron delivery pipeline: it prevents subagent announces from bypassing the cron delivery path when the requester is a cron run session; it consolidates descendant fallback output into the cron announce payload when the cron agent's own reply is still an interim placeholder; and it detaches cron.run gateway calls so the API responds immediately after enqueueing background work.

  • subagent-announce.ts: Adds isInternalAnnounceRequesterSession helper that extends the existing requesterDepth >= 1 subagent guard to also cover cron session keys via isCronSessionKey. The closure captures the let variables requesterDepth and targetRequesterSessionKey by reference, so it correctly re-evaluates after the fallback-traversal mutation at lines 1368–1372.
  • ops.ts: Extracts prepareManualRun / finishPreparedManualRun from the monolithic run function and adds enqueueRun, which fires finishPreparedManualRun as a detached promise with a logged .catch. The typed PreparedManualRun discriminated union removes the now-dead if (!prepared.executionJob || ...) defensive check. Stale runningAtMs markers from mid-flight termination are handled by the existing startup cleanup.
  • server-methods/cron.ts: Switches cron.run handler to enqueueRun; response shape is unchanged so existing callers are unaffected.
  • Test coverage: Three new test cases cover the cron-session-as-requester internal delivery, descendant output consolidation, and immediate gateway return respectively.

Confidence Score: 4/5

  • Safe to merge; the refactoring is well-scoped and the fire-and-forget pattern is guarded with proper error handling and existing startup cleanup.
  • The logic changes are small and well-tested. The only non-trivial concern is that the new regression test for cron-session-as-requester relies on subagentSessionRunActive = true (the test-harness default), which may not reflect production behavior if cron sessions are not registered in the subagent run registry — the test does not cover the isSubagentSessionRunActive = false path for cron sessions. Whether that edge case can actually occur depends on whether isolated-agent cron runs create usable session entries, which looks likely from the code structure but is not verified in the test.
  • src/agents/subagent-announce.timeout.test.ts — the regression test at line 200 should ideally also cover the subagentSessionRunActive = false path for cron sessions to ensure no silent announce failure in that edge case.

Comments Outside Diff (1)

  1. src/agents/subagent-announce.timeout.test.ts, line 200-222 (link)

    Regression test relies on subagentSessionRunActive = true default, which may not reflect production behavior for cron sessions

    The test at line 200 validates that when requesterSessionKey is a cron session key, the announce is kept internal (deliver=false). However, the test relies on the subagentSessionRunActive = true default set in beforeEach (line 128).

    In the runSubagentAnnounceFlow logic (around line 1355), when requesterIsInternalSession() returns true for a cron session key, it enters the block and calls isSubagentSessionRunActive(cronSessionKey). In production, this may return false for cron sessions since they might not be registered as "active subagent runs" in the subagent run registry.

    If isSubagentSessionRunActive returns false, the code falls into:

    const parentSessionEntry = loadSessionEntryByKey(targetRequesterSessionKey);
    const parentSessionAlive = hasUsableSessionEntry(parentSessionEntry);
    if (!parentSessionAlive) {
      const fallback = resolveRequesterForChildSession(targetRequesterSessionKey);
      if (!fallback?.requesterSessionKey) {
        shouldDeleteChildSession = false;
        return false; // silently fails
      }
    }

    If isolated-agent cron runs do create usable session entries accessible via loadSessionEntryByKey, this path is fine and the announce would still be delivered internally. However, it would be worth confirming this is the case, and potentially adding a variant of this test where subagentSessionRunActive = false to verify the fallback path also keeps the announce internal (or at least doesn't silently drop it).

Last reviewed commit: 48b03aa

@tyler6204 tyler6204 force-pushed the fix/cron-announce-and-trigger branch from 48b03aa to ce0e11d Compare March 8, 2026 20:45
@tyler6204 tyler6204 changed the title fix(cron): consolidate announce delivery and make manual trigger fire-and-forget fix(cron): consolidate announce delivery, fire-and-forget trigger, and minimal prompt mode Mar 8, 2026
@tyler6204 tyler6204 self-assigned this Mar 8, 2026
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: ce0e11d289

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment on lines +1223 to +1224
const requesterIsInternalSession = () =>
requesterDepth >= 1 || isCronSessionKey(targetRequesterSessionKey);
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P1 Badge Preserve external notify path for late cron descendants

Treating every cron requester session as internal here means descendant completion announces from cron runs now go through requesterIsSubagent=true, which drives sendSubagentAnnounceDirectly to call agent with deliver=false (internal only). In the active-descendant timeout path, dispatchCronDelivery already returns early without sending (src/cron/isolated-agent/delivery-dispatch.ts:319-330), so those later descendant announces are the only remaining path to user-visible completion; after this change they are suppressed externally, and long-running cron jobs can finish without any final user update.

Useful? React with 👍 / 👎.

tyler6204 added a commit that referenced this pull request Mar 8, 2026
@tyler6204 tyler6204 force-pushed the fix/cron-announce-and-trigger branch from ce0e11d to 1d61df4 Compare March 8, 2026 21:24
@openclaw-barnacle openclaw-barnacle bot added docs Improvements or additions to documentation cli CLI command changes size: L and removed size: S labels Mar 8, 2026
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 1d61df4809

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

"cron: queued manual run background execution failed",
);
});
return { ok: true, enqueued: true, runId } as const;
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Return enqueue failures during gateway drain

This path always returns { ok: true, enqueued: true, runId } even when the queue rejects the task immediately. During restart, markGatewayDraining() causes enqueueCommandInLane(...) to reject with GatewayDrainingError, and this code only logs that rejection in the detached .catch, so cron.run reports success while the job is dropped and never executed. Manual runs submitted in that drain window therefore get silently lost from the caller’s perspective.

Useful? React with 👍 / 👎.

@tyler6204 tyler6204 force-pushed the fix/cron-announce-and-trigger branch from 1d61df4 to 219276e Compare March 8, 2026 21:46
@tyler6204 tyler6204 merged commit 38543d8 into main Mar 8, 2026
10 checks passed
@tyler6204 tyler6204 deleted the fix/cron-announce-and-trigger branch March 8, 2026 21:46
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 219276e940

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment on lines +534 to +538
void enqueueCommandInLane(
CommandLane.Cron,
async () => {
const result = await run(state, id, mode);
if (result.ok && "ran" in result && !result.ran) {
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Rebind queued cron.run tasks to the current cron service state

enqueueRun captures the current state in a fire-and-forget lane task, so queued manual runs can execute against an old CronServiceState after a hot reload replaces the cron service (src/gateway/server-reload-handlers.ts restarts cron but does not clear CommandLane.Cron). In that window, a request accepted before reload can run with stale config (for example, cronEnabled/store settings from the old state), which can produce runs after operators thought cron had been reconfigured or disabled.

Useful? React with 👍 / 👎.

GordonSH-oss pushed a commit to GordonSH-oss/openclaw that referenced this pull request Mar 9, 2026
…d minimal prompt mode (openclaw#40204)

* fix(cron): consolidate announce delivery and detach manual runs

* fix: queue detached cron runs (openclaw#40204)
jenawant pushed a commit to jenawant/openclaw that referenced this pull request Mar 10, 2026
…d minimal prompt mode (openclaw#40204)

* fix(cron): consolidate announce delivery and detach manual runs

* fix: queue detached cron runs (openclaw#40204)
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 10, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 10, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
sauerdaniel pushed a commit to sauerdaniel/openclaw that referenced this pull request Mar 11, 2026
…d minimal prompt mode (openclaw#40204)

* fix(cron): consolidate announce delivery and detach manual runs

* fix: queue detached cron runs (openclaw#40204)
Moshiii pushed a commit to Moshiii/openclaw that referenced this pull request Mar 11, 2026
…d minimal prompt mode (openclaw#40204)

* fix(cron): consolidate announce delivery and detach manual runs

* fix: queue detached cron runs (openclaw#40204)
Moshiii pushed a commit to Moshiii/openclaw that referenced this pull request Mar 11, 2026
…d minimal prompt mode (openclaw#40204)

* fix(cron): consolidate announce delivery and detach manual runs

* fix: queue detached cron runs (openclaw#40204)
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 11, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
dhoman pushed a commit to dhoman/chrono-claw that referenced this pull request Mar 11, 2026
…d minimal prompt mode (openclaw#40204)

* fix(cron): consolidate announce delivery and detach manual runs

* fix: queue detached cron runs (openclaw#40204)
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 12, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 13, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Taskle pushed a commit to Taskle/openclaw that referenced this pull request Mar 14, 2026
…d minimal prompt mode (openclaw#40204)

* fix(cron): consolidate announce delivery and detach manual runs

* fix: queue detached cron runs (openclaw#40204)
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 15, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 15, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 15, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
senw-developers pushed a commit to senw-developers/va-openclaw that referenced this pull request Mar 17, 2026
…d minimal prompt mode (openclaw#40204)

* fix(cron): consolidate announce delivery and detach manual runs

* fix: queue detached cron runs (openclaw#40204)
V-Gutierrez pushed a commit to V-Gutierrez/openclaw-vendor that referenced this pull request Mar 17, 2026
…d minimal prompt mode (openclaw#40204)

* fix(cron): consolidate announce delivery and detach manual runs

* fix: queue detached cron runs (openclaw#40204)
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 18, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 21, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 21, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 23, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 24, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
0x666c6f added a commit to 0x666c6f/openclaw that referenced this pull request Mar 26, 2026
* refactor: share Apple talk config parsing

* refactor: add canonical talk config payload

* refactor: centralize talk silence timeout defaults

* test: cover invalid talk config inputs

* test: decouple ios talk parsing coverage

* fix: resolve live config paths in status and gateway metadata (openclaw#39952)

* fix: resolve live config paths in status and gateway metadata

* fix: resolve remaining runtime config path references

* test: cover gateway config.set config path response

* fix(web-search): restore OpenRouter compatibility for Perplexity (openclaw#39937) (openclaw#39937)

* Zalo: fix provider lifecycle restarts (openclaw#39892)

* Zalo: fix provider lifecycle restarts

* Zalo: add typing indicators, smart webhook cleanup, and API type fixes

* fix review

* add allow list test secrect

* Zalo: bound webhook cleanup during shutdown

* Zalo: bound typing chat action timeout

* Zalo: use plugin-safe abort helper import

* fix(plugins): ship Feishu bundled runtime dependency (openclaw#39990)

* fix: ship feishu bundled runtime dependency

* test: align feishu bundled dependency specs

* fix(hooks): use resolveAgentIdFromSessionKey in runBeforeReset  (openclaw#39875)

Merged via squash.

Prepared head SHA: 00a2b24
Co-authored-by: rbutera <[email protected]>
Co-authored-by: altaywtf <[email protected]>
Reviewed-by: @altaywtf

* CLI: include commit hash in --version output (openclaw#39712)

* CLI: include commit hash in --version output

* fix(version): harden commit SHA resolution and keep output consistent

* CLI: keep install checks compatible with commit-tagged version output

* fix(cli): include commit hash in root version fast path

* test(cli): allow null commit-hash mocks

* Installer: share version parser across install scripts

* Installer: avoid sourcing helpers from stdin cwd

* CLI: note commit-tagged version output

* CLI: anchor commit hash resolution to module root

* CLI: harden commit hash resolution

* CLI: fix commit hash lookup edge cases

* CLI: prefer live git metadata in dev builds

* CLI: keep git lookup inside package root

* Infra: tolerate invalid moduleUrl hints

* CLI: cache baked commit metadata fallbacks

* CLI: align changelog attribution with prep gate

* CLI: restore changelog contributor credit

---------

Co-authored-by: echoVic <[email protected]>
Co-authored-by: echoVic <[email protected]>

* fix: fail closed talk provider selection

* fix: align talk config secret schemas

* refactor: require canonical talk resolved payload

* test: add talk config contract fixtures

* refactor: split talk gateway config loaders

* refactor: avoid checkout during prep head verification

* refactor: dedupe prep branch push flow

* fix: treat model api drift as baseUrl refresh

* refactor: extract pure models config merge helpers

* refactor: expand provider capability registry

* refactor: reuse one models.json read per write

* fix: publish models.json atomically

* refactor: scope prep push results to env artifacts

* refactor: fold implicit provider injection into resolver

* fix(telegram): use message previews in DMs

* CI: scope CodeQL JavaScript analysis

* feat: allow compaction model override via config (openclaw#38753)

Merged via squash.

Prepared head SHA: a3d6d6c
Co-authored-by: starbuck100 <[email protected]>
Co-authored-by: jalehman <[email protected]>
Reviewed-by: @jalehman

* fix(sessions): clear stale contextTokens on model switch (openclaw#38044)

Merged via squash.

Prepared head SHA: bac2df4
Co-authored-by: yuweuii <[email protected]>
Co-authored-by: jalehman <[email protected]>
Reviewed-by: @jalehman

* fix: prefer bundled channel plugins over npm duplicates (openclaw#40094)

* fix: prefer bundled channel plugins over npm duplicates

* fix: tighten bundled plugin review follow-ups

* fix: address check gate follow-ups

* docs: add changelog for bundled plugin install fix

* fix: align lifecycle test formatting with CI oxfmt

* docs: update Brave Search API docs for Feb 2026 plan restructuring (openclaw#40111)

Merged via squash.

Prepared head SHA: c651f07
Co-authored-by: remusao <[email protected]>
Co-authored-by: gumadeiras <[email protected]>
Reviewed-by: @gumadeiras

* Add too-many-prs override label handling

* CI: satisfy provider merge fixture typing

* Tests: reduce web search secret-scan noise

* Web search: rename Perplexity auth source helper

* Docs: use placeholder OpenRouter key in Perplexity guide

* Docs: use placeholder OpenRouter key in web tool docs

* Fixtures: normalize talk config API key placeholder

* Tests: lower entropy git commit fixtures

* Chore: widen xxxxx detect-secrets allowlist

* Chore: refresh detect-secrets baseline

* Web search: allowlist Perplexity auth source type name

* Chore: refresh detect-secrets baseline after docs line changes

* Chore: refresh detect-secrets baseline after final scan

* Chore: refresh detect-secrets baseline for Feishu docs

* CLI: set local gateway mode in setup

* Tests: format daemon lifecycle CLI coverage

* refactor: use model compat for anthropic tool payload normalization

* refactor: move bundled extension gap allowlists into manifests

* refactor: thread config runtime env through models config

* refactor: split models registry loading from persistence

* refactor: centralize transcript provider quirks

* refactor: decompose implicit provider resolution

* refactor: extract openai stream wrappers

* refactor: validate bundled extension release metadata

* refactor: extract static provider builders

* refactor: extract provider stream wrappers

* refactor: extract bundled extension manifest parser

* test: standardize hermetic provider env snapshots

* test: add implicit provider matrix coverage

* fix(acp): persist spawned child session history (openclaw#40137)

Merged via squash.

Prepared head SHA: 62de5d5
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* fix: require talk resolved payload

* refactor: dedupe android talk config parsing

* test: expand talk config contract fixtures

* refactor: split android talk voice resolution

* test: isolate plugin loader from mocked module cache

* test: isolate legacy plugin-sdk root import check

* test: isolate git commit resolution fallbacks

* refactor: simplify plugin sdk compatibility aliases

* refactor: centralize acp session resolution guards

* refactor: neutralize context engine runtime bridge

* refactor: split doctor config analysis helpers

* refactor: extract ios watch reply coordinator

* fix: restore acp session meta narrowing

* refactor: extract qmd process runner

* fix: restore gate after rebase

* docs: add Browserbase as hosted remote CDP option

Add Browserbase documentation section alongside the existing Browserless
section in the browser docs. Includes signup instructions, CDP connection
configuration, and environment variable setup for both English and Chinese
(zh-CN) translations.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

* Revert "docs: add Browserbase as hosted remote CDP option"

This reverts commit c469657.

* docs: add Browserbase as hosted remote CDP option

Add Browserbase documentation section alongside the existing Browserless
section in the browser docs. Includes signup instructions, CDP connection
configuration, and environment variable setup.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

* docs: fix duplicate heading lint error

Rename "Configuration" sub-heading to "Profile setup" to avoid
MD024/no-duplicate-heading conflict with the existing top-level
"Configuration" heading.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

* docs: fix Browserbase section to match official docs

Browserbase requires creating a session via their API to get a CDP
connect URL, unlike Browserless which uses a static endpoint. Updated
to show the correct curl-based session creation flow, removed
unverified static WebSocket URL, and added the 5-minute connect
timeout note from official docs.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

* docs: restore direct wss://connect.browserbase.com URL

Browserbase exposes a direct WebSocket connect endpoint that
auto-creates a session, similar to how Browserless works. Simplified
the section to use this static URL pattern instead of requiring
manual session creation via the API.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

* docs: fact-check Browserbase section against official docs

- Fix CAPTCHA/stealth/proxy claims: these are Developer plan+ only,
  not available on free tier
- Fix free tier limits: 1 browser hour, 15-min session duration
  (not "60 minutes of monthly usage")
- Add link to pricing page for paid plan details
- Simplify structure to match Browserless section format
- Remove sub-headings to match Browserless section style

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

* docs: simplify Browserbase section, drop pricing details

Restore platform-level feature description (CAPTCHA solving, stealth
mode, proxies) without plan-specific pricing gating. Keep free tier
note brief.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

* feat(browser): support direct WebSocket CDP URLs for Browserbase

Browserbase uses direct WebSocket connections (wss://) rather than the
standard HTTP-based /json/version CDP discovery flow used by Browserless.
This change teaches the browser tool to accept ws:// and wss:// URLs as
cdpUrl values: when a WebSocket URL is detected, OpenClaw connects
directly instead of attempting HTTP discovery.

Changes:
- config.ts: accept ws:// and wss:// in cdpUrl validation
- cdp.helpers.ts: add isWebSocketUrl() helper
- cdp.ts: skip /json/version when cdpUrl is already a WebSocket URL
- chrome.ts: probe WSS endpoints via WebSocket handshake instead of HTTP
- cdp.test.ts: add test for direct WebSocket target creation
- docs/tools/browser.md: update Browserbase section with correct URL
  format and notes

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* test+docs: comprehensive coverage and generic framing

- Add 12 new tests covering: isWebSocketUrl detection, parseHttpUrl WSS
  acceptance/rejection, direct WS target creation with query params,
  SSRF enforcement on WS URLs, WS reachability probing bypasses HTTP
- Reframe docs section as generic "Direct WebSocket CDP providers" with
  Browserbase as one example — any WSS-based provider works
- Update security tips to mention WSS alongside HTTPS

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* fix(browser): update existing tests for ws/wss protocol support

Two pre-existing tests still expected ws:// URLs to be rejected by
parseHttpUrl, which now accepts them. Switch the invalid-protocol
fixture to ftp:// and tighten the assertion to match the full
"must be http(s) or ws(s)" error message.

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* fix(browser): preserve wss:// cdpUrl in legacy default profile resolution

* chore: remove vendor-specific references from code comments

* style(browser): fix oxfmt formatting in config.ts

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

* fix: preserve loopback ws cdp tab ops (openclaw#31085) (thanks @shrey150)

* fix: share context engine registry across bundled chunks (openclaw#40115)

Merged via squash.

Prepared head SHA: 6af4820
Co-authored-by: jalehman <[email protected]>
Co-authored-by: jalehman <[email protected]>
Reviewed-by: @jalehman

* fix(browser): rewrite 0.0.0.0 and [::] wildcard addresses in CDP WebSocket URLs

Containerized browsers (e.g. browserless in Docker) report
`ws://0.0.0.0:<internal-port>` in their `/json/version` response.
`normalizeCdpWsUrl` rewrites loopback WS hosts to the external
CDP host:port, but `0.0.0.0` and `[::]` were not treated as
addresses needing rewriting, causing OpenClaw to try connecting
to `ws://0.0.0.0:3000` literally — which always fails.

Fixes openclaw#17752

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* fix: normalize wildcard remote CDP websocket URLs (openclaw#17760) (thanks @joeharouni)

* fix(browser): wait for extension tabs after relay drop (openclaw#32331)

* fix: wait for extension relay tab reconnects (openclaw#32461) (thanks @AaronWander)

* fix(infra): make browser relay bind address configurable

Add browser.relayBindHost config option so the Chrome extension relay
server can bind to a non-loopback address (e.g. 0.0.0.0 for WSL2).
Defaults to 127.0.0.1 when unset, preserving current behavior.

Closes openclaw#39214

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* fix(browser): add IP validation, fix upgrade handler for non-loopback bind

- Zod schema: validate relayBindHost with ipv4/ipv6 instead of bare string
- Upgrade handler: allow non-loopback connections when bindHost is explicitly
  non-loopback (e.g. 0.0.0.0 for WSL2), keeping loopback-only default
- Test: verify actual bind address via relay.bindHost instead of just checking
  reachability on 127.0.0.1 which passes regardless
- Expose bindHost on ChromeExtensionRelayServer type for inspection

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* fix: make browser relay bind address configurable (openclaw#39364) (thanks @mvanhorn)

* docs: add WSL2 + Windows remote Chrome CDP troubleshooting (openclaw#39407) (thanks @Owlock)

* macos: add remote gateway token field for remote mode

* macos: clarify remote token placeholder text

* macos: add mode-toggle remote token sync coverage

* tests: document remote token persistence across mode toggle

* fix(macos): preserve unsupported remote gateway tokens

* docs(changelog): credit macos remote token author

* fix(macos): improve tailscale gateway discovery (openclaw#40167)

Sanitized test tailnet hostnames and re-ran the targeted macOS gateway discovery test suite before merge.

* refactor(browser): scope CDP sessions and harden stale target recovery

* feat: add local backup CLI (openclaw#40163)

Merged via squash.

Prepared head SHA: ed46625
Co-authored-by: shichangs <[email protected]>
Co-authored-by: gumadeiras <[email protected]>
Reviewed-by: @gumadeiras

* fix(ci): scope secrets scan to branch changes

* fix(ci): refresh detect-secrets baseline

* fix: harden backup verify path validation

* fix(plugin-sdk): lazily load legacy root alias

* fix(setup-podman): cd to TMPDIR before podman load to avoid cwd permission error (openclaw#39435)

* fix(setup-podman): cd to TMPDIR before podman load to avoid inherited cwd permission error

* fix(podman): safe cwd in run_as_user to prevent chdir errors

Co-Authored-By: Claude Opus 4.6  <[email protected]>
Signed-off-by: sallyom <[email protected]>

---------

Signed-off-by: sallyom <[email protected]>
Co-authored-by: sallyom <[email protected]>
Co-authored-by: Claude Opus 4.6 (1M context) <[email protected]>

* fix(cron): consolidate announce delivery, fire-and-forget trigger, and minimal prompt mode (openclaw#40204)

* fix(cron): consolidate announce delivery and detach manual runs

* fix: queue detached cron runs (openclaw#40204)

* Gateway/iOS: replay queued foreground actions safely after resume (openclaw#40281)

Merged via squash.

- Local validation: `pnpm exec vitest run --config vitest.gateway.config.ts src/gateway/server-methods/nodes.invoke-wake.test.ts`
- Local validation: `pnpm build`
- mb-server validation: `pnpm exec vitest run --config vitest.gateway.config.ts src/gateway/server-methods/nodes.invoke-wake.test.ts`
- mb-server validation: `pnpm build`
- mb-server validation: `pnpm protocol:check`

* iOS: auto-load the scoped gateway canvas with safe fallback (openclaw#40282)

Merged via squash.

- mb-server validation: `swift test --package-path apps/shared/OpenClawKit --filter GatewayNodeSessionTests`
- mb-server validation: `pnpm build`
- Scope note: top-level `RootTabs` shell change was intentionally removed from this PR before merge

* Update CHANGELOG.md

* Update CHANGELOG.md

* fix(run-openclaw-podman): add SELinux :Z mount option on enforcing/permissive hosts (openclaw#39449)

* fix(run-openclaw-podman): add SELinux :Z mount option on Linux with enforcing/permissive SELinux

* fix(quadlet): add SELinux :Z label to openclaw.container.in volume mount

* fix(podman): add SELinux :Z mount option for Fedora/RHEL hosts

Co-Authored-By: Claude Opus 4.6 <[email protected]>
Signed-off-by: sallyom <[email protected]>

---------

Signed-off-by: sallyom <[email protected]>
Co-authored-by: sallyom <[email protected]>
Co-authored-by: Claude Opus 4.6 <[email protected]>

* Docker: trim runtime image payload (openclaw#40307)

* Docker: shrink runtime image payload

* Docker: add runtime pnpm opt-in

* Docker: collapse helper entrypoint chmod layers

* Docker: restore bundled pnpm runtime

* Update CHANGELOG.md

* docs(changelog): move post-2026.3.8 entries to unreleased (openclaw#40342)

* docs(changelog): move post-2026.3.8 entries to unreleased

* Update CHANGELOG.md

* fix(tui): improve color contrast for light-background terminals (openclaw#40345)

* fix(tui): improve colour contrast for light-background terminals (openclaw#38636)

Detect light terminal backgrounds via COLORFGBG and apply a WCAG
AA-compliant light palette. Adds OPENCLAW_THEME=light|dark env var
override for terminals without auto-detection.

Uses proper sRGB linearisation and WCAG 2.1 contrast ratios to pick
whichever text palette (dark or light) has higher contrast against
the detected background colour.

Co-authored-by: ademczuk <[email protected]>

* Update CHANGELOG.md

---------

Co-authored-by: ademczuk <[email protected]>
Co-authored-by: ademczuk <[email protected]>

* fix(models): keep --all aligned with synthetic catalog rows

* test(models): refresh list assertions after main sync

* fix: normalize openai-codex gpt-5.4 transport overrides

* docs: add refactor cluster backlog

* refactor: dedupe plugin runtime stores

* refactor: share gateway argv parsing

* refactor: extract gateway port diagnostics helper

* refactor: reuse broadcast route key construction

* refactor: share multi-account config schema fragments

* test: dedupe brave llm-context rejection cases

* refactor: share channel config adapter base

* fix(agents): bootstrap runtime plugins before context-engine resolution

* docs(changelog): remove rebase marker

* refactor: harden browser relay CDP flows

* fix(config): refresh runtime snapshot from disk after write. Fixes openclaw#37175 (openclaw#37313)

Merged via squash.

Prepared head SHA: 69e1861
Co-authored-by: bbblending <[email protected]>
Co-authored-by: gumadeiras <[email protected]>
Reviewed-by: @gumadeiras

* refactor: harden browser runtime profile handling

* refactor(models): extract list row builders

* refactor(agents): extract provider model normalization

* refactor(models): split models.json planning from writes

* refactor(models): split provider discovery helpers

* chore(docs): drop refactor cleanup tracker

* gateway: fix global Control UI 404s for symlinked wrappers and bundled package roots (openclaw#40385)

Merged via squash.

Prepared head SHA: 567b3ed
Co-authored-by: velvet-shark <[email protected]>
Co-authored-by: velvet-shark <[email protected]>
Reviewed-by: @velvet-shark

* Docker: improve build cache reuse (openclaw#40351)

* Docker: improve build cache reuse

* Tests: cover Docker build cache layout

* Docker: fix sandbox cache mount continuations

* Docker: document qr-import manifest scope

* Docker: narrow e2e install inputs

* CI: cache Docker builds in workflows

* CI: route sandbox smoke through setup script

* CI: keep sandbox smoke on script path

* fix(tests): correct security check failure

* docs(changelog): correct Control UI contributor credit (openclaw#40420)

Merged via squash.

Prepared head SHA: e4295fe
Co-authored-by: velvet-shark <[email protected]>
Co-authored-by: velvet-shark <[email protected]>
Reviewed-by: @velvet-shark

* fix(models): use 1M context for openai-codex gpt-5.4 (openclaw#37876)

Merged via squash.

Prepared head SHA: c410207
Co-authored-by: yuweuii <[email protected]>
Co-authored-by: jalehman <[email protected]>
Reviewed-by: @jalehman

* fix(telegram): add download timeout to prevent polling loop hang (openclaw#40098)

Merged via squash.

Prepared head SHA: abdfa1a
Co-authored-by: tysoncung <[email protected]>
Co-authored-by: obviyus <[email protected]>
Reviewed-by: @obviyus

* ACP: add optional ingress provenance receipts (openclaw#40473)

Merged via squash.

Prepared head SHA: b63e46d
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* alphabetize web search providers (openclaw#40259)

Merged via squash.

Prepared head SHA: be6350e
Co-authored-by: kesku <[email protected]>
Co-authored-by: obviyus <[email protected]>
Reviewed-by: @obviyus

* fix(plugin-sdk): remove remaining bundled plugin src imports (openclaw#39638)

Verified:
- pnpm build
- pnpm check
- pnpm test:macmini

Co-authored-by: Kyle <[email protected]>
Co-authored-by: Tak Hoffman <[email protected]>

* test: fix android talk config contract fixture

* chore(acpx): move runtime test fixtures to test-utils (openclaw#40548)

Verified:
- pnpm install --frozen-lockfile
- pnpm build
- pnpm check
- pnpm test:macmini

* fix(agents): re-expose configured tools under restrictive profiles

* fix(media): accept reader read result type

* build(protocol): sync generated swift models

* fix: dedupe inbound Telegram DM replies per agent (openclaw#40519)

Merged via squash.

Prepared head SHA: 6e235e7
Co-authored-by: obviyus <[email protected]>
Co-authored-by: obviyus <[email protected]>
Reviewed-by: @obviyus

* fix(matrix): restore robust DM routing without the memberCount heuristic (openclaw#19736)

* fix(matrix): remove memberCount heuristic from DM detection

The memberCount === 2 check in isDirectMessage() misclassifies 2-person
group rooms (admin channels, monitoring rooms) as DMs, routing them to
the main session instead of their room-specific session.

Matrix already distinguishes DMs from groups at the protocol level via
m.direct account data and is_direct member state flags. Both are already
checked by client.dms.isDm() and hasDirectFlag(). The memberCount
heuristic only adds false positives for 2-person groups.

Move resolveMemberCount() below the protocol-level checks so it is only
reached for rooms not matched by m.direct or is_direct. This narrows its
role to diagnostic logging for confirmed group rooms.

Refs: openclaw#19739

* fix(matrix): add conservative fallback for broken DM flags

Some homeservers (notably Continuwuity) have broken m.direct account
data or never set is_direct on invite events. With the memberCount
heuristic removed, these DMs are no longer detected.

Add a conservative fallback that requires two signals before classifying
as DM: memberCount === 2 AND no explicit m.room.name. Group rooms almost
always have explicit names; DMs almost never do.

Error handling distinguishes M_NOT_FOUND (missing state event, expected
for unnamed rooms) from network/auth errors. Non-404 errors fall through
to group classification rather than guessing.

This is independently revertable — removing this commit restores pure
protocol-based detection without any heuristic fallback.

* fix(matrix): add parentPeer for DM room binding support

Add parentPeer to DM routes so conversations are bindable by room ID
while preserving DM trust semantics (secure 1:1, no group restrictions).

Suggested by @KirillShchetinin.

* fix(matrix): override DM detection for explicitly configured rooms

Builds on @robertcorreiro's config-driven approach from openclaw#9106.

Move resolveMatrixRoomConfig() before the DM check. If a room matches
a non-wildcard config entry (matchSource === "direct") and was
classified as DM, override the classification to group. This gives users
a deterministic escape hatch for misclassified rooms.

Wildcards are excluded from the override to avoid breaking DM routing
when a "*" catch-all exists. roomConfig is gated behind isRoom so DMs
never inherit group settings (skills, systemPrompt, autoReply).

This commit is independently droppable if the scope is too broad.

* test(matrix): add DM detection and config override tests

- 15 unit tests for direct.ts: all detection paths, priority order,
  M_NOT_FOUND vs network error handling, edge cases (whitespace names,
  API failures)
- 8 unit tests for rooms.ts: matchSource classification, wildcard
  safety for DM override, direct match priority over wildcard

* Changelog: note matrix DM routing follow-up

* fix(matrix): preserve DM fallback and room bindings

---------

Co-authored-by: Tak Hoffman <[email protected]>

* Fix cron text announce delivery for Telegram targets (openclaw#40575)

Merged via squash.

Prepared head SHA: 54b1513
Co-authored-by: obviyus <[email protected]>
Co-authored-by: obviyus <[email protected]>
Reviewed-by: @obviyus

* fix: clear plugin discovery cache after plugin installation (openclaw#39752)

Verified:
- pnpm build
- pnpm check
- pnpm test:macmini

Co-authored-by: GazeKingNuWu <[email protected]>
Co-authored-by: Tak Hoffman <[email protected]>

* test: fix windows secrets runtime ci

* fix(daemon): enable LaunchAgent before bootstrap on restart

restartLaunchAgent was missing the launchctl enable call that
installLaunchAgent already performs. launchd can persist a "disabled"
state after bootout, causing bootstrap to silently fail and leaving the
gateway unloaded until a manual reinstall.

Fixes openclaw#39211

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* fix(daemon): also enable LaunchAgent in repairLaunchAgentBootstrap

The repair/recovery path had the same missing `enable` guard as
`restartLaunchAgent`.  If launchd persists a "disabled" state after a
previous `bootout`, the `bootstrap` call in `repairLaunchAgentBootstrap`
fails silently, leaving the gateway unloaded in the recovery flow.

Add the same `enable` guard before `bootstrap` that was already applied
to `installLaunchAgent` and (in this PR) `restartLaunchAgent`.

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* fix(gateway): exit non-zero on restart shutdown timeout

When a config-change restart hits the force-exit timeout, exit with
code 1 instead of 0 so launchd/systemd treats it as a failure and
triggers a clean process restart. Stop-timeout stays at exit(0)
since graceful stops should not cause supervisor recovery.

Closes openclaw#36822

* test(secrets): skip ACL-dependent runtime snapshot tests on windows

* fix: add changelog for restart timeout recovery (openclaw#40380) (thanks @dsantoreis)

* fix(browser): enforce redirect-hop SSRF checks

* fix(cron): restore owner-only tools for isolated runs

* test(cron): cover owner-only tool availability

* fix(msteams): enforce sender allowlists with route allowlists

* fix(gateway): validate config before restart to prevent crash + macOS permission loss (openclaw#35862)

When 'openclaw gateway restart' is run with an invalid config, the new
process crashes on startup due to config validation failure. On macOS,
this causes Full Disk Access (TCC) permissions to be lost because the
respawned process has a different PID.

Add getConfigValidationError() helper and pre-flight config validation
in both runServiceRestart() and runServiceStart(). If config is invalid,
abort with a clear error message instead of crashing.

The config watcher's hot-reload path already had this guard
(handleInvalidSnapshot), but the CLI restart/start commands did not.

AI-assisted (OpenClaw agent, fully tested)

* fix(gateway): catch startup failure in run loop to prevent process exit (openclaw#35862)

When an in-process restart (SIGUSR1) triggers a config-triggered restart
and the new config is invalid, params.start() throws and the while loop
exits, killing the process. On macOS this loses TCC permissions.

Wrap params.start() in try/catch: on failure, set server=null, log the
error, and wait for the next SIGUSR1 instead of crashing.

* test: add runServiceStart config pre-flight tests (openclaw#35862)

Address Greptile review: add test coverage for runServiceStart path.
The error message copy-paste issue was already fixed in the DRY refactor
(uses params.serviceNoun instead of hardcoded 'restart').

* fix: address bot review feedback on openclaw#35862

- Remove dead 'return false' in runServiceStart (Greptile)
- Include stack trace in run-loop crash guard error log (Greptile)
- Only catch startup errors on subsequent restarts, not initial start (Codex P1)
- Add JSDoc note about env var false positive edge case (Codex P1)

* fix: move config pre-flight before onNotLoaded in runServiceRestart (Codex P2)

The config check was positioned after onNotLoaded, which could send
SIGUSR1 to an unmanaged process before config was validated.

* fix: release gateway lock on restart failure + reply to Codex reviews

- Release gateway lock when in-process restart fails, so daemon
  restart/stop can still manage the process (Codex P2)
- P1 (env mismatch) already addressed: best-effort by design, documented
  in JSDoc

* fix(gateway): detect launchd supervision via XPC_SERVICE_NAME

On macOS, launchd sets XPC_SERVICE_NAME on managed processes but does
not set LAUNCH_JOB_LABEL or LAUNCH_JOB_NAME. Without checking
XPC_SERVICE_NAME, isLikelySupervisedProcess() returns false for
launchd-managed gateways, causing restartGatewayProcessWithFreshPid()
to fork a detached child instead of returning "supervised". The
detached child holds the gateway lock while launchd simultaneously
respawns the original process (KeepAlive=true), leading to an infinite
lock-timeout / restart loop.

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* fix: detect launchd supervision via xpc service name (openclaw#20555) (thanks @dimat)

* fix(node-host): bind bun and deno approval scripts

* fix(skills): pin validated download roots

* fix(telegram): abort in-flight getUpdates fetch on shutdown

When the gateway receives SIGTERM, runner.stop() stops the grammY polling
loop but does not abort the in-flight getUpdates HTTP request. That request
hangs for up to 30 seconds (the Telegram API timeout). If a new gateway
instance starts polling during that window, Telegram returns a 409 Conflict
error, causing message loss and requiring exponential backoff recovery.

This is especially problematic with service managers (launchd, systemd)
that restart the process immediately after SIGTERM.

Wire an AbortController into the fetch layer so every Telegram API request
(especially the long-polling getUpdates) aborts immediately on shutdown:

- bot.ts: Accept optional fetchAbortSignal in TelegramBotOptions; wrap
  the grammY fetch with AbortSignal.any() to merge the shutdown signal.
- monitor.ts: Create a per-iteration AbortController, pass its signal to
  createTelegramBot, and abort it from the SIGTERM handler, force-restart
  path, and finally block.

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* fix(telegram): use manual signal forwarding to avoid cross-realm AbortSignal

AbortSignal.any() fails in Node.js when signals come from different module
contexts (grammY's internal signal vs local AbortController), producing:
"The signals[0] argument must be an instance of AbortSignal. Received an
instance of AbortSignal".

Replace with manual event forwarding that works across all realms.

Co-Authored-By: Claude Opus 4.6 <[email protected]>

* fix: abort telegram getupdates on shutdown (openclaw#23950) (thanks @Gkinthecodeland)

* fix(cron): stagger missed jobs on restart to prevent gateway overload

When the gateway restarts with many overdue cron jobs, they are now
executed with staggered delays to prevent overwhelming the gateway.

- Add missedJobStaggerMs config (default 5s between jobs)
- Add maxMissedJobsPerRestart limit (default 5 jobs immediately)
- Prioritize most overdue jobs by sorting by nextRunAtMs
- Reschedule deferred jobs to fire gradually via normal timer

Fixes openclaw#18892

* fix: stagger missed cron jobs on restart (openclaw#18925) (thanks @rexlunae)

* build: update app deps except carbon

* refactor: extract telegram polling session

* refactor: split cron startup catch-up flow

* refactor: flatten supervisor marker hints

* docs: reorder 2026.3.8 changelog by impact

* build: sync pnpm lockfile

* chore: refresh secrets baseline

* docs: move 2026.3.8 entries back to unreleased

* test: fix Node 24+ test runner and subagent registry mocks

* test: fix Windows fake runtime bin fixtures

* fix: normalize windows runtime shim executables

* chore: prepare 2026.3.8-beta.1 release

* chore: update appcast for 2026.3.8-beta.1

* test: fix windows runtime and restart loop harnesses

* fix(update): re-enable launchd service before updater bootstrap

* chore: prepare 2026.3.8 npm release

* test: narrow gateway loop signal harness

* fix(launchd): harden macOS launchagent install permissions

* fix(onboard): avoid persisting talk fallback on fresh setup

* build: bump unreleased version to 2026.3.9

* fix: stabilize launchd paths and appcast secret scan

* build: sync plugin versions for 2026.3.9

* fix(ui): preserve control-ui auth across refresh (openclaw#40892)

Merged via squash.

Prepared head SHA: f9b2375
Co-authored-by: velvet-shark <[email protected]>
Co-authored-by: velvet-shark <[email protected]>
Reviewed-by: @velvet-shark

* fix(kimi-coding): fix kimi tool format: use native Anthropic tool schema instead of OpenAI … (openclaw#40008)

Verified:
- pnpm install --frozen-lockfile
- pnpm build
- pnpm check
- pnpm test:macmini

Co-authored-by: opriz <[email protected]>
Co-authored-by: Tak Hoffman <[email protected]>

* fix(swiftformat): exclude HostEnvSecurityPolicy.generated.swift from formatters (openclaw#39969)

* test(context-engine): add bundle chunk isolation tests for registry (openclaw#40460)

Merged via squash.

Prepared head SHA: 44622ab
Co-authored-by: dsantoreis <[email protected]>
Co-authored-by: jalehman <[email protected]>
Reviewed-by: @jalehman

* fix(agents): bound compaction retry wait and drain embedded runs on restart (openclaw#40324)

Merged via squash.

Prepared head SHA: cfd9956
Co-authored-by: cgdusek <[email protected]>
Co-authored-by: jalehman <[email protected]>
Reviewed-by: @jalehman

* Allow ACP sessions.patch lineage fields on ACP session keys (openclaw#40995)

Merged via squash.

Prepared head SHA: c1191ed
Co-authored-by: xaeon2026 <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* Update CONTRIBUTING.md

* Add Robin Waslander to maintainers

* Update CONTRIBUTING.md

* fix(acp): map error states to end_turn instead of unconditional refusal (openclaw#41187)

* fix(acp): map error states to end_turn instead of unconditional refusal

* fix: map ACP error stop reason to end_turn (openclaw#41187) (thanks @pejmanjohn)

---------

Co-authored-by: Pejman Pour-Moezzi <[email protected]>
Co-authored-by: Onur <[email protected]>

* fix(acp): propagate setSessionMode gateway errors to client (openclaw#41185)

* fix(acp): propagate setSessionMode gateway errors to client

* fix: add changelog entry for ACP setSessionMode propagation (openclaw#41185) (thanks @pejmanjohn)

---------

Co-authored-by: Pejman Pour-Moezzi <[email protected]>
Co-authored-by: Onur <[email protected]>

* plugins: harden global hook runner state (openclaw#40184)

* fix(telegram): bridge direct delivery to internal message:sent hooks (openclaw#40185)

* telegram: bridge direct delivery message hooks

* telegram: align sent hooks with command session

* Cron: enforce cron-owned delivery contract (openclaw#40998)

Merged via squash.

Prepared head SHA: 5877389
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* Agents: add embedded error observations (openclaw#41336)

Merged via squash.

Prepared head SHA: 4900042
Co-authored-by: altaywtf <[email protected]>
Co-authored-by: altaywtf <[email protected]>
Reviewed-by: @altaywtf

* Doctor: fix non-interactive cron repair gating (openclaw#41386)

* iOS: reconnect gateway on foreground return (openclaw#41384)

Merged via squash.

Prepared head SHA: 0e2e0dc
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* fix(cron): do not misclassify empty/NO_REPLY as interim acknowledgement (openclaw#41401)

* fix(cron): do not misclassify empty/NO_REPLY as interim acknowledgement

When a cron task's agent returns NO_REPLY, the payload filter strips the
silent token, leaving an empty text string. isLikelyInterimCronMessage()
previously returned true for empty input, causing the cron runner to
inject a forced rerun prompt ('Your previous response was only an
acknowledgement...').

Change the empty-string branch to return false: empty text after payload
filtering means the agent deliberately chose silent completion, not that
it sent an interim 'on it' message.

Fixes openclaw#41246

* fix(cron): do not misclassify empty/NO_REPLY as interim acknowledgement

Fixes openclaw#41246. (openclaw#41383) thanks @jackal092927.

---------

Co-authored-by: xaeon2026 <[email protected]>

* fix(auth): reset cooldown error counters on expiry to prevent infinite escalation (openclaw#41028)

Merged via squash.

Prepared head SHA: 89bd83f
Co-authored-by: zerone0x <[email protected]>
Co-authored-by: altaywtf <[email protected]>
Reviewed-by: @altaywtf

* Gateway: add pending node work primitives (openclaw#41409)

Merged via squash.

Prepared head SHA: a6d7ca9
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* Gateway: tighten node pending drain semantics (openclaw#41429)

Merged via squash.

Prepared head SHA: 361c2eb
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* acp: fail honestly in bridge mode (openclaw#41424)

Merged via squash.

Prepared head SHA: b5e6e13
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* acp: restore session context and controls (openclaw#41425)

Merged via squash.

Prepared head SHA: fcabdf7
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* Sandbox: import STATE_DIR from paths directly (openclaw#41439)

* acp: enrich streaming updates for ide clients (openclaw#41442)

Merged via squash.

Prepared head SHA: 0764368
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* acp: forward attachments into ACP runtime sessions (openclaw#41427)

Merged via squash.

Prepared head SHA: f2ac51d
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* acp: add regression coverage and smoke-test docs (openclaw#41456)

Merged via squash.

Prepared head SHA: 514d587
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* fix(agents): probe single-provider billing cooldowns (openclaw#41422)

Merged via squash.

Prepared head SHA: bbc4254
Co-authored-by: altaywtf <[email protected]>
Co-authored-by: altaywtf <[email protected]>
Reviewed-by: @altaywtf

* acp: harden follow-up reliability and attachments (openclaw#41464)

Merged via squash.

Prepared head SHA: 7d167df
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* Agents: add fallback error observations (openclaw#41337)

Merged via squash.

Prepared head SHA: 852469c
Co-authored-by: altaywtf <[email protected]>
Co-authored-by: altaywtf <[email protected]>
Reviewed-by: @altaywtf

* build(protocol): regenerate Swift models after pending node work schemas (openclaw#41477)

Merged via squash.

Prepared head SHA: cae0aaf
Co-authored-by: mbelinky <[email protected]>
Co-authored-by: mbelinky <[email protected]>
Reviewed-by: @mbelinky

* fix(discord): apply effective maxLinesPerMessage in live replies (openclaw#40133)

Merged via squash.

Prepared head SHA: 031d032
Co-authored-by: rbutera <[email protected]>
Co-authored-by: altaywtf <[email protected]>
Reviewed-by: @altaywtf

* Logging: harden probe suppression for observations (openclaw#41338)

Merged via squash.

Prepared head SHA: d18356c
Co-authored-by: altaywtf <[email protected]>
Co-authored-by: altaywtf <[email protected]>
Reviewed-by: @altaywtf

* ci(sre:PLA-760): fix smoke workflow fallbacks

* test(sre:PLA-760): allowlist secret-scan false positives

* test(sre:PLA-760): refresh secret baseline for upstream sync

* test(sre:PLA-760): update detect-secrets baseline

* ci(sre:PLA-760): exclude auto-response from zizmor

* fix(ci:PLA-760): unblock audit and bun test lane

* ci(sre:PLA-760): run linux-only ci

---------

Signed-off-by: sallyom <[email protected]>
Co-authored-by: Peter Steinberger <[email protected]>
Co-authored-by: Tak Hoffman <[email protected]>
Co-authored-by: Ayaan Zaidi <[email protected]>
Co-authored-by: darkamenosa <[email protected]>
Co-authored-by: Hermione <[email protected]>
Co-authored-by: rbutera <[email protected]>
Co-authored-by: altaywtf <[email protected]>
Co-authored-by: Altay <[email protected]>
Co-authored-by: echoVic <[email protected]>
Co-authored-by: echoVic <[email protected]>
Co-authored-by: Ayaan Zaidi <[email protected]>
Co-authored-by: Vincent Koc <[email protected]>
Co-authored-by: GitBuck <[email protected]>
Co-authored-by: starbuck100 <[email protected]>
Co-authored-by: jalehman <[email protected]>
Co-authored-by: yuweuii <[email protected]>
Co-authored-by: Rémi <[email protected]>
Co-authored-by: remusao <[email protected]>
Co-authored-by: gumadeiras <[email protected]>
Co-authored-by: Mariano <[email protected]>
Co-authored-by: Shrey Pandya <[email protected]>
Co-authored-by: Claude Opus 4.6 (1M context) <[email protected]>
Co-authored-by: Josh Lehman <[email protected]>
Co-authored-by: Joe Harouni <[email protected]>
Co-authored-by: AaronWander <[email protected]>
Co-authored-by: Matt Van Horn <[email protected]>
Co-authored-by: Charles Dusek <[email protected]>
Co-authored-by: Nimrod Gutman <[email protected]>
Co-authored-by: shichangs <[email protected]>
Co-authored-by: Gustavo Madeira Santana <[email protected]>
Co-authored-by: langdon <[email protected]>
Co-authored-by: sallyom <[email protected]>
Co-authored-by: Tyler Yust <[email protected]>
Co-authored-by: langdon <[email protected]>
Co-authored-by: ademczuk <[email protected]>
Co-authored-by: ademczuk <[email protected]>
Co-authored-by: Doruk Ardahan <[email protected]>
Co-authored-by: 0xsline <[email protected]>
Co-authored-by: bbblending <[email protected]>
Co-authored-by: Radek Sienkiewicz <[email protected]>
Co-authored-by: velvet-shark <[email protected]>
Co-authored-by: Tyson Cung <[email protected]>
Co-authored-by: obviyus <[email protected]>
Co-authored-by: Mariano <[email protected]>
Co-authored-by: Kesku <[email protected]>
Co-authored-by: Kyle <[email protected]>
Co-authored-by: Kyle <[email protected]>
Co-authored-by: Bronko <[email protected]>
Co-authored-by: GazeKingNuWu <[email protected]>
Co-authored-by: GazeKingNuWu <[email protected]>
Co-authored-by: scoootscooob <[email protected]>
Co-authored-by: Daniel dos Santos Reis <[email protected]>
Co-authored-by: DevMac <[email protected]>
Co-authored-by: merlin <[email protected]>
Co-authored-by: dimatu <[email protected]>
Co-authored-by: George Kalogirou <[email protected]>
Co-authored-by: rexlunae <[email protected]>
Co-authored-by: opriz <[email protected]>
Co-authored-by: opriz <[email protected]>
Co-authored-by: Joshua Lelon Mitchell <[email protected]>
Co-authored-by: dsantoreis <[email protected]>
Co-authored-by: Charles Dusek <[email protected]>
Co-authored-by: xaeon2026 <[email protected]>
Co-authored-by: xaeon2026 <[email protected]>
Co-authored-by: Robin Waslander <[email protected]>
Co-authored-by: Pejman Pour-Moezzi <[email protected]>
Co-authored-by: Pejman Pour-Moezzi <[email protected]>
Co-authored-by: Onur <[email protected]>
Co-authored-by: zerone0x <[email protected]>
Co-authored-by: zerone0x <[email protected]>
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 27, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 27, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Hua688 pushed a commit to Hua688/openclaw that referenced this pull request Mar 29, 2026
…w#41266)

Manual cron.run dispatches enqueue onto CommandLane.Cron (concurrency=1).
Isolated job execution internally re-enters the same cron lane via
runEmbeddedPiAgent, causing a classic deadlock: the outer task holds
the single slot while the inner enqueue waits for a free slot.

Move the outer enqueueRun dispatch to CommandLane.Nested so it does not
compete with the inner cron-lane work.  Scheduled runs are unaffected
(they call executeJobCoreWithTimeout directly without enqueueRun).

Regression introduced in 38543d8 (fix(cron): consolidate announce
delivery, fire-and-forget trigger, and minimal prompt mode openclaw#40204).

Co-authored-by: Copilot <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

agents Agent runtime and tooling cli CLI command changes docs Improvements or additions to documentation gateway Gateway runtime maintainer Maintainer-authored PR size: L

Projects

None yet

Development

Successfully merging this pull request may close these issues.

fix(cron): manual cron run trigger should be fire-and-forget fix(cron): announce delivery + wake event cause double message delivery

1 participant