🤖 perf: context-efficient plan mode (#1072)

ThomasK33 · web-flow · commit 385830d72cd0 · 2025-12-10T15:24:36.000Z
## Summary Optimizes context usage in plan mode by: 1. Removing redundant `planContent` from `propose_plan` tool results (plan is already visible via `file_edit_*` diffs) 2. Including the full plan in the mode transition message when switching plan → exec ## Changes - **`propose_plan` tool**: No longer returns `planContent` in the result, saving context during iterative planning sessions - **Mode transition (plan → exec)**: Now includes the full plan with soft framing: "evaluate whether it's relevant to the user's request" - **UI**: `ProposePlanToolCall` fetches content on-demand for the latest plan; shows path info for historical plans without embedded content - **Backwards compatibility**: Old chat history with `planContent` in results still renders correctly ## Context Flow | Phase | Before | After | |-------|--------|-------| | During planning | Plan in `file_edit_*` diffs + full plan in `propose_plan` result | Plan in `file_edit_*` diffs only | | On mode switch | Generic "mode switched" message | Full plan with "evaluate relevance" framing | ## Testing - Added 4 new tests for plan content in mode transitions - All existing tests pass --- <details> <summary>Plan</summary> # Plan: Context-Efficient Plan Mode ## Summary Improve the plan mode → exec mode transition to include the approved plan content in the model's context only when relevant, avoiding redundant context when the plan is already visible in conversation history or no plan was created. ## Current Behavior ### How Plan Mode Works Now 1. **System Prompt**: When in plan mode, `getPlanModeInstruction()` adds instructions telling the model to write its plan to `~/.mux/plans/{workspaceId}.md` 2. **propose_plan Tool**: When called, reads the plan file from disk and returns: ```typescript { success: true, planPath, planContent, message: "Plan proposed. Waiting for user approval." } ``` This content is stored in the tool result in chat history — **this is redundant** since the plan was already written via `file_edit_*` calls. 3. **Mode Transition**: When switching from plan → exec, `injectModeTransition()` inserts a synthetic user message: ``` [Mode switched from plan to exec. Follow exec mode instructions. Available tools: file_read, bash, ...] ``` ### Key Observation The plan content is duplicated in multiple places: 1. **`file_edit_*` tool calls** - The actual writes/edits to the plan file (as diffs) 2. **`propose_plan` tool result** - The full plan content (redundant!) 3. **Not included** - The mode transition message when switching to exec For iterative planning sessions, this means the plan content appears multiple times, wasting context: - Each plan revision has `file_edit_*` diffs ✓ (necessary, minimal) - Each `propose_plan` call duplicates the full content ✗ (redundant) - Final plan isn't surfaced when switching to exec ✗ (missing) ## Problem Statement When the user switches from plan mode to exec mode (implicitly by changing the mode selector), the model: 1. Doesn't receive explicit confirmation that the plan was approved 2. May not realize the plan from earlier in the conversation is what it should execute 3. Has to infer relevance from context rather than being told explicitly ## Proposed Solution Enhance the mode transition injection to optionally include the approved plan content when switching from plan → exec mode. ### Design Principles 1. **Only include plan when relevant** - The model should determine if the plan applies to the current user message 2. **Avoid redundancy** - Don't include plan if it's already visible in recent context 3. **Implicit approval** - Mux's mode switching is implicit (no explicit approve/reject), so we frame it as "plan available for reference" 4. **Graceful fallback** - If no plan file exists, proceed without it ### Implementation #### 1. Modify `injectModeTransition()` to accept plan content (~20 LoC) **File**: `src/browser/utils/messages/modelMessageTransform.ts` Add an optional `planContent` parameter that gets included when transitioning plan → exec: ```typescript export function injectModeTransition( messages: MuxMessage[], currentMode?: string, toolNames?: string[], planContent?: string // NEW: optional plan content for plan→exec transition ): MuxMessage[] { // ... existing logic ... // If transitioning from plan to exec AND plan content provided if (lastMode === "plan" && currentMode === "exec" && planContent) { transitionText += ` The following plan was developed in plan mode. Evaluate whether it's relevant to the user's request. If relevant, use it to guide your implementation: <approved-plan> ${planContent} </approved-plan>`; } // ... rest of function ... } ``` #### 2. Read plan content during stream preparation (~15 LoC) **File**: `src/node/services/aiService.ts` Before calling `injectModeTransition`, check if we're transitioning plan→exec and read the plan file: ```typescript // In prepareStream(), around line 959: let planContentForTransition: string | undefined; if (mode === "exec") { // Check if last assistant message was in plan mode const lastAssistantMessage = [...filteredMessages].reverse().find(m => m.role === "assistant"); if (lastAssistantMessage?.metadata?.mode === "plan") { // Read plan file for transition context const planFilePath = getPlanFilePath(workspaceId); try { planContentForTransition = await readFileString(runtime, planFilePath); } catch { // No plan file, proceed without } } } const messagesWithModeContext = injectModeTransition( messagesWithSentinel, mode, toolNamesForSentinel, planContentForTransition // NEW parameter ); ``` #### 3. Update test coverage (~40 LoC) **File**: `src/browser/utils/messages/modelMessageTransform.test.ts` Add tests for: - Plan content included when transitioning plan→exec with plan content - Plan content NOT included when transitioning exec→plan - Plan content NOT included when no plan content provided - Plan content NOT included when staying in same mode ### Alternative Considered: Include in System Prompt We could add the plan content to the system prompt during exec mode. However: - This would bust the system message cache on every plan change - Less contextually appropriate (plans are conversation-specific, not workspace-wide) - Mode transition injection is already the established pattern for mode-switching context ### Edge Cases 1. **No plan file exists**: Skip including plan content, use existing transition message 2. **Empty plan file**: Treat same as no plan 3. **Plan from previous conversation**: If user switches modes without a plan in current conversation, no plan content is included 4. **Very long plans**: Consider truncating after N characters with "... (plan truncated, see ~/.mux/plans/...)" #### 4. Remove planContent from propose_plan tool result (~5 LoC) **File**: `src/node/services/tools/propose_plan.ts` The tool currently returns the full plan content in the result. Since: - The plan is already in history via `file_edit_*` tool calls (as diffs) - The plan will be included in the mode transition message when switching to exec We can exclude it from the tool result to save context during iterative planning: ```typescript // Before: return { success: true as const, planPath, planContent, // REMOVE THIS message: "Plan proposed. Waiting for user approval.", }; // After: return { success: true as const, planPath, message: "Plan proposed. Waiting for user approval.", }; ``` #### 5. Update ProposePlanToolCall UI to fetch content on demand (~10 LoC) **File**: `src/browser/components/tools/ProposePlanToolCall.tsx` The UI component already has logic to fetch fresh content via `getPlanContent` for the latest plan. We need to ensure it falls back to this API call when `planContent` is not in the result: ```typescript // The component already handles this case - when isLatest is true AND freshContent is fetched // For historical plans (not latest), we can either: // 1. Fetch on demand when expanded (lazy load) // 2. Show "Plan content not available" with option to fetch ``` Since the UI already prioritizes `freshContent` from disk for the latest plan, this mostly works. For historical plans, we should show a minimal message indicating the plan exists at the path. ## Estimated LoC Changes | File | Change | LoC | |------|--------|-----| | `modelMessageTransform.ts` | Add planContent parameter | +20 | | `aiService.ts` | Read plan on transition | +15 | | `propose_plan.ts` | Remove planContent from result | -3 | | `ProposePlanToolCall.tsx` | Handle missing planContent | +10 | | `modelMessageTransform.test.ts` | New test cases | +40 | | **Total** | | **~82** | ## Design Decisions 1. **No truncation** - Include the full plan content. Can revisit if context limits become an issue. 2. **No "outdated" marking** - The existing file change notification system already handles external edits with diffs. 3. **Soft framing** - Use "developed in plan mode, evaluate relevance" rather than "approved" since Mux's approval is implicit. </details> --- _Generated with [mux](https://github.com/coder/mux)_ Signed-off-by: Thomas Kosiewski <tk@coder.com>
diff --git a/src/browser/components/tools/ProposePlanToolCall.tsx b/src/browser/components/tools/ProposePlanToolCall.tsx
@@ -27,19 +27,26 @@ import { usePopoverError } from "@/browser/hooks/usePopoverError";
 import { PopoverError } from "../PopoverError";
 
 /**
- * Check if the result is a successful file-based propose_plan result
+ * Check if the result is a successful file-based propose_plan result.
+ * Note: planContent may be absent in newer results (context optimization).
  */
 function isProposePlanResult(result: unknown): result is ProposePlanToolResult {
   return (
     result !== null &&
     typeof result === "object" &&
     "success" in result &&
     result.success === true &&
-    "planContent" in result &&
     "planPath" in result
   );
 }
 
+/**
+ * Result type that may have planContent (for backwards compatibility with old chat history)
+ */
+interface ProposePlanResultWithContent extends ProposePlanToolResult {
+  planContent?: string;
+}
+
 /**
  * Check if the result is an error from propose_plan tool
  */
@@ -173,11 +180,20 @@ export const ProposePlanToolCall: React.FC<ProposePlanToolCallProps> = (props) =
     const titleMatch = /^#\s+(.+)$/m.exec(freshContent);
     planTitle = titleMatch ? titleMatch[1] : (planPath?.split("/").pop() ?? "Plan");
   } else if (isProposePlanResult(result)) {
-    planContent = result.planContent;
+    // New format: planContent may be absent (context optimization)
+    // For backwards compatibility, check if planContent exists in old chat history
+    const resultWithContent = result as ProposePlanResultWithContent;
     planPath = result.planPath;
-    // Extract title from first markdown heading or use filename
-    const titleMatch = /^#\s+(.+)$/m.exec(result.planContent);
-    planTitle = titleMatch ? titleMatch[1] : (planPath.split("/").pop() ?? "Plan");
+    if (resultWithContent.planContent) {
+      // Old result with embedded content (backwards compatibility)
+      planContent = resultWithContent.planContent;
+      const titleMatch = /^#\s+(.+)$/m.exec(resultWithContent.planContent);
+      planTitle = titleMatch ? titleMatch[1] : (planPath.split("/").pop() ?? "Plan");
+    } else {
+      // New result without content - show path info, content is fetched for latest
+      planContent = `*Plan saved to ${planPath}*`;
+      planTitle = planPath.split("/").pop() ?? "Plan";
+    }
   } else if (isLegacyProposePlanResult(result)) {
     // Legacy format: title + plan passed directly (no file)
     planContent = result.plan;
diff --git a/src/browser/utils/messages/modelMessageTransform.test.ts b/src/browser/utils/messages/modelMessageTransform.test.ts
@@ -874,6 +874,155 @@ describe("injectModeTransition", () => {
       text: "[Mode switched from plan to exec. Follow exec mode instructions.]",
     });
   });
+
+  it("should include plan content when transitioning from plan to exec", () => {
+    const messages: MuxMessage[] = [
+      {
+        id: "user-1",
+        role: "user",
+        parts: [{ type: "text", text: "Let's plan a feature" }],
+        metadata: { timestamp: 1000 },
+      },
+      {
+        id: "assistant-1",
+        role: "assistant",
+        parts: [{ type: "text", text: "Here's the plan..." }],
+        metadata: { timestamp: 2000, mode: "plan" },
+      },
+      {
+        id: "user-2",
+        role: "user",
+        parts: [{ type: "text", text: "Now execute it" }],
+        metadata: { timestamp: 3000 },
+      },
+    ];
+
+    const planContent = "# My Plan\n\n## Step 1\nDo something\n\n## Step 2\nDo more";
+    const result = injectModeTransition(messages, "exec", undefined, planContent);
+
+    expect(result.length).toBe(4);
+    const transitionMessage = result[2];
+    expect(transitionMessage.role).toBe("user");
+    expect(transitionMessage.metadata?.synthetic).toBe(true);
+
+    const textPart = transitionMessage.parts[0];
+    expect(textPart.type).toBe("text");
+    if (textPart.type === "text") {
+      expect(textPart.text).toContain(
+        "[Mode switched from plan to exec. Follow exec mode instructions.]"
+      );
+      expect(textPart.text).toContain("The following plan was developed in plan mode");
+      expect(textPart.text).toContain("<plan>");
+      expect(textPart.text).toContain(planContent);
+      expect(textPart.text).toContain("</plan>");
+    }
+  });
+
+  it("should NOT include plan content when transitioning from exec to plan", () => {
+    const messages: MuxMessage[] = [
+      {
+        id: "user-1",
+        role: "user",
+        parts: [{ type: "text", text: "Done with feature" }],
+        metadata: { timestamp: 1000 },
+      },
+      {
+        id: "assistant-1",
+        role: "assistant",
+        parts: [{ type: "text", text: "Feature complete" }],
+        metadata: { timestamp: 2000, mode: "exec" },
+      },
+      {
+        id: "user-2",
+        role: "user",
+        parts: [{ type: "text", text: "Let's plan the next one" }],
+        metadata: { timestamp: 3000 },
+      },
+    ];
+
+    const planContent = "# Old Plan\n\nSome content";
+    const result = injectModeTransition(messages, "plan", undefined, planContent);
+
+    expect(result.length).toBe(4);
+    const transitionMessage = result[2];
+    const textPart = transitionMessage.parts[0];
+    if (textPart.type === "text") {
+      expect(textPart.text).toBe(
+        "[Mode switched from exec to plan. Follow plan mode instructions.]"
+      );
+      expect(textPart.text).not.toContain("<plan>");
+    }
+  });
+
+  it("should NOT include plan content when no plan content provided", () => {
+    const messages: MuxMessage[] = [
+      {
+        id: "user-1",
+        role: "user",
+        parts: [{ type: "text", text: "Let's plan" }],
+        metadata: { timestamp: 1000 },
+      },
+      {
+        id: "assistant-1",
+        role: "assistant",
+        parts: [{ type: "text", text: "Planning..." }],
+        metadata: { timestamp: 2000, mode: "plan" },
+      },
+      {
+        id: "user-2",
+        role: "user",
+        parts: [{ type: "text", text: "Execute" }],
+        metadata: { timestamp: 3000 },
+      },
+    ];
+
+    const result = injectModeTransition(messages, "exec", undefined, undefined);
+
+    expect(result.length).toBe(4);
+    const transitionMessage = result[2];
+    const textPart = transitionMessage.parts[0];
+    if (textPart.type === "text") {
+      expect(textPart.text).toBe(
+        "[Mode switched from plan to exec. Follow exec mode instructions.]"
+      );
+      expect(textPart.text).not.toContain("<plan>");
+    }
+  });
+
+  it("should include both tools and plan content in transition message", () => {
+    const messages: MuxMessage[] = [
+      {
+        id: "user-1",
+        role: "user",
+        parts: [{ type: "text", text: "Plan done" }],
+        metadata: { timestamp: 1000 },
+      },
+      {
+        id: "assistant-1",
+        role: "assistant",
+        parts: [{ type: "text", text: "Plan ready" }],
+        metadata: { timestamp: 2000, mode: "plan" },
+      },
+      {
+        id: "user-2",
+        role: "user",
+        parts: [{ type: "text", text: "Go" }],
+        metadata: { timestamp: 3000 },
+      },
+    ];
+
+    const toolNames = ["file_read", "bash"];
+    const planContent = "# Plan\n\nDo stuff";
+    const result = injectModeTransition(messages, "exec", toolNames, planContent);
+
+    expect(result.length).toBe(4);
+    const textPart = result[2].parts[0];
+    if (textPart.type === "text") {
+      expect(textPart.text).toContain("Available tools: file_read, bash.]");
+      expect(textPart.text).toContain("<plan>");
+      expect(textPart.text).toContain(planContent);
+    }
+  });
 });
 
 describe("filterEmptyAssistantMessages", () => {
diff --git a/src/browser/utils/messages/modelMessageTransform.ts b/src/browser/utils/messages/modelMessageTransform.ts
@@ -112,15 +112,20 @@ export function addInterruptedSentinel(messages: MuxMessage[]): MuxMessage[] {
  * Inserts a synthetic user message before the final user message to signal the mode switch.
  * This provides temporal context that helps models understand they should follow new mode instructions.
  *
+ * When transitioning from plan → exec mode with plan content, includes the plan so the model
+ * can evaluate its relevance to the current request.
+ *
  * @param messages The conversation history
  * @param currentMode The mode for the upcoming assistant response (e.g., "plan", "exec")
  * @param toolNames Optional list of available tool names to include in transition message
+ * @param planContent Optional plan content to include when transitioning plan → exec
  * @returns Messages with mode transition context injected if needed
  */
 export function injectModeTransition(
   messages: MuxMessage[],
   currentMode?: string,
-  toolNames?: string[]
+  toolNames?: string[],
+  planContent?: string
 ): MuxMessage[] {
   // No mode specified, nothing to do
   if (!currentMode) {
@@ -175,6 +180,17 @@ export function injectModeTransition(
     transitionText += "]";
   }
 
+  // When transitioning plan → exec with plan content, include the plan for context
+  if (lastMode === "plan" && currentMode === "exec" && planContent) {
+    transitionText += `
+
+The following plan was developed in plan mode. Based on the user's message, determine if they have accepted the plan. If accepted and relevant, use it to guide your implementation:
+
+<plan>
+${planContent}
+</plan>`;
+  }
+
   const transitionMessage: MuxMessage = {
     id: `mode-transition-${Date.now()}`,
     role: "user",
diff --git a/src/common/types/tools.ts b/src/common/types/tools.ts
@@ -170,11 +170,12 @@ export type FileEditToolArgs =
 // Args derived from schema
 export type ProposePlanToolArgs = z.infer<typeof TOOL_DEFINITIONS.propose_plan.schema>;
 
-// Result type for new file-based propose_plan tool
+// Result type for file-based propose_plan tool
+// Note: planContent is NOT included to save context - plan is visible via file_edit_* diffs
+// and will be included in mode transition message when switching to exec mode
 export interface ProposePlanToolResult {
   success: true;
   planPath: string;
-  planContent: string;
   message: string;
 }
 
diff --git a/src/node/services/aiService.ts b/src/node/services/aiService.ts
diff --git a/src/node/services/tools/propose_plan.ts b/src/node/services/tools/propose_plan.ts