fix: detect Node.js / Python shebang scripts as their agent identity (#9870) by lonexreb · Pull Request #10022 · warpdotdev/warp

lonexreb · 2026-05-04T08:17:50Z

The bug

On Linux, agents distributed as Node.js or Python shebang scripts surface as node /usr/local/bin/<agent> because /proc/PID/comm is initialised from the executable basename, not from argv[0] or the symlink name. The bare prefix match in CLIAgent::detect saw node and missed the agent entirely, so the third-party CLI agent toolbar never appeared.

The canonical reproduction in #9870: the @openai/codex npm package installs /usr/local/bin/codex as a symlink to codex.js whose shebang is #!/usr/bin/env node. Detection failed for every Linux user of Codex.

The fix

Two new private helpers in app/src/terminal/cli_agent.rs, plus a small extension to detect() to consider their output:

path_basename_token peels a leading path component so /usr/local/bin/codex → codex for matching purposes. Covers fully-qualified invocations.
shebang_script_basename recognises a known script-runtime invocation (node, nodejs, bun, deno, python, python3) and pulls the script basename from argv[1], stripping common script extensions (.js, .mjs, .cjs, .ts, .py). Skips leading flags (node --inspect /usr/local/bin/codex).
detect() matches command_prefix() against the exact first word OR either of the two derived tokens.

The change is purely additive — every existing detect path still matches exactly as before, including the Uber-team aifx agent run claude special case. The new candidates only fire when the exact-word match fails, so there's no false-positive surface for unrelated commands.
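The two helpers and the extended match can be pictured roughly as below. This is a minimal sketch rather than the PR's actual code: the helper names and the runtime and extension lists come from the description above, while the function bodies, the simplified `detect` signature, and the plain list of prefixes are assumptions made for illustration. Flags are skipped naively here, as in the first revision of the PR.

```rust
/// Runtimes whose first positional argument is the script (list from the PR description).
const SCRIPT_RUNTIMES: &[&str] = &["node", "nodejs", "bun", "deno", "python", "python3"];
/// Script extensions stripped before matching (also from the PR description).
const SCRIPT_EXTENSIONS: &[&str] = &[".js", ".mjs", ".cjs", ".ts", ".py"];

/// `/usr/local/bin/codex` -> `codex`. Returns `None` when the token has no path component.
fn path_basename_token(first_word: &str) -> Option<&str> {
    let (_, base) = first_word.rsplit_once('/')?;
    (!base.is_empty()).then_some(base)
}

/// `node --inspect /path/to/codex.js` -> `codex`. Returns `None` for non-runtime commands.
fn shebang_script_basename(command: &str) -> Option<&str> {
    let mut tokens = command.split_whitespace();
    let runtime = tokens.next()?;
    if !SCRIPT_RUNTIMES.contains(&runtime) {
        return None;
    }
    // Assumption: the first token that does not start with '-' is the script path.
    let script = tokens.find(|t| !t.starts_with('-'))?;
    let base = script.rsplit('/').next()?;
    let stem = SCRIPT_EXTENSIONS
        .iter()
        .find_map(|ext| base.strip_suffix(*ext))
        .unwrap_or(base);
    (!stem.is_empty()).then_some(stem)
}

/// Hypothetical stand-in for `CLIAgent::detect`: exact first word first, then the
/// two derived tokens. `prefixes` plays the role of each agent's `command_prefix()`.
fn detect(command: &str, prefixes: &[&'static str]) -> Option<&'static str> {
    let first = command.split_whitespace().next()?;
    let from_path = path_basename_token(first);
    let from_runtime = shebang_script_basename(command);
    prefixes
        .iter()
        .copied()
        .find(|p| first == *p || from_path == Some(*p) || from_runtime == Some(*p))
}
```

With a prefix list of `codex`, `claude`, and `goose`, this sketch maps `node /usr/local/bin/codex` and `./goose` to their agents, while `node ./my-app.js` yields a stem (`my-app`) that matches no prefix.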

Test coverage

cli_agent_tests.rs gets three new test functions covering:

node /usr/local/bin/codex → CLIAgent::Codex (the canonical #9870 case)
node --inspect /usr/local/bin/codex (runtime flag before script path)
node /path/to/codex.js (extension stripped)
nodejs, bun alternative runtimes
Path-prefixed binaries: /usr/local/bin/codex, ./goose, /opt/homebrew/bin/claude
Negative tests ensure no false positives: node, node ./my-app.js, python /home/user/script.py, node --version all return None.

Why not patch upstream `process.title`?

The reporter notes that adding process.title = 'codex' to the Codex CLI's entry script would update /proc/PID/comm to codex on Linux. That's a fix in the wrong tree — every shebang-script-distributed agent would need the same upstream patch, including agents Warp doesn't ship and can't influence. Recognising the pattern in Warp's detection layer covers all of them at once.

Test plan

On Ubuntu Linux with npm install -g @openai/codex, run codex in a Warp terminal — third-party agent toolbar appears (was: never appeared).
On macOS with the same Codex install, behavior is unchanged (the fix's new code paths are exercised but produce the same answer as today's exact-word match).
On Linux, run node ./my-random-app.js — no agent toolbar appears (negative case).
All existing cli_agent_tests.rs tests continue to pass.

Scope

Two files: app/src/terminal/cli_agent.rs (+77/-2), app/src/terminal/cli_agent_tests.rs (+103).
No public API changes.
No new dependencies.
Builds on top of master; no conflict with the in-flight Mistral Vibe PR (feat: recognize Mistral Vibe as a CLI agent (#9607) #9667) — both modify detect() but in non-overlapping arms (Vibe adds a vibe-acp exact-word check, this adds basename / runtime fallbacks). Either can rebase cleanly onto the other.

…arpdotdev#9870)

On Linux, agents distributed as Node.js or Python shebang scripts surface as `node /usr/local/bin/codex` (the runtime, not the script) because `/proc/PID/comm` is initialised from the executable basename. The bare prefix match in `CLIAgent::detect` saw `node` and missed Codex entirely, so the third-party CLI agent toolbar never appeared.

The reproduction in warpdotdev#9870 confirms this is the canonical case: the @openai/codex npm package installs `/usr/local/bin/codex` as a symlink to `codex.js` whose shebang is `#!/usr/bin/env node`. Detection failed for every Linux user of Codex, but worked on macOS where shebang process naming differs.

Fix:
- New helper `path_basename_token` peels a leading path component so `/usr/local/bin/codex` → `codex` for matching purposes.
- New helper `shebang_script_basename` recognises a known script-runtime invocation (`node`, `nodejs`, `bun`, `deno`, `python`, `python3`) and pulls the script basename from `argv[1]`, stripping common script extensions (`.js`, `.mjs`, `.cjs`, `.ts`, `.py`).
- `detect()` matches `command_prefix()` against the exact first word OR either of the two derived tokens.

The change is purely additive: every existing detect path still matches exactly as before, including the Uber-team `aifx agent run claude` special case. The new candidates only fire when the exact-word match fails, so there's no false-positive surface for unrelated commands like `node ./my-app.js` or `python /home/user/script.py`.

Tests in `cli_agent_tests.rs` cover:
- `node /usr/local/bin/codex` → Codex (the canonical warpdotdev#9870 case)
- `node` with flags before the script path
- `node /path/to/codex.js` (extension stripped)
- `nodejs`, `bun` runtimes
- Path-prefixed binaries: `/usr/local/bin/codex`, `./goose`
- Negative tests: `node`, `node ./my-app.js`, `python script.py`, `node --version` all return None.

Followups deliberately out of scope:
- A user-configurable runtime list (V0 hardcodes the 6 most common).
- An `argv[0]` rewrite via `process.title` patches upstream; that's a Codex CLI repo concern, not a Warp concern.

oz-for-oss · 2026-05-04T08:18:04Z

@lonexreb

I'm starting a first review of this pull request.

You can view the conversation on Warp.

I completed the review and no human review was requested for this pull request.

Comment /oz-review on this pull request to retrigger a review (up to 3 times on the same pull request).

Powered by Oz

oz-for-oss

Overview

Reviewed the CLI agent detection changes for path-prefixed binaries and Node/Python-style shebang script invocations, along with the added unit coverage.

Concerns

Runtime flag parsing skips every leading flag without accounting for flags that consume the following token, which can make non-script invocations like node -e codex or python -c claude resolve as known agents.

Verdict

Found: 0 critical, 1 important, 0 suggestions

Request changes

oz-for-oss · 2026-05-04T08:22:58Z

+    let _runtime = tokens.next()?;
+    // Skip leading flags like `-e '...'` or `--inspect`. Stop at the first
+    // non-flag token: that's the script path.
+    let script = tokens.find(|t| !t.starts_with('-'))?;


⚠️ [IMPORTANT] This treats the first non-flag token as a script even when the preceding runtime flag consumes an argument, so invocations like node -e codex, node --require codex, or python -c claude can false-positive as agents. Account for value-taking runtime flags before selecting the script token.

Resolves the merge conflict with master after warpdotdev#9667 (Mistral Vibe agent recognition) merged. Combines both surfaces:
- The Vibe arm now also resolves through path_basename_token and shebang_script_basename, so `/usr/local/bin/vibe-acp` and `node /opt/vibe-acp` both detect as Vibe.

Also addresses oz-for-oss review on PR warpdotdev#10022:

> This treats the first non-flag token as a script even when the preceding runtime flag consumes an argument, so invocations like `node -e codex`, `node --require codex`, or `python -c claude` can false-positive as agents.

Adds VALUE_TAKING_RUNTIME_FLAGS, a list of flags that consume the following token as their value:
- Node: -e, --eval, -p, --print, -r, --require, -C, --conditions
- Python: -c, -m, -X, -W
- Deno (subset): -A, --allow-read, --allow-write, --allow-net
- Bun (subset): -d, --define

shebang_script_basename now consumes the value alongside the flag, so `node -e codex` no longer false-positives as Codex. The `--key=value` form remains a single whitespace token and is handled by the existing `starts_with('-')` flag check.

New regression tests in cli_agent_tests.rs:
- `node -e codex` → None
- `node --eval codex` → None
- `node --require claude /home/user/app.js` → None
- `python -c claude` → None
- `python -m gemini` → None
- `node --require some-mod /usr/local/bin/codex` → Codex (legitimate agent script after value-taking flag is consumed)
- `node --inspect=127.0.0.1:9229 /usr/local/bin/codex` → Codex (--key=value form preserves correct script detection)
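The flag handling described in this commit might look roughly like the sketch below. The constant name comes from the commit message; the Node and Python entries are copied from it, the Deno and Bun subsets are left out for brevity, and the helper name `first_script_token` and its body are assumptions for illustration only.

```rust
/// Flags that consume the next token as their value (Node and Python entries
/// quoted from the commit message; the Deno and Bun subsets are omitted here).
const VALUE_TAKING_RUNTIME_FLAGS: &[&str] = &[
    "-e", "--eval", "-p", "--print", "-r", "--require", "-C", "--conditions", // Node
    "-c", "-m", "-X", "-W", // Python
];

/// Hypothetical helper: returns the first positional argument after the runtime,
/// skipping boolean flags, `--key=value` tokens, and value-taking flags plus their value.
fn first_script_token(command: &str) -> Option<&str> {
    let mut tokens = command.split_whitespace();
    let _runtime = tokens.next()?;
    while let Some(token) = tokens.next() {
        if VALUE_TAKING_RUNTIME_FLAGS.contains(&token) {
            // Drop the flag's value as well; stop if the command ends here.
            tokens.next()?;
        } else if !token.starts_with('-') {
            return Some(token);
        }
        // Anything else starting with '-' is treated as a self-contained flag.
    }
    None
}
```

Under these assumptions `node -e codex` yields no script token at all, while `node --require some-mod /usr/local/bin/codex` still reaches the script path.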

lonexreb · 2026-05-04T09:30:46Z

/oz-review

oz-for-oss · 2026-05-04T09:30:59Z

@lonexreb

I'm re-reviewing this pull request in response to a review request.

You can view the conversation on Warp.

I completed the review and no human review was requested for this pull request.

oz-for-oss

Overview

This PR adds basename-based detection for path-prefixed CLI agents and Node/Python-style shebang runtime invocations.

Concerns

The basename fallback applies to every agent prefix, including the generic Cursor prefix agent, which creates false positives for unrelated scripts or binaries named agent.
The runtime parser does not stay aligned with extract_first_command when shell parsing skips leading environment assignments, so env-prefixed shebang invocations can still fail detection.

Verdict

Found: 0 critical, 2 important, 0 suggestions

Request changes

oz-for-oss · 2026-05-04T09:36:58Z

+                    || candidate_basename.as_deref() == Some(prefix)
+                    || runtime_invoked_basename.as_deref() == Some(prefix)


⚠️ [IMPORTANT] These basename fallbacks apply to every command_prefix(), including CLIAgent::CursorCli whose prefix is the generic string agent; /tmp/agent, node /tmp/agent.js, or python /tmp/agent.py will now be detected as Cursor. Keep exact matching for generic prefixes or allowlist only unambiguous agent names for basename/runtime recovery.
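One way to act on this suggestion is to gate the fallbacks behind a deny-list of generic prefixes. The sketch below borrows the names the follow-up commit uses (`GENERIC_AGENT_PREFIXES`, `prefix_eligible_for_basename_recovery`); the bodies and the `prefix_matches` helper are hypothetical and only illustrate the gating idea.

```rust
/// Prefixes considered too generic for basename or runtime-script recovery.
const GENERIC_AGENT_PREFIXES: &[&str] = &["agent"];

/// True when a prefix is distinctive enough to be recovered from a basename.
fn prefix_eligible_for_basename_recovery(prefix: &str) -> bool {
    !GENERIC_AGENT_PREFIXES.contains(&prefix)
}

/// Hypothetical per-prefix check: the exact first word always counts, while a
/// derived basename only counts for prefixes that pass the eligibility gate.
fn prefix_matches(prefix: &str, first_word: &str, candidate_basename: Option<&str>) -> bool {
    first_word == prefix
        || (prefix_eligible_for_basename_recovery(prefix) && candidate_basename == Some(prefix))
}
```

With this gate, a bare `agent` still matches Cursor's prefix exactly, `/tmp/agent` does not, and `/tmp/codex` continues to recover as Codex.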

oz-for-oss · 2026-05-04T09:36:58Z

+    }
+    // Use whitespace splitting (not shell parsing) — this is best-effort
+    // recovery, not security-critical input handling.
+    let mut tokens = command.split_whitespace();


⚠️ [IMPORTANT] This re-parses the original command instead of the same shell-parsed token stream used by extract_first_command; with escape_char set, FOO=1 node /usr/local/bin/codex produces resolved_first_word == "node", but this iterator consumes FOO=1 as the runtime and never reaches the Codex script. Strip the same leading assignments or share the parsed tokens before scanning runtime arguments.
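Stripping the leading assignments could be sketched as follows. The name `is_env_var_assignment` and the `[A-Za-z_][A-Za-z0-9_]*=` pattern are taken from the follow-up commit message; the implementation and the `tokens_after_env_prefix` helper are assumptions, and whitespace splitting stands in for the shell-aware parsing that `extract_first_command` performs.

```rust
/// True for tokens shaped like `NAME=value`, where NAME matches `[A-Za-z_][A-Za-z0-9_]*`.
fn is_env_var_assignment(token: &str) -> bool {
    let Some((name, _value)) = token.split_once('=') else {
        return false;
    };
    let mut chars = name.chars();
    match chars.next() {
        Some(c) if c.is_ascii_alphabetic() || c == '_' => {}
        _ => return false,
    }
    chars.all(|c| c.is_ascii_alphanumeric() || c == '_')
}

/// Hypothetical helper: splits on whitespace and drops leading assignments, so the
/// first item yielded is the runtime (or the command itself).
fn tokens_after_env_prefix<'a>(command: &'a str) -> impl Iterator<Item = &'a str> + 'a {
    command
        .split_whitespace()
        .skip_while(|t| is_env_var_assignment(t))
}
```

A token such as `--inspect=127.0.0.1:9229` is not mistaken for an assignment because its name part starts with `-`.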

Addresses two findings from oz-for-oss re-review on PR warpdotdev#10022:

1. `CLIAgent::CursorCli`'s `command_prefix()` is the bare string "agent", which is generic enough to false-positive when the basename / runtime-script fallbacks fire on unrelated scripts and binaries. `/tmp/agent`, `node /tmp/agent.js`, `python /tmp/agent.py` would all incorrectly detect as Cursor. New `GENERIC_AGENT_PREFIXES` const lists prefixes that are too generic for fallback recovery; `prefix_eligible_for_basename_recovery` gates the candidate_basename and runtime_invoked_basename branches in `CLIAgent::detect`. Exact-word match still works for Cursor, just not the basename fallbacks. Distinctive prefixes (codex, claude, etc.) continue to recover via basename normally.

2. `shebang_script_basename` did its own whitespace tokenization, which diverged from `extract_first_command`'s shell-aware parsing when the user's command starts with env-var assignments. `FOO=1 node /usr/local/bin/codex` would have `resolved_first_word == "node"` (correct) but `shebang_script_basename` would read `FOO=1` as the runtime token and never reach the script. Added `is_env_var_assignment` helper (matches POSIX env-var name pattern: `[A-Za-z_][A-Za-z0-9_]*=`). `shebang_script_basename` now skips leading env-var assignments before reading the runtime token, keeping it aligned with `extract_first_command`.

New regression tests:
- `test_detect_generic_prefix_no_basename_false_positive`:
  - `agent` → CursorCli (exact match still works)
  - `/tmp/agent`, `node /tmp/agent.js`, `python /tmp/agent.py` → None
  - `/tmp/codex` → Codex (distinctive prefixes still recover)
- `test_detect_node_shebang_with_env_var_prefix`:
  - `FOO=1 node /usr/local/bin/codex` → Codex
  - `FOO=1 BAR=baz node /usr/local/bin/codex` → Codex
  - `PATH=/tmp:/usr/bin node /usr/local/bin/codex` → Codex

lonexreb · 2026-05-05T19:07:00Z

The two important issues from the previous bot review are now addressed in commit 409b61b:

Generic-prefix false positives: introduced GENERIC_AGENT_PREFIXES and prefix_eligible_for_basename_recovery() so basename / shebang-runtime recovery only applies to unambiguous agent names. The generic agent prefix (CLIAgent::CursorCli) is now gated out of fallback matching.
Env-var prefix in shebang scripts: shebang_script_basename now uses is_env_var_assignment() to skip KEY=value tokens, aligning with extract_first_command's shell-aware parsing.

The bot's re-review at 09:30 UTC on May 4 completed without requesting further human review, but the PR still shows CHANGES_REQUESTED from the original review. Could a maintainer dismiss or re-approve so this can move forward? Happy to address anything else flagged.

harryalbert · 2026-05-06T13:29:10Z

You're able to re-request review from the agent by using the /oz-review command. Once you feel this PR is ready for review by a maintainer, please feel free to @/tag me! I'd be happy to take a look

oz-for-oss · 2026-05-06T13:29:24Z

@harryalbert

I'm re-reviewing this pull request in response to a review request.

You can view the conversation on Warp.

I reviewed this pull request and requested human review from: @zachbai.

oz-for-oss · 2026-05-06T13:29:50Z

@harryalbert

I'm re-reviewing this pull request in response to a review request.

You can view the conversation on Warp.

I reviewed this pull request and requested human review from: @zachbai.

Oz no longer requests changes for this pull request after the latest automated review.

oz-for-oss

Overview

This PR extends CLI agent detection to recover agent identities from path-prefixed binaries and Node/Python-style shebang runtime invocations, with regression coverage for Codex and false-positive cases.

Concerns

No blocking correctness or security concerns found.
Non-blocking follow-up: Deno is listed as a supported script runtime, but the current token parsing does not handle common Deno command shapes.

Verdict

Found: 0 critical, 0 important, 1 suggestions

Approve with nits

oz-for-oss · 2026-05-06T13:38:24Z

+/// runtime, not the script name — so a command surfaces as e.g.
+/// `node /usr/local/bin/codex` and we have to look at the second token to
+/// recover the agent identity. See #9870 for the canonical Codex case.
+const SCRIPT_RUNTIMES: &[&str] = &["node", "nodejs", "bun", "deno", "python", "python3"];


💡 [SUGGESTION] deno is listed as a runtime, but typical invocations put run before the script and permission flags like --allow-read can be boolean, so the helper can return run or consume the script path. Handle Deno's command shape or remove it from this list until supported.

oz-for-oss

Overview

This PR expands CLI agent detection to recover agent names from path-prefixed commands and common runtime-invoked shebang scripts, with coverage for the Codex Linux failure, basename recovery, value-taking runtime flags, generic prefix gating, and env-var prefixes.

Concerns

Deno is listed as a supported runtime, but common Deno shebangs use deno run ... <script>, so the current first-positional parser still will not recover those script basenames.

Verdict

Found: 0 critical, 0 important, 1 suggestions

Approve with nits

oz-for-oss · 2026-05-06T13:41:25Z

+/// runtime, not the script name — so a command surfaces as e.g.
+/// `node /usr/local/bin/codex` and we have to look at the second token to
+/// recover the agent identity. See #9870 for the canonical Codex case.
+const SCRIPT_RUNTIMES: &[&str] = &["node", "nodejs", "bun", "deno", "python", "python3"];


💡 [SUGGESTION] deno does not follow the helper's argv[1] is the script assumption for common shebangs (deno run [flags] <script>), so this list advertises Deno support that will still return None. Either skip the run subcommand and Deno flags before selecting the script, or remove deno until that path is supported.
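If Deno were to stay in the list, the first option the reviewer mentions (skipping the `run` subcommand and flags) might be sketched as below. The helper name `deno_script_token` is hypothetical, and the sketch makes a simplifying assumption: every flag is treated as boolean or `--flag=value`, so Deno flags that take a separate value token are not handled.

```rust
/// Hypothetical helper: returns the script path from `deno run [flags] <script>`.
/// Simplification: any token starting with '-' is treated as a self-contained flag.
fn deno_script_token(command: &str) -> Option<&str> {
    let mut tokens = command.split_whitespace();
    if tokens.next()? != "deno" {
        return None;
    }
    // Only the `run` subcommand is recognised in this sketch.
    if tokens.next()? != "run" {
        return None;
    }
    tokens.find(|t| !t.starts_with('-'))
}
```

The result would still need the same basename and extension stripping as the other runtimes before it is compared against an agent prefix.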

harryalbert

@lonexreb, after looking over this PR a bit more, I think I'm a little hesitant about the solution here. While it would potentially address the issue, it's depending on a lot of special node, python, etc. state that could change with new versions or older versions (and is not generalizable to other runtimes that folks might be using). I'd prefer a less specialized solution if possible.

I chatted with the agent a bit, and I think that a more robust solution shape might look something like:

Persist the shell preexec command
  ◦ Add preexec_command: Option<String> to Block.
  ◦ In Block::preexec at app/src/terminal/model/block.rs:3247, store data.command before/while handling PreexecValue.
  ◦ Add a narrow getter, e.g. preexec_command(&self) -> Option<&str>.
Use preexec command for CLI-agent detection
  ◦ In detect_cli_agent_from_model at app/src/terminal/view/use_agent_footer/mod.rs:352, prefer:
    ▪ active_block.preexec_command()
    ▪ fallback to active_block.command_with_secrets_obfuscated(false) at app/src/terminal/view/use_agent_footer/mod.rs:373
  ◦ This makes detection use what the shell was about to execute, not whatever command text/process shape Warp later reconstructed.
Simplify CLIAgent::detect
  ◦ Keep:
    ▪ exact first-token match: codex
    ▪ alias expansion
    ▪ path basename match: /usr/local/bin/codex
    ▪ existing special cases like aifx agent run claude / vibe-acp
  ◦ Remove the broad runtime parser for Node/Python/Deno/Bun flags from app/src/terminal/cli_agent.rs.
Curious on your thoughts on this shape/if you agree!
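The proposed shape can be sketched with a pared-down stand-in for `Block`. Everything here is hypothetical beyond the `preexec_command` field and getter named in the comment above: the real `Block` has many more fields, `Block::preexec` takes different arguments, and `reconstructed_command` merely stands in for whatever `command_with_secrets_obfuscated(false)` returns.

```rust
/// Hypothetical, minimal stand-in for Warp's `Block`.
struct Block {
    /// Command text reported by the shell's preexec hook, if one was received.
    preexec_command: Option<String>,
    /// Stand-in for the command text Warp reconstructs later.
    reconstructed_command: String,
}

impl Block {
    /// Records the command the shell is about to execute.
    fn preexec(&mut self, command: &str) {
        self.preexec_command = Some(command.to_owned());
    }

    /// Narrow getter, as proposed in the review comment.
    fn preexec_command(&self) -> Option<&str> {
        self.preexec_command.as_deref()
    }

    /// Prefers the preexec text and falls back to the reconstructed command.
    fn command_for_agent_detection(&self) -> &str {
        self.preexec_command()
            .unwrap_or(self.reconstructed_command.as_str())
    }
}
```

The point of the design is that detection would see the text the user's shell was about to run (for example `codex`) rather than a later process-derived shape such as `node /usr/local/bin/codex`, which is what would let the runtime flag parser be dropped.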

cla-bot Bot added the cla-signed label May 4, 2026

github-actions Bot added the external-contributor Indicates that a PR has been opened by someone outside the Warp team. label May 4, 2026

oz-for-oss Bot previously requested changes May 4, 2026

harryalbert mentioned this pull request May 6, 2026

Codex agent toolbelt does not appear on Linux because Node.js script process shows as 'node' not 'codex' #9870

Open

harryalbert self-assigned this May 6, 2026

oz-for-oss Bot reviewed May 6, 2026

oz-for-oss Bot requested a review from zachbai May 6, 2026 13:38

oz-for-oss Bot reviewed May 6, 2026

harryalbert self-requested a review May 6, 2026 14:54

harryalbert reviewed May 6, 2026

                    || candidate_basename.as_deref() == Some(prefix)
                    || runtime_invoked_basename.as_deref() == Some(prefix)
