feat(web-search): set Exa as default provider via auto-detection (EXA_API_KEY activates Exa) by louiswalsh · Pull Request #35062 · openclaw/openclaw

louiswalsh · 2026-03-04T23:32:24Z

Summary

Problem: OpenClaw has no native Exa search integration — users who want Exa's semantic search must use external workarounds.
Why it matters: Exa provides high-quality semantic search with token-efficient highlights, useful for AI assistant workflows.
What changed: Added Exa as a 6th web_search provider with auto-detection — setting EXA_API_KEY is sufficient to activate Exa (no explicit provider config required). Exa is first in the auto-detect priority chain; when the key is absent, the chain falls through to Brave.
What did NOT change (scope boundary): No changes to existing providers, no changes to web_fetch, no breaking changes to existing configs.

Change Type (select all)

Scope (select all touched areas)

Linked Issue/PR

Related Feature: Add Exa as a native web_search provider #20134
Variant of feat: add Exa as web_search provider #31310 (opt-in only) — this PR adds auto-detection on top

User-visible / Behavior Changes

New provider: "exa" option for tools.web.search.provider
New config keys: tools.web.search.exa.apiKey, tools.web.search.exa.numResults (default: 5), tools.web.search.exa.highlightsMaxChars (default: 8000)
New env var: EXA_API_KEY — when set with no explicit provider config, Exa is selected automatically
Auto-detection priority: Exa (fix: add @lid format support and allowFrom wildcard handling #1) → Brave (Login fails with 'WebSocket Error (socket hang up)' ECONNRESET #2) → Gemini (WA business, groups & office hours #3) → Kimi (Images not passed to Claude CLI - only path reference in text #4) → Perplexity (CLI: add Opencode integration #5) → Grok (Clarification for clawd.md #6)
Tool description changes when Exa is selected: "Search the web using Exa search. Returns structured results with titles, URLs, and highlights."
Users without EXA_API_KEY: no behavior change — chain falls through to Brave as before

Security Impact (required)

New permissions/capabilities? No
Secrets/tokens handling changed? Yes — new EXA_API_KEY env var / exa.apiKey config, follows same pattern as existing providers
New/changed network calls? Yes — new outbound call to https://api.exa.ai/search with x-exa-integration: "openclaw" header
Command/tool execution surface changed? No
Data access scope changed? No
If any Yes, explain risk + mitigation: API key is marked sensitive in Zod schema (same as all other provider keys). Network call fires automatically if EXA_API_KEY is present. Uses existing withTrustedWebSearchEndpoint wrapper with timeout and abort support.

Repro + Verification

Environment

OS: Linux
Runtime/container: Node.js (vitest)
Model/provider: N/A (unit tests only)
Integration/channel (if any): N/A
Relevant config (redacted): { tools: { web: { search: { exa: { apiKey: "exa-..." } } } } } (no explicit provider needed)

Steps

Set EXA_API_KEY in environment (no other config change required)
Invoke web_search tool with a query
Observe Exa auto-selected in verbose logs

Expected

Returns structured results with title, URL, highlights, published date, author
Results capped at numResults (default 5) with highlights limited to highlightsMaxChars (default 8000)

Actual

Matches expected (verified via unit tests)

Evidence

Failing test/log before + passing after

20 tests passing in config.web-search-provider.test.ts, 47 in web-search.test.ts.

Human Verification (required)

Verified scenarios: Auto-detection with EXA_API_KEY set, Exa priority over Brave when both keys present, fallback to Brave when EXA_API_KEY absent
Edge cases checked: Multiple keys present (Exa wins), no keys (falls back to Brave/error), explicit provider override still respected
What you did not verify: Live API call to Exa (unit tests mock the network layer)

Compatibility / Migration

Backward compatible? Yes
Config/env changes? Yes — new optional EXA_API_KEY env var and tools.web.search.exa config block
Migration needed? No
If yes, exact upgrade steps: N/A

Failure Recovery (if this breaks)

How to disable/revert this change quickly: Unset EXA_API_KEY or set provider: "brave" in config — falls back to Brave
Files/config to restore: N/A (additive change, no existing behavior modified)
Known bad symptoms reviewers should watch for: If EXA_API_KEY is invalid, the tool returns an API error message (same pattern as other providers)

Risks and Mitigations

Risk: Exa API availability/rate limits
- Mitigation: If EXA_API_KEY is unset, there is zero impact on existing users
Risk: Unintended Exa activation for users who set EXA_API_KEY for other purposes
- Mitigation: Key is namespaced (EXA_API_KEY) and only read by the web_search tool resolver

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 3c0475f7ac

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-04T23:35:55Z

src/config/zod-schema.agent-runtime.ts

    suppressToolErrorWarnings: z.boolean().optional(),
-    lightContext: z.boolean().optional(),
  })


Restore heartbeat.lightContext in runtime config schema

HeartbeatSchema is strict, so removing lightContext here turns heartbeat.lightContext into an unknown key and causes config validation to fail for existing installations that already use this flag. The runtime still reads this option (for example, heartbeat execution checks heartbeat?.lightContext to choose lightweight bootstrap context), so this change introduces a backwards-incompatible break where valid prior configs are now rejected at load time.

Useful? React with 👍 / 👎.

greptile-apps · 2026-03-04T23:38:37Z

Greptile Summary

This PR adds Exa as the 6th web_search provider and places it first in the auto-detection chain (activated by EXA_API_KEY). The Exa integration is well-structured — resolver helpers are properly guarded, the cache key correctly encodes Exa-specific parameters, and the HTTP integration uses the right authentication header. Tests cover auto-detection priority, config resolution edge cases, and invalid-input fallbacks.

Critical issue:

lightContext: z.boolean().optional() was removed from HeartbeatSchema in src/config/zod-schema.agent-runtime.ts. Since the schema uses .strict(), any user with heartbeat.lightContext: true in their config file will receive a Zod validation error at startup. The runtime code in heartbeat-runner.ts still reads heartbeat?.lightContext to enable lightweight context mode, so this is a functional regression that will break existing user configurations.

Minor issue:

The Zod schema for exa.numResults accepts any positive integer, but resolveSearchCount silently clamps the value to MAX_SEARCH_COUNT (10). A user setting exa.numResults: 25 will pass config validation but only receive 10 results with no warning or error. Adding .max(10) to the schema or documenting the 1–10 cap would make this behavior transparent.

Confidence Score: 2/5

Not safe to merge — a breaking change (removal of lightContext from HeartbeatSchema) will cause config validation errors for existing users.
The Exa integration itself is well-implemented and tested. However, the accidental removal of lightContext: z.boolean().optional() from the strict HeartbeatSchema is a critical breaking change that will cause validation failures for any user with heartbeat.lightContext: true set in their config. This must be reverted before merging. The minor issue with silent clamping of exa.numResults does not block merge but should be addressed for clarity.
src/config/zod-schema.agent-runtime.ts — must restore the lightContext field to HeartbeatSchema and optionally add .max(10) to the exa.numResults schema or document the cap in types.tools.ts.

_{Last reviewed commit: 3c0475f}

greptile-apps · 2026-03-04T23:38:41Z

src/config/zod-schema.agent-runtime.ts

    prompt: z.string().optional(),
    ackMaxChars: z.number().int().nonnegative().optional(),
    suppressToolErrorWarnings: z.boolean().optional(),
-    lightContext: z.boolean().optional(),
  })


lightContext: z.boolean().optional() was removed from HeartbeatSchema, but the field is still in use throughout the codebase:

src/infra/heartbeat-runner.ts:747 reads heartbeat?.lightContext === true

src/config/types.agent-defaults.ts:248 still declares lightContext?: boolean

Tests still verify this behavior

Because HeartbeatSchema uses .strict(), any user with heartbeat.lightContext: true in their config file will now receive a Zod validation error at startup. This is a breaking change.

Suggested change

prompt: z.string().optional(),

ackMaxChars: z.number().int().nonnegative().optional(),

suppressToolErrorWarnings: z.boolean().optional(),

lightContext: z.boolean().optional(),

})

ackMaxChars: z.number().int().nonnegative().optional(),

suppressToolErrorWarnings: z.boolean().optional(),

lightContext: z.boolean().optional(),

})

Prompt To Fix With AI

This is a comment left during a code review. Path: src/config/zod-schema.agent-runtime.ts Line: 33-36 Comment: `lightContext: z.boolean().optional()` was removed from `HeartbeatSchema`, but the field is still in use throughout the codebase: - `src/infra/heartbeat-runner.ts:747` reads `heartbeat?.lightContext === true` - `src/config/types.agent-defaults.ts:248` still declares `lightContext?: boolean` - Tests still verify this behavior Because `HeartbeatSchema` uses `.strict()`, any user with `heartbeat.lightContext: true` in their config file will now receive a Zod validation error at startup. This is a breaking change. ```suggestion ackMaxChars: z.number().int().nonnegative().optional(), suppressToolErrorWarnings: z.boolean().optional(), lightContext: z.boolean().optional(), }) ``` How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-03-04T23:38:42Z

src/config/zod-schema.agent-runtime.ts

+    exa: z
+      .object({
+        apiKey: SecretInputSchema.optional().register(sensitive),
+        numResults: z.number().int().positive().optional(),
+        highlightsMaxChars: z.number().int().positive().optional(),
+      })
+      .strict()
+      .optional(),


The Zod schema for exa.numResults accepts any positive integer, but at runtime resolveSearchCount() (called in createWebSearchTool) silently clamps the value to MAX_SEARCH_COUNT (10). A user who sets exa.numResults: 25 will pass config validation but only receive 10 results — with no warning.

Consider adding .max(10) to the schema to catch this at config-parse time, or add a comment in types.tools.ts documenting the 1–10 cap so users understand the constraint.

Prompt To Fix With AI

This is a comment left during a code review. Path: src/config/zod-schema.agent-runtime.ts Line: 309-316 Comment: The Zod schema for `exa.numResults` accepts any positive integer, but at runtime `resolveSearchCount()` (called in `createWebSearchTool`) silently clamps the value to `MAX_SEARCH_COUNT` (10). A user who sets `exa.numResults: 25` will pass config validation but only receive 10 results — with no warning. Consider adding `.max(10)` to the schema to catch this at config-parse time, or add a comment in `types.tools.ts` documenting the 1–10 cap so users understand the constraint. How can I resolve this? If you propose a fix, please make it concise.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 30817541c5

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-04T23:51:53Z

src/agents/tools/web-search.ts

+          url, // Keep raw for tool chaining
+          description: description ? wrapWebContent(description, "web_search") : "",
+          published: entry.publishedDate || undefined,
+          author: entry.author || undefined,


Wrap Exa author metadata as untrusted external content

runExaSearch marks the payload as externalContent.wrapped: true, but author is returned raw while title/description are wrapped. Exa author values come from web page metadata and can contain arbitrary text, so attacker-controlled pages can inject unwrapped instructions into web_search output despite the wrapper contract. This weakens the prompt-injection boundary for any workflow that consumes Exa search results.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-04T23:51:53Z

src/agents/tools/web-search.ts

+function resolveExaApiKey(exa?: ExaSearchConfig): string | undefined {
+  const fromConfig = normalizeApiKey(exa?.apiKey);
+  if (fromConfig) {


Resolve SecretInput refs when reading Exa API key

tools.web.search.exa.apiKey is validated as SecretInputSchema, but resolveExaApiKey only normalizes plain strings. If a user configures this field as a SecretRef object (which validation allows), it is treated as empty and the code falls back to env vars, producing a false missing_exa_api_key error when EXA_API_KEY is unset. This is a runtime mismatch between accepted config shape and key resolution.

Useful? React with 👍 / 👎.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 5075f15bca

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-05T00:03:18Z

src/secrets/runtime-config-collectors-core.ts

      : undefined;
  const paths = [
    "apiKey",
+    "exa.apiKey",


Include exa in selected-provider gating for web search secrets

Adding exa.apiKey to the resolved-path list without also recognizing provider: "exa" in selectedProvider causes explicit Exa configs to be treated like auto-detect mode (selectedProvider === undefined). In that state, secret refs for other providers (for example grok.apiKey) are incorrectly marked active and resolved; if those refs are unresolved/missing, prepareSecretsRuntimeSnapshot will throw and block startup even though those providers are inactive for this config.

Useful? React with 👍 / 👎.

Adds Exa (exa.ai) as a bundled web search provider using the plugin architecture. Uses highlights-based content retrieval matching the Hermes agent call signature, with x-exa-integration: openclaw header. - New extensions/exa/ directory with plugin config, entry point, and search provider implementation - autoDetectOrder: 5 (first in auto-detect chain when EXA_API_KEY set) - Supports date_after/date_before filtering via ISO-8601 date ranges - Updates bundled plugin IDs, contract registry, and all related tests Co-Authored-By: Louis Walsh <[email protected]>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 8937b55240

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-19T06:25:45Z

extensions/exa/index.ts

+  register(api) {
+    api.registerWebSearchProvider(createExaWebSearchProvider());


Wire Exa into the live web_search tool path

Registering the Exa plugin here does not make it reachable in production yet. src/agents/openclaw-tools.ts:87-90 still exposes createWebSearchTool(), and that implementation only resolves brave|perplexity|grok|gemini|kimi (src/agents/tools/web-search.ts:261-268 and 1337-1376). In other words, an agent session will keep using the old hard-coded provider stack even when EXA_API_KEY is set, so the feature added by this commit never activates.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-03-19T06:25:45Z

extensions/exa/src/exa-web-search-provider.ts

+    id: "exa",
+    label: "Exa Search",
+    hint: "AI-native semantic search · date filters · highlights",
+    envVars: ["EXA_API_KEY"],


Add exa to the runtime web-search config schema

This introduces a new provider id, but the validated config surface still rejects tools.web.search.provider: "exa". src/config/zod-schema.agent-runtime.ts:261-269 and src/config/types.tools.ts:433-475 only allow the pre-existing five providers, so users cannot explicitly pin Exa in config or have config-writing flows persist that selection. The only way to use Exa is to leave provider unset and rely on auto-detection, which is a regression for anyone who already has another search provider configured.

Useful? React with 👍 / 👎.

vincentkoc · 2026-03-23T03:58:46Z

Good call raising this follow-up.

This is related, but I am not treating it as the same change. #52617 landed the Exa bundled provider itself. This PR is about a different policy question: whether Exa should become the default auto-detect choice when EXA_API_KEY is present.

I'm leaving this separate for now because that selection policy diverges from the landed scope. If you want this folded differently, tell me the intended auto-detect rule and I can reassess it quickly.

openclaw-barnacle bot added docs Improvements or additions to documentation agents Agent runtime and tooling size: M labels Mar 4, 2026

chatgpt-codex-connector bot reviewed Mar 4, 2026

View reviewed changes

greptile-apps bot reviewed Mar 4, 2026

View reviewed changes

louiswalsh force-pushed the feat/exa-web-search-default branch from 3c0475f to 3081754 Compare March 4, 2026 23:44

louiswalsh changed the title ~~feat(web-search): add Exa as 6th web_search provider with auto-detection (EXA_API_KEY)~~ feat(web-search): set Exa as default provider via auto-detection (EXA_API_KEY activates Exa) Mar 4, 2026

chatgpt-codex-connector bot reviewed Mar 4, 2026

View reviewed changes

chatgpt-codex-connector bot reviewed Mar 5, 2026

View reviewed changes

github-actions bot mentioned this pull request Mar 5, 2026

🦞 OpenClaw 生态日报 2026-03-05 duanyytop/agents-radar#77

Open

MonkeyLeeT mentioned this pull request Mar 11, 2026

feat(web-search): add Exa as web search provider #32529

Closed

6 tasks

devin-ai-integration bot force-pushed the feat/exa-web-search-default branch from 6e8e5b0 to 8937b55 Compare March 19, 2026 06:09

openclaw-barnacle bot added size: XL and removed docs Improvements or additions to documentation agents Agent runtime and tooling size: M labels Mar 19, 2026

chatgpt-codex-connector bot reviewed Mar 19, 2026

View reviewed changes

louiswalsh mentioned this pull request Mar 19, 2026

Adding Exa as a web search plugin #50281

Closed

20 tasks

vincentkoc mentioned this pull request Mar 23, 2026

feat(web-search): add Exa as bundled web search plugin #52617

Merged

20 tasks

		register(api) {
		api.registerWebSearchProvider(createExaWebSearchProvider());

Uh oh!

Conversation

louiswalsh commented Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Change Type (select all)

Scope (select all touched areas)

Linked Issue/PR

User-visible / Behavior Changes

Security Impact (required)

Repro + Verification

Environment

Steps

Expected

Actual

Evidence

Human Verification (required)

Compatibility / Migration

Failure Recovery (if this breaks)

Risks and Mitigations

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot commented Mar 4, 2026

Greptile Summary

Confidence Score: 2/5

Uh oh!

greptile-apps bot Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

vincentkoc commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

louiswalsh commented Mar 4, 2026 •

edited

Loading