security: sanitize MCP tool descriptions at registration to prevent tool-poisoning injection (Log-To-Leak)

## Source

[Log-To-Leak: Prompt Injection Attacks on Tool-Using LLM Agents via Model Context Protocol](https://openreview.net/forum?id=UVgbFuXPaO)

## Finding

Malicious MCP servers can embed injection instructions in `description` and `inputSchema.description` fields of their tool definitions. When Zeph fetches the tool catalog from an MCP server and injects it into the LLM context during planning, these fields are processed as trusted system content — bypassing `ContentSanitizer` which only applies to user/web content.

This is a distinct attack vector from indirect injection via web scraping: it targets the tool catalog ingestion path, not message content.

## Impact

- Any MCP server (dynamic or static) can inject arbitrary instructions into the LLM's planning context
- `ContentSanitizer` + `ExfiltrationGuard` pipeline does NOT cover tool definitions
- `zeph-mcp`'s `McpToolRegistry` stores tool definitions in Qdrant without sanitization
- Attack surface: all sessions with external MCP servers (`/mcp add`, config-based servers)

## Fix

Add a sanitization pass over MCP tool definitions at registration time in `zeph-mcp` (before storing in Qdrant):
1. Apply `SecurityPatterns` regexes to `description` and all parameter `description` fields
2. Cap `description` field length (e.g. 512 bytes max)
3. Log WARN when injection patterns detected; optionally strip/truncate the offending field
4. Do NOT block tool registration — just sanitize the text before it reaches the LLM context

## Severity

High — active unmitigated attack surface affecting all sessions with external MCP servers. Fix is contained to `zeph-mcp` tool registration and requires no schema changes.

## Research Reference

The Zeph `ContentSanitizer` pipeline (Untrusted Content Isolation epic #1195) applies a similar pattern to web/tool output content. The same approach is applicable here at the tool catalog ingestion layer.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

security: sanitize MCP tool descriptions at registration to prevent tool-poisoning injection (Log-To-Leak) #1691

Source

Finding

Impact

Fix

Severity

Research Reference

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

security: sanitize MCP tool descriptions at registration to prevent tool-poisoning injection (Log-To-Leak) #1691

Description

Source

Finding

Impact

Fix

Severity

Research Reference

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions