feat(security): exfiltration guards (Phase 4, #1195) by bug-ops · Pull Request #1246 · bug-ops/zeph

bug-ops · 2026-03-05T20:06:17Z

Summary

Phase 4 of Untrusted Content Isolation epic (#1195). Adds ExfiltrationGuard to zeph-core with three independently toggleable guards:

Markdown image scanner: strips inline (![](url)) and reference-style (![alt][ref]) image injection from LLM output at 5 integration points (streaming, non-streaming, native text, ToolUse text, cache hits)
Tool URL validator: cross-references URLs in tool call arguments against flagged untrusted content; flag-only approach (no blocking) consistent with Phase 1 philosophy
Memory write guard: skips Qdrant embedding for injection-flagged content while preserving SQLite conversation continuity

Config

[security.exfiltration_guard]
block_markdown_images = true   # default
validate_tool_urls = true      # default
guard_memory_writes = true     # default

Changes

crates/zeph-core/src/sanitizer/exfiltration.rs (new): ExfiltrationGuard, ExfiltrationGuardConfig, ExfiltrationEvent, 20 unit tests
crates/zeph-core/src/agent/tool_execution.rs: 5 scan integration points, tool URL validation, flagged URL collection
crates/zeph-core/src/agent/persistence.rs: has_injection_flags parameter on persist_message
crates/zeph-core/src/config/types.rs: ExfiltrationGuardConfig under SecurityConfig
crates/zeph-core/src/metrics.rs: 3 new counters
Docs, READMEs, CHANGELOG updated

Review process

Architecture critic (4 significant + 4 minor gaps) → all addressed in implementation → code review (2 critical + 3 important) → all fixed and re-review approved.

Known limitations (Phase 5)

Percent-encoded scheme bypass (%68ttps://)
HTML <img> tag injection
Unicode zero-width joiner bypass
Streaming chunks visible before accumulated scan

Test plan

20 unit tests for ExfiltrationGuard (scan_output, validate_tool_call, memory guard)
cargo nextest run --features full: 4029/4029 pass
cargo clippy --features full: 0 warnings
cargo +nightly fmt --check: clean

Closes phase 4 of #1195.

… writes (Phase 4, #1195) Add ExfiltrationGuard to zeph-core with three independently toggleable guards under [security.exfiltration_guard] config section: - Output scanner: strips markdown image injection (inline + reference-style) from LLM responses at 5 integration points (streaming, non-streaming, native text, ToolUse text, cache hits) - Tool URL validator: cross-references URLs in tool call arguments against flagged untrusted content using HashSet for O(1) lookups - Memory write guard: skips Qdrant embedding for injection-flagged content while preserving SQLite conversation continuity Addresses all architecture critic findings (S1-S4, M1-M4) and code review issues (SEC-EX-01 through SEC-EX-06). Adds 20 unit tests, 3 metrics counters, and updates docs and READMEs.

…-guards

github-actions bot added documentation Improvements or additions to documentation rust Rust code changes core zeph-core crate enhancement New feature or request size/XL Extra large PR (500+ lines) labels Mar 5, 2026

This was linked to issues Mar 5, 2026

[SEC-4.1] Markdown image exfiltration guard #1205

Closed

[SEC-4.2] Tool call argument validation guard #1206

Closed

[SEC-4.3] Memory write poisoning guard #1207

Closed

Merge remote-tracking branch 'origin/main' into feat/m32/exfiltration…

6348205

…-guards

bug-ops merged commit 61c4627 into main Mar 5, 2026
28 checks passed

bug-ops deleted the feat/m32/exfiltration-guards branch March 5, 2026 20:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(security): exfiltration guards (Phase 4, #1195)#1246

feat(security): exfiltration guards (Phase 4, #1195)#1246
bug-ops merged 2 commits intomainfrom
feat/m32/exfiltration-guards

bug-ops commented Mar 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bug-ops commented Mar 5, 2026

Summary

Config

Changes

Review process

Known limitations (Phase 5)

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant