-
Notifications
You must be signed in to change notification settings - Fork 2
[SEC-4.1] Markdown image exfiltration guard #1205
Copy link
Copy link
Closed
Labels
corezeph-core cratezeph-core cratepriority/highHigh priorityHigh prioritysecuritySecurity-related issueSecurity-related issuesize/SSmall PR (11-50 lines)Small PR (11-50 lines)
Description
Part of #1195 — Phase 4
Block markdown image injection in LLM output — a known exfiltration vector where injected instructions trick the model into rendering .
Crates: zeph-core
Depends on: SEC-1.1
Tasks:
- Output scanner: detect
patterns in LLM response - Strip or replace with
[blocked external image: ...]when URL is not in user's original message - Allowlist for known safe domains (configurable)
- Config:
[security.exfiltration_guard] block_markdown_images = true - Unit tests with various markdown image injection variants
Files: crates/zeph-core/src/sanitizer/exfiltration.rs (new)
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
corezeph-core cratezeph-core cratepriority/highHigh priorityHigh prioritysecuritySecurity-related issueSecurity-related issuesize/SSmall PR (11-50 lines)Small PR (11-50 lines)