-
Notifications
You must be signed in to change notification settings - Fork 2
research(security): AEGIS pre-execution firewall — 3-stage intercept + Ed25519 chained audit, 1.2% FPR, 8.3ms overhead (arXiv:2603.12621) #2305
Copy link
Copy link
Closed
Labels
P2High value, medium complexityHigh value, medium complexityresearchResearch-driven improvementResearch-driven improvementsecuritySecurity-related issueSecurity-related issue
Description
Paper
arXiv:2603.12621 — AEGIS: No Tool Call Left Unchecked — A Pre-Execution Firewall and Audit Layer for AI Agents
Key Finding
Framework-agnostic 3-stage tool call intercept: (1) argument extraction, (2) risk scan, (3) policy validation. Produces an Ed25519+SHA-256 chained audit trail. Blocked 48/48 attack scenarios at 1.2% false positive rate and 8.3ms median overhead across 14 agent frameworks.
Applicability to Zeph
- zeph-tools: Zeph's
ToolExecutortrait runs tools after the LLM returns atool_useblock. AEGIS-style pre-execution intercept fits naturally here — validate tool arguments against a policy before executing. - Audit trail: PR feat(tools): ClaimSource provenance, ErrorDomain recovery, MCP tool pruning #2293 adds
ClaimSourceandAuditEntry. AEGIS's chained Ed25519 audit would extend this: eachAuditEntrycould sign over the previous hash, making the audit log tamper-evident. - Risk scan stage: Complements the existing
ExfiltrationGuardand sandbox checks. The 3-stage pipeline (extract → scan → validate) is more structured than current ad-hoc checks inlegacy.rs/native.rs. - Implementation sketch: Add
PreExecutionFirewalltrait tozeph-tools. Default impl passes through; security impl runs the 3 stages. Wire intoCompositeExecutor.execute()before dispatching to the concrete executor. - 1.2% FPR at 8.3ms: Well within acceptable bounds for interactive agent use.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
P2High value, medium complexityHigh value, medium complexityresearchResearch-driven improvementResearch-driven improvementsecuritySecurity-related issueSecurity-related issue