feat(skills): malicious skill trust tier enforcement (#1853) by bug-ops · Pull Request #1878 · bug-ops/zeph

bug-ops · 2026-03-15T20:29:40Z

Summary

Extends the skill trust enforcement subsystem based on findings from arXiv 2602.06547 (empirical study: 157 confirmed malicious SKILL.md files, 26.1% community skill vulnerability prevalence).

Fix QUARANTINE_DENIED tool IDs: replace dead "file_write" with actual executor IDs (write, edit, delete_path, move_path, copy_path, create_directory) and add memory_save, fetch
Fix execute_tool_call_confirmed incorrectly delegating to the unconfirmed path
Add zeph-tools::patterns module: relocate RAW_INJECTION_PATTERNS + strip_format_chars from zeph-mcp for shared access
Add SkillContentScanner (zeph-skills::scanner): scans skill body at load time using injection patterns, emits WARN with match count; documented as advisory-only
Add scan_on_load = true config flag to TrustConfig
Integrate scanner in bootstrap for skills below Trusted tier
Add --scan-skills-on-load CLI flag, /skill scan TUI command, --init wizard step, --migrate-config step

Test plan

cargo +nightly fmt --check passes
cargo clippy --workspace --features full -- -D warnings passes (0 warnings)
cargo nextest run --config-file .github/nextest.toml --workspace --features full --lib --bins passes (5868 tests)
Verify QUARANTINE_DENIED blocks bash, write, edit, delete_path, move_path, copy_path, create_directory, memory_save, web_scrape, fetch for Quarantined skills
Verify SkillContentScanner emits WARN on injection pattern match
Verify scan_on_load = false disables scanner

Follow-up

security: QUARANTINE_DENIED does not cover MCP tool namespace #1876: MCP tools bypass QUARANTINE_DENIED due to namespaced tool IDs (deferred)

Extend TrustLevel enforcement in response to empirical study findings (arXiv 2602.06547): 157 confirmed malicious SKILL.md files, 26.1% vulnerability prevalence in community skills. Changes: - Fix QUARANTINE_DENIED tool IDs: replace dead "file_write" with actual FileExecutor IDs (write, edit, delete_path, move_path, copy_path, create_directory) and add memory_save and fetch - Fix execute_tool_call_confirmed delegating to unconfirmed path - Add zeph-tools::patterns module: relocate RAW_INJECTION_PATTERNS and strip_format_chars from zeph-mcp for shared access - Add SkillContentScanner in zeph-skills::scanner: scans skill body at load time using injection patterns, emits WARN with match count; documented as advisory-only (not a security boundary) - Add scan_on_load config flag (default: true) to TrustConfig - Integrate scanner in bootstrap: called for skills below Trusted tier - Add --scan-skills-on-load CLI flag - Add /skill scan TUI command - Update --init wizard with scan_on_load step - Add --migrate-config step for scan_on_load Follow-up: #1876 (MCP tool namespace bypass in QUARANTINE_DENIED)

merge: sync with origin/main

5110281

bug-ops enabled auto-merge (squash) March 15, 2026 20:32

bug-ops added 5 commits March 15, 2026 21:52

fix(tests): update config snapshot for scan_on_load field

cc98a99

merge: sync with origin/main

4ddcb39

fix(tests): update all config snapshots for scan_on_load field

036d115

merge: sync with origin/main

2c85217

fix(tests): add guardrail config section to lsp_policy snapshot

6aa6a8b

bug-ops linked an issue Mar 15, 2026 that may be closed by this pull request

security: malicious skill trust tier enforcement (community skill security empirical study) #1853

Closed

bug-ops merged commit b47fb5e into main Mar 15, 2026
20 checks passed

bug-ops deleted the security-malicious-skill-trust branch March 15, 2026 21:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(skills): malicious skill trust tier enforcement (#1853)#1878

feat(skills): malicious skill trust tier enforcement (#1853)#1878
bug-ops merged 7 commits intomainfrom
security-malicious-skill-trust

bug-ops commented Mar 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bug-ops commented Mar 15, 2026

Summary

Test plan

Follow-up

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant