fix(security): sanitizer classifier 401 and regex false positives by bug-ops · Pull Request #2314 · bug-ops/zeph

bug-ops · 2026-03-28T00:02:42Z

Summary

Wire ZEPH_HF_TOKEN from vault into all five hf_hub::api::sync::Api::new() call sites via ApiBuilder::with_token(); add hf_token: Option<String> to ClassifiersConfig and CandleConfig, resolved in resolve_secrets()
Add scan_user_input: bool (default false) to ClassifiersConfig; gate DeBERTa classifier in agent/mod.rs behind this flag — prevents false positives on direct user chat messages ("hello, who are you?", "what is 2+2?")
Upgrade silent warn! fallback in classify_injection to error!; add tracing::error! at cached load-failure path in CandleClassifier to surface permanent classifier degradation visibly
Add 9 regression tests: regex false-positive coverage, injection detection, scan_user_input flag behavior, hf_token propagation

Test plan

cargo +nightly fmt --check — clean
cargo clippy --workspace --features full -- -D warnings — zero warnings
cargo nextest run --workspace --features full --lib --bins — 6847 passed, 22 skipped

) - Wire ZEPH_HF_TOKEN from vault into all five hf_hub Api call sites via ApiBuilder::with_token(); add hf_token field to ClassifiersConfig and CandleConfig, resolved in resolve_secrets() - Add scan_user_input flag (default false) to ClassifiersConfig; gate DeBERTa classifier in agent/mod.rs behind this flag to prevent false positives on direct user chat messages - Upgrade silent warn! fallback in classify_injection to error! and add tracing::error! at cached load-failure path in CandleClassifier - Add 9 regression tests: regex false-positive coverage for greetings and arithmetic, injection detection, scan_user_input flag, hf_token propagation

bug-ops mentioned this pull request Mar 28, 2026

bug(security): sanitizer classifier 401 on HuggingFace download — regex fallback blocks benign queries #2292

Closed

github-actions bot added bug Something isn't working size/L Large PR (201-500 lines) documentation Improvements or additions to documentation llm zeph-llm crate (Ollama, Claude) rust Rust code changes core zeph-core crate and removed size/L Large PR (201-500 lines) labels Mar 28, 2026

bug-ops enabled auto-merge (squash) March 28, 2026 00:02

bug-ops force-pushed the 2292-sanitizer-classifier-401 branch from 87161db to e5b3a12 Compare March 28, 2026 00:23

github-actions bot added the size/L Large PR (201-500 lines) label Mar 28, 2026

bug-ops merged commit 6d0dd57 into main Mar 28, 2026
25 checks passed

bug-ops deleted the 2292-sanitizer-classifier-401 branch March 28, 2026 00:31

bug-ops mentioned this pull request Mar 28, 2026

bug(a2a): message/send responses shifted by one — request N receives response to request N-1 #2326

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(security): sanitizer classifier 401 and regex false positives#2314

fix(security): sanitizer classifier 401 and regex false positives#2314
bug-ops merged 1 commit intomainfrom
2292-sanitizer-classifier-401

bug-ops commented Mar 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bug-ops commented Mar 28, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant