test(classifiers): add unit tests and benchmarks for CandleClassifier (#2190) by bug-ops · Pull Request #2212 · bug-ops/zeph

bug-ops · 2026-03-27T07:52:38Z

Summary

Add 9 fast unit tests for CandleClassifier: constructor, backend_name, Debug format, Arc clone sharing, and 5 validate_safetensors edge cases (valid header, truncated buffer, wrong magic, zero-length header, oversized claim)
Add 6 #[ignore] integration tests requiring HF Hub model download: real injection/safe classification, input chunking, empty input, error caching, download timeout
Add 3 Criterion benchmarks in crates/zeph-llm/benches/classifier.rs measuring warm inference latency for injection, safe, and long-input chunking scenarios (public ClassifierBackend::classify API, tokio runtime, black_box on results)

Deferred

feat(classifiers): add NER/PII token-level classifier backend (piiranha) #2211 — PII/NER token-level classifier backend (piiranha): requires DebertaV2NERModel, new result type with token-level spans, and BIO span decoding — not implementable with the current sequence classification backend
feat(core): add DetectorMode::Model for classifier-backed feedback detection #2210 — DetectorMode::Model variant for FeedbackDetector

Test plan

cargo +nightly fmt --check — passes
cargo clippy --workspace --features full -- -D warnings — passes (0 warnings)
cargo nextest run --config-file .github/nextest.toml --workspace --features full --lib --bins — 6605 passed (+9 vs baseline 6596)
#[ignore] tests: run manually with cargo nextest run -p zeph-llm --features classifiers -- --ignored after pre-downloading protectai/deberta-v3-small-prompt-injection-v2

…#2190) Add 9 fast unit tests for CandleClassifier covering constructor, backend_name, Debug format, Arc clone sharing, and 5 validate_safetensors edge cases. Add 6 #[ignore] integration tests for real-model inference, chunking, empty input, and error handling (require HF Hub download). Add 3 Criterion benchmarks measuring warm inference latency for injection, safe, and long-input scenarios. PII/NER token-level tests deferred to #2211 (piiranha requires NER backend). FeedbackDetector Model variant deferred to #2210.

github-actions bot added tests Test-related changes size/L Large PR (201-500 lines) llm zeph-llm crate (Ollama, Claude) rust Rust code changes dependencies Dependency updates and removed tests Test-related changes labels Mar 27, 2026

bug-ops enabled auto-merge (squash) March 27, 2026 07:52

bug-ops merged commit 7014895 into main Mar 27, 2026
25 checks passed

bug-ops deleted the candle-classifier-tests branch March 27, 2026 08:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(classifiers): add unit tests and benchmarks for CandleClassifier (#2190)#2212

test(classifiers): add unit tests and benchmarks for CandleClassifier (#2190)#2212
bug-ops merged 1 commit intomainfrom
candle-classifier-tests

bug-ops commented Mar 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bug-ops commented Mar 27, 2026

Summary

Deferred

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant