Problem
The chaos module has strong adversarial testing (8 attack types: prompt injection, policy bypass, privilege escalation, data exfiltration, tool abuse, identity spoofing) and infrastructure faults (latency, errors, timeouts). However, it lacks behavioral conflict scenarios:
- No deadlock detection (two agents waiting on each other's output)
- No contradictory instruction injection (agent receives conflicting directives)
- No dynamic trust perturbation mid-task (trust score changes during execution)
- Enterprise fault types (
delegation_reject, llm_degraded) raise NotImplementedError
What Exists
adversarial.py — 8 attack techniques + OWASP playbooks ✅
engine.py — FaultType enum with infrastructure faults ✅
test_scenarios.py — Trust degradation detection (event-based) ✅
Proposed Solution
- Add
DEADLOCK_INJECTION fault type — simulate circular dependency between agents
- Add
CONTRADICTORY_INSTRUCTION fault — inject conflicting directives mid-task
- Add
TRUST_PERTURBATION fault — dynamically change trust scores during execution
- Implement the stubbed
delegation_reject and llm_degraded enterprise faults
- Add corresponding experiment templates to
ChaosLibrary
Acceptance Criteria
/cc @imran-siddique
Problem
The chaos module has strong adversarial testing (8 attack types: prompt injection, policy bypass, privilege escalation, data exfiltration, tool abuse, identity spoofing) and infrastructure faults (latency, errors, timeouts). However, it lacks behavioral conflict scenarios:
delegation_reject,llm_degraded) raiseNotImplementedErrorWhat Exists
adversarial.py— 8 attack techniques + OWASP playbooks ✅engine.py—FaultTypeenum with infrastructure faults ✅test_scenarios.py— Trust degradation detection (event-based) ✅Proposed Solution
DEADLOCK_INJECTIONfault type — simulate circular dependency between agentsCONTRADICTORY_INSTRUCTIONfault — inject conflicting directives mid-taskTRUST_PERTURBATIONfault — dynamically change trust scores during executiondelegation_rejectandllm_degradedenterprise faultsChaosLibraryAcceptance Criteria
FaultTypeenum/cc @imran-siddique