research(reliability): Characterizing Faults in Agentic AI — empirical taxonomy of 385 faults across 40 OSS agent repos (arXiv:2603.06847)

## Summary

arXiv:2603.06847 — submitted 6 March 2026.

Empirical grounded-theory study of 385 faults sampled from 13,602 GitHub issues across 40 open-source agentic AI repos; derives taxonomies of fault types, observable symptoms, and root causes specific to tool-invocation and long-horizon task execution.

## Applicability to Zeph

**HIGH** — Direct complement to #2203 (12-category taxonomy, arXiv:2601.16280): that paper is dataset-derived and taxonomic; this one is empirically grounded in real OSS agent repos. The symptom/root-cause linkage is directly usable for designing Zeph's error-handler dispatch logic and retry signal classification in `zeph-tools`.

## Implementation Sketch

- Map the paper's symptom taxonomy to existing `ToolError` variants in `zeph-tools`
- Use root-cause linkage to inform retry strategy in the error taxonomy design from #2203
- Potential: structured error classification at the `ShellExecutor` and `WebScrapeExecutor` level

## References

- https://arxiv.org/abs/2603.06847
- Related: #2203 (12-category taxonomy), #2199 (AgentDebug structured feedback)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

research(reliability): Characterizing Faults in Agentic AI — empirical taxonomy of 385 faults across 40 OSS agent repos (arXiv:2603.06847) #2206

Summary

Applicability to Zeph

Implementation Sketch

References

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

research(reliability): Characterizing Faults in Agentic AI — empirical taxonomy of 385 faults across 40 OSS agent repos (arXiv:2603.06847) #2206

Description

Summary

Applicability to Zeph

Implementation Sketch

References

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions