Published May 6, 2026
| Version v1
Preprint
Open
Agent Trajectory Replay for Debugging Tool-Using AI Workflow Regressions
Authors/Creators
Description
Tool-using AI agents are difficult to debug because a failure may emerge from a sequence of planning steps, tool calls, intermediate errors, and final-output decisions rather than from a single response. This paper presents Agent Trajectory Replay, a small zero-dependency JavaScript package for summarizing, replaying, and diffing agent event traces. The package removes unstable timing fields before comparison, counts tool calls and errors, exposes final-output changes, and allows event handlers to rebuild state from a recorded trajectory. The contribution is a lightweight regression-debugging pattern for teams that need a simple way to compare agent behavior across model, prompt, or tool changes.
This artifact bundle includes the manuscript, PDF, workflow figure, bibliography, metadata, and source notes grounded in the agent-trajectory-replay package.
Files
agent-trajectory-replay-zenodo-package.zip
Files
(9.6 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:b7959c4812095e0653d879dac168c7bd
|
9.6 kB | Preview Download |