feat: add async evaluation and run artifacts by MukundaKatta · Pull Request #4 · MukundaKatta/AgentBench

MukundaKatta · 2026-04-20T14:44:20Z

Summary:

- add async evaluation support for agent callables without breaking sync usage
- add async agent comparison support
- define a structured run artifact bundle with a stable run id and leaderboard entry
- export benchmark bundles and leaderboard summaries to disk
- document async usage and artifact export in the README
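
As a rough illustration of the first point, the sketch below shows one common way to accept both sync and async agent callables behind a single evaluation entry point. It is not the code from this PR: the names `call_agent`, `evaluate_async`, `evaluate`, and the exact-match scoring are hypothetical, chosen only to show the pattern of awaiting a result when it is awaitable.

```python
import asyncio
import inspect
from typing import Awaitable, Callable, List, Tuple, Union

# Hypothetical alias: an agent maps a prompt to an answer, sync or async.
Agent = Callable[[str], Union[str, Awaitable[str]]]


async def call_agent(agent: Agent, prompt: str) -> str:
    """Call the agent and await its result only if the result is awaitable."""
    result = agent(prompt)
    if inspect.isawaitable(result):
        result = await result
    return result


async def evaluate_async(agent: Agent, cases: List[Tuple[str, str]]) -> float:
    """Return the fraction of (prompt, expected) cases answered exactly."""
    answers = await asyncio.gather(*(call_agent(agent, p) for p, _ in cases))
    hits = sum(a == expected for a, (_, expected) in zip(answers, cases))
    return hits / len(cases) if cases else 0.0


def evaluate(agent: Agent, cases: List[Tuple[str, str]]) -> float:
    """Sync entry point; assumes no event loop is already running."""
    return asyncio.run(evaluate_async(agent, cases))


def sync_agent(prompt: str) -> str:
    return prompt.upper()


async def async_agent(prompt: str) -> str:
    await asyncio.sleep(0)  # stand-in for real I/O such as an API call
    return prompt.upper()


CASES = [("a", "A"), ("b", "B"), ("c", "x")]
```

Under these assumptions, `evaluate(sync_agent, CASES)` and `evaluate(async_agent, CASES)` give the same score, so existing sync callers would not need to change.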

Closes #2
Closes #3
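
The run artifact bundle from the summary could look roughly like the sketch below. This is an assumption-laden illustration, not the structure defined in this PR: `RunArtifact`, `leaderboard_entry`, and `export_bundle` are hypothetical names, and deriving the run id from a hash of the run's contents is just one way to make an id stable across repeated exports.

```python
import hashlib
import json
from dataclasses import asdict, dataclass
from pathlib import Path


@dataclass(frozen=True)
class RunArtifact:
    """Hypothetical bundle for one benchmark run."""
    agent_name: str
    benchmark: str
    scores: dict  # assumed shape: metric name -> numeric value

    @property
    def run_id(self) -> str:
        # Assumption: the id is a content hash, so identical runs share an id.
        payload = json.dumps(asdict(self), sort_keys=True).encode("utf-8")
        return hashlib.sha256(payload).hexdigest()[:12]

    def leaderboard_entry(self) -> dict:
        values = list(self.scores.values())
        mean = sum(values) / len(values) if values else 0.0
        return {"run_id": self.run_id, "agent": self.agent_name, "mean_score": mean}


def export_bundle(artifact: RunArtifact, out_dir) -> Path:
    """Write the bundle and its leaderboard entry to <out_dir>/<run_id>.json."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    bundle = {
        **asdict(artifact),
        "run_id": artifact.run_id,
        "leaderboard": artifact.leaderboard_entry(),
    }
    path = out / f"{artifact.run_id}.json"
    path.write_text(json.dumps(bundle, indent=2, sort_keys=True), encoding="utf-8")
    return path
```

Writing one JSON file per run id keeps re-exports of the same run idempotent, at the cost of treating two identical runs as one.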

Testing:

python3 -m py_compile src/agentbench/core.py src/agentbench/__init__.py tests/test_core.py
PYTHONPATH=/tmp/AgentBench-fix/src python3 -m pytest /tmp/AgentBench-fix/tests

feat: add async evaluation and run artifacts

ae4b7a8

MukundaKatta merged commit 37fde6c into main Apr 20, 2026
0 of 3 checks passed
