docs: add multi tool agent failure modes troubleshooting note by onestardao · Pull Request #15 · RUC-NLPIR/DeepAgent

onestardao · 2026-02-23T07:36:02Z

Hi,

This PR adds the small docs-only troubleshooting note we discussed in #14.

Summary

Adds docs/multi_tool_agent_failure_modes.md, a short troubleshooting page focused on multi-tool agent failure modes when running ToolBench, API-Bank, ToolHop, GAIA, HLE and other benchmarks.
Links the new page from the README under a short “Troubleshooting multi tool failures” section near the evaluation flags.

The note is intentionally compact and DeepAgent-specific:

Starts with a one-screen quick checklist to run when the agent looks stuck or keeps calling strange tools.
Breaks down typical failure patterns (wrong tool, argument mismatch, environment/config mismatch, tool outputs ignored, etc.) into “symptom → likely cause → minimal checks”.
Keeps all examples aligned with the existing configuration flags and scripts in this repo.

Scope

Docs only, no code or config changes.
Does not affect any training or evaluation scripts.

Testing

Rendered both the README and docs/multi_tool_agent_failure_modes.md in GitHub’s preview to check formatting and links.

Closes #14.

Thanks for reviewing!

Point users to the new multi tool failure modes checklist from the main README.

sunnynexus · 2026-03-09T01:51:33Z

Thank you for your effort! We have merged this commit.

onestardao added 2 commits February 23, 2026 15:14

docs: add multi tool agent failure modes note

6381934

docs: link multi tool troubleshooting page from README

5533a0a

Point users to the new multi tool failure modes checklist from the main README.

sunnynexus merged commit 93c44b5 into RUC-NLPIR:main Mar 9, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: add multi tool agent failure modes troubleshooting note#15

docs: add multi tool agent failure modes troubleshooting note#15
sunnynexus merged 2 commits intoRUC-NLPIR:mainfrom
onestardao:main

onestardao commented Feb 23, 2026

Uh oh!

sunnynexus commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

onestardao commented Feb 23, 2026

Summary

Scope

Testing

Uh oh!

sunnynexus commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants