Skip to content
/ GTTA Public

This codebase is to reproduce the results of the paper "Grounded Test-Time Adaptation for LLM Agents".

License

Notifications You must be signed in to change notification settings

r2llab/GTTA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Grounded Test-Time Adaptation

This is the codebase to reproduce the results of the paper Grounded Test-Time Adaptation for LLM Agents.

Parametric Adaptation Framework Non-Parametric Adaptation Framework

WebArena

We adopt NNetnav's codebase for web navigation exploration and task evaluation. To reproduce our results on WebArena, please refer to this.

BFCLv3

For BFCLv3 experiment, we modify our method based on the official gorilla codebase. To reproduce our results on BFCLv3, please refer to this.

Tau-Bench

For Tau-Bench experiment, please refer to official codebase with parametric adaptation enabled.

Citation

If you find this work useful, please cite:

@article{chen2025grounded,
  title={Grounded Test-Time Adaptation for LLM Agents},
  author={Chen, Arthur and Liu, Zuxin and Zhang, Jianguo and Prabhakar, Akshara and Liu, Zhiwei and Heinecke, Shelby and Savarese, Silvio and Zhong, Victor and Xiong, Caiming},
  journal={arXiv preprint arXiv:2511.04847},
  year={2025}
}

License

This work is licensed under the MIT License.

About

This codebase is to reproduce the results of the paper "Grounded Test-Time Adaptation for LLM Agents".

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published