About Me

Hello! My name is Arthur Chen (陳皓楠). I’m a master’s student at the R2L Lab of the University of Waterloo, advised by Victor Zhong. I am also a researcher at the Vector Institute. During my undergraduate studies, I had the privilege of working with Jimmy Lin and Wenhu Chen building retrievers.

Research Interests

I sit at the intersection of Machine Learning (ML) and Natural Language Processing (NLP). To that end, my primary interest is using natural language to enable ML systems (e.g., intelligent agents) to automatically adapt to new environments. I’m particularly interested in:

  • Language Grounding: using language to establish a shared context between the ML model and the environment.
  • Test-time Adaptation: adapting ML models to new environments without human supervision at deployment/test-time.
  • Automatic Evaluation: evaluating ML model performance automatically with minimal human effort (e.g., labeling, human evaluation).

News

  • [Dec 2025]: Invited talk on “Test-Time Adaptation via Data Synthesis” at Bloomberg CTO Office.
  • [May 2025]: I started my internship at Salesforce AI Research!

Selected Publications

For update-to-date publications and preprints, please refer to Google Scholar.

GTTA Thumbnail
Grounded Test-Time Adaptation for LLM Agents
Arthur Chen, Zuxin Liu, Jianguo Zhang, Akshara Prabhakar, Zhiwei Liu, Shelby Heinecke, Silvio Savarese, Victor Zhong, Caiming Xiong
Introduced two adaptation strategies for LLM agents to adapt at test time without human supervision.
Links: paper · code · project page
SynQuE Thumbnail
SynQuE: Estimating Synthetic Dataset Quality Without Annotations
Arthur Chen, Victor Zhong
SynQuE is a framework and benchmark for ranking synthetic datasets by their expected real-world performance without requiring any labeled real data.
Links: paper · code · project page