


Shunyu Yao 0006
Person information
- affiliation: OpenAI, USA
- affiliation: Princeton University, USA
Other persons with the same name
- Shunyu Yao (disambiguation page)
- Shunyu Yao 0001: China Telecom Research Institute, Institute of Big Data and Artificial Intelligence, Beijing, China
- Shunyu Yao 0002: City University of Hong Kong, Hong Kong
- Shunyu Yao 0003: China Institute of Water Resources and Hydropower Research, Beijing, China
- Shunyu Yao 0004: Dalian University of Technology, School of Software Technology, China
- Shunyu Yao 0005: University of Arizona, Department of Systems and Industrial Engineering, Tucson, AZ, USA
2020 – today
- 2025
[c23]Yitao Liu, Chenglei Si, Karthik R. Narasimhan, Shunyu Yao:
Contextual Experience Replay for Self-Improvement of Language Agents. ACL (1) 2025: 14179-14198
[c22]Bowen Li, Wenhan Wu, Ziwei Tang, Lin Shi, John Yang, Jinyang Li, Shunyu Yao, Chen Qian, Binyuan Hui, Qicheng Zhang, Zhiyin Yu, He Du, Ping Yang, Dahua Lin, Chao Peng, Kai Chen:
Prompting Large Language Models to Tackle the Full Software Development Lifecycle: A Case Study. COLING 2025: 7511-7531
[c21]Shunyu Yao, Noah Shinn, Pedram Razavi, Karthik R. Narasimhan:
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains. ICLR 2025
[i26]Quan Shi, Carlos E. Jimenez, Shunyu Yao, Nick Haber, Diyi Yang, Karthik Narasimhan:
When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration. CoRR abs/2506.05579 (2025)
[i25]Yitao Liu, Chenglei Si, Karthik Narasimhan, Shunyu Yao:
Contextual Experience Replay for Self-Improvement of Language Agents. CoRR abs/2506.06698 (2025)
- 2024
[j1]Theodore R. Sumers, Shunyu Yao, Karthik Narasimhan, Thomas L. Griffiths:
Cognitive Architectures for Language Agents. Trans. Mach. Learn. Res. 2024 (2024)
[c20]Michael Tang, Shunyu Yao, John Yang, Karthik Narasimhan:
Referral Augmentation for Zero-Shot Information Retrieval. ACL (Findings) 2024: 13452-13461
[c19]Yu Su, Diyi Yang, Shunyu Yao, Tao Yu:
Language Agents: Foundations, Prospects, and Risks. EMNLP (Tutorial Abstracts) 2024: 17-24
[c18]Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, Karthik R. Narasimhan:
SWE-bench: Can Language Models Resolve Real-world Github Issues? ICLR 2024
[c17]Shunyu Yao, Howard Chen, Austin W. Hanjie, Runzhe Yang, Karthik R. Narasimhan:
COLLIE: Systematic Construction of Constrained Text Generation Tasks. ICLR 2024
[c16]John Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, Ofir Press:
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering. NeurIPS 2024
[i24]Zhiyong Wu, Chengcheng Han, Zichen Ding, Zhenmin Weng, Zhoumianze Liu, Shunyu Yao, Tao Yu, Lingpeng Kong:
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement. CoRR abs/2402.07456 (2024)
[i23]Bowen Li, Wenhan Wu, Ziwei Tang, Lin Shi, John Yang, Jinyang Li, Shunyu Yao, Chen Qian, Binyuan Hui, Qicheng Zhang, Zhiyin Yu, He Du, Ping Yang, Dahua Lin, Chao Peng, Kai Chen:
DevBench: A Comprehensive Benchmark for Software Development. CoRR abs/2403.08604 (2024)
[i22]Quan Shi, Michael Tang, Karthik Narasimhan, Shunyu Yao:
Can Language Models Solve Olympiad Programming? CoRR abs/2404.10952 (2024)
[i21]John Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, Ofir Press:
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering. CoRR abs/2405.15793 (2024)
[i20]Shunyu Yao, Noah Shinn, Pedram Razavi, Karthik Narasimhan:
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains. CoRR abs/2406.12045 (2024)
[i19]R. Thomas McCoy, Shunyu Yao, Dan Friedman, Mathew D. Hardy, Thomas L. Griffiths:
When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1. CoRR abs/2410.01792 (2024)
- 2023
[c15]Yao Mu, Shunyu Yao, Mingyu Ding, Ping Luo, Chuang Gan:
EC2: Emergent Communication for Embodied Control. CVPR 2023: 6704-6714
[c14]Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik R. Narasimhan, Yuan Cao:
ReAct: Synergizing Reasoning and Acting in Language Models. ICLR 2023
[c13]Noah Shinn, Federico Cassano, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao:
Reflexion: language agents with verbal reinforcement learning. NeurIPS 2023
[c12]John Yang, Akshara Prabhakar, Karthik Narasimhan, Shunyu Yao:
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback. NeurIPS 2023
[c11]Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Tom Griffiths, Yuan Cao, Karthik Narasimhan:
Tree of Thoughts: Deliberate Problem Solving with Large Language Models. NeurIPS 2023
[i18]Yao Mu, Shunyu Yao, Mingyu Ding, Ping Luo, Chuang Gan:
EC^2: Emergent Communication for Embodied Control. CoRR abs/2304.09448 (2023)
[i17]Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Thomas L. Griffiths, Yuan Cao, Karthik Narasimhan:
Tree of Thoughts: Deliberate Problem Solving with Large Language Models. CoRR abs/2305.10601 (2023)
[i16]Michael Tang, Shunyu Yao, John Yang, Karthik Narasimhan:
Referral Augmentation for Zero-Shot Information Retrieval. CoRR abs/2305.15098 (2023)
[i15]John Yang, Akshara Prabhakar, Karthik Narasimhan, Shunyu Yao:
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback. CoRR abs/2306.14898 (2023)
[i14]Shunyu Yao, Howard Chen, Austin W. Hanjie, Runzhe Yang, Karthik Narasimhan:
COLLIE: Systematic Construction of Constrained Text Generation Tasks. CoRR abs/2307.08689 (2023)
[i13]Theodore R. Sumers, Shunyu Yao, Karthik Narasimhan, Thomas L. Griffiths:
Cognitive Architectures for Language Agents. CoRR abs/2309.02427 (2023)
[i12]R. Thomas McCoy, Shunyu Yao, Dan Friedman, Matthew Hardy, Thomas L. Griffiths:
Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve. CoRR abs/2309.13638 (2023)
[i11]Baian Chen, Chang Shu, Ehsan Shareghi, Nigel Collier, Karthik Narasimhan, Shunyu Yao:
FireAct: Toward Language Agent Fine-tuning. CoRR abs/2310.05915 (2023)
[i10]Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, Karthik Narasimhan:
SWE-bench: Can Language Models Resolve Real-World GitHub Issues? CoRR abs/2310.06770 (2023)
- 2022
[c10]Yi Gu, Shunyu Yao, Chuang Gan, Josh Tenenbaum, Mo Yu:
Revisiting the Roles of "Text" in Text Games. EMNLP (Findings) 2022: 6867-6876
[c9]Jens Tuyls, Shunyu Yao, Sham M. Kakade, Karthik Narasimhan:
Multi-Stage Episodic Control for Strategic Exploration in Text Games. ICLR 2022
[c8]Shunyu Yao, Mo Yu, Yang Zhang, Karthik R. Narasimhan, Joshua B. Tenenbaum, Chuang Gan:
Linking Emergent and Natural Languages via Corpus Transfer. ICLR 2022
[c7]Shunyu Yao, Howard Chen, John Yang, Karthik Narasimhan:
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents. NeurIPS 2022
[i9]Jens Tuyls, Shunyu Yao, Sham M. Kakade, Karthik Narasimhan:
Multi-Stage Episodic Control for Strategic Exploration in Text Games. CoRR abs/2201.01251 (2022)
[i8]Shunyu Yao, Mo Yu, Yang Zhang, Karthik R. Narasimhan, Joshua B. Tenenbaum, Chuang Gan:
Linking Emergent and Natural Languages via Corpus Transfer. CoRR abs/2203.13344 (2022)
[i7]Shunyu Yao, Howard Chen, John Yang, Karthik Narasimhan:
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents. CoRR abs/2207.01206 (2022)
[i6]Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao:
ReAct: Synergizing Reasoning and Acting in Language Models. CoRR abs/2210.03629 (2022)
[i5]Yi Gu, Shunyu Yao, Chuang Gan, Joshua B. Tenenbaum, Mo Yu:
Revisiting the Roles of "Text" in Text Games. CoRR abs/2210.08384 (2022)
- 2021
[c6]Shunyu Yao, Binghui Peng, Christos H. Papadimitriou, Karthik Narasimhan:
Self-Attention Networks Can Process Bounded Hierarchical Languages. ACL/IJCNLP (1) 2021: 3770-3785
[c5]Shunyu Yao, Karthik Narasimhan, Matthew J. Hausknecht:
Reading and Acting while Blindfolded: The Need for Semantics in Text Game Agents. NAACL-HLT 2021: 3097-3102
[i4]Shunyu Yao, Karthik Narasimhan, Matthew J. Hausknecht:
Reading and Acting while Blindfolded: The Need for Semantics in Text Game Agents. CoRR abs/2103.13552 (2021)
[i3]Shunyu Yao, Binghui Peng, Christos H. Papadimitriou, Karthik Narasimhan:
Self-Attention Networks Can Process Bounded Hierarchical Languages. CoRR abs/2105.11115 (2021)
- 2020
[c4]Kevin Smith, Lingjie Mei, Shunyu Yao, Jiajun Wu, Elizabeth S. Spelke, Josh Tenenbaum, Tomer D. Ullman:
The fine structure of surprise in intuitive physics: when, why, and how much? CogSci 2020
[c3]Shunyu Yao, Rohan Rao, Matthew J. Hausknecht, Karthik Narasimhan:
Keep CALM and Explore: Language Models for Action Generation in Text-based Games. EMNLP (1) 2020: 8736-8754
[i2]Shunyu Yao, Rohan Rao, Matthew J. Hausknecht, Karthik Narasimhan:
Keep CALM and Explore: Language Models for Action Generation in Text-based Games. CoRR abs/2010.02903 (2020)
2010 – 2019
- 2019
[c2]Kevin Smith, Lingjie Mei, Shunyu Yao, Jiajun Wu, Elizabeth S. Spelke, Josh Tenenbaum, Tomer D. Ullman:
Modeling Expectation Violation in Intuitive Physics with Coarse Probabilistic Object Representations. NeurIPS 2019: 8983-8993
- 2018
[c1]Shunyu Yao, Tzu-Ming Harry Hsu, Jun-Yan Zhu, Jiajun Wu, Antonio Torralba, Bill Freeman, Josh Tenenbaum:
3D-Aware Scene Manipulation via Inverse Graphics. NeurIPS 2018: 1891-1902
[i1]Shunyu Yao, Tzu-Ming Harry Hsu, Jun-Yan Zhu, Jiajun Wu, Antonio Torralba, William T. Freeman, Joshua B. Tenenbaum:
3D-Aware Scene Manipulation via Inverse Graphics. CoRR abs/1808.09351 (2018)

last updated on 2026-03-21 23:41 CET by the dblp team
all metadata released as open data under CC0 1.0 license







