


default search action
Runze Liu 0002
Person information
- affiliation: Tsinghua University, Shenzhen, China
Other persons with the same name
- Runze Liu — disambiguation page
- Runze Liu 0001 — Beijing Real Imaging Medical Technology Co., Ltd. (and 1 more)
- Runze Liu 0003
— Pennsylvania State University, PA, USA - Runze Liu 0004 — North Carolina State University, Raleigh, NC, USA
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
[c9]Jian Zhao, Runze Liu, Kaiyan Zhang, Zhimu Zhou, Junqi Gao, Dong Li, Jiafei Lyu, Zhouyi Qian, Biqing Qi, Xiu Li, Bowen Zhou:
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning. AAAI 2026: 34932-34940
[i14]Jiafei Lyu, Jingwen Yang, Zhongjian Qiao, Runze Liu, Zeyuan Liu, Deheng Ye, Zongqing Lu, Xiu Li:
Temporal Difference Learning with Constrained Initial Representations. CoRR abs/2602.11800 (2026)- 2025
[j1]Shengjie Sun
, Runze Liu, Jiafei Lyu
, Jingwen Yang, Liangpeng Zhang, Xiu Li
:
A large language model-driven reward design framework via dynamic feedback for reinforcement learning. Knowl. Based Syst. 326: 114065 (2025)
[c8]Fengshuo Bai, Runze Liu, Yali Du, Ying Wen, Yaodong Yang:
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors. AAAI 2025: 15453-15461
[c7]Sihang Zeng, Kai Tian, Kaiyan Zhang, Yuru Wang, Junqi Gao, Runze Liu, Sa Yang, Jingxuan Li, Xinwei Long, Jiaheng Ma, Biqing Qi, Bowen Zhou:
ReviewRL: Towards Automated Scientific Review with RL. EMNLP 2025: 16931-16943
[c6]Runze Liu, Chenjia Bai, Jiafei Lyu, Shengjie Sun, Yali Du, Xiu Li:
VLP: Vision-Language Preference Learning for Embodied Manipulation. EMNLP 2025: 28428-28444
[c5]Jiafei Lyu, Mengbei Yan, Zhongjian Qiao, Runze Liu, Xiaoteng Ma, Deheng Ye, Jingwen Yang, Zongqing Lu, Xiu Li:
Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint. ICLR 2025
[i13]Runze Liu, Junqi Gao, Jian Zhao, Kaiyan Zhang, Xiu Li, Biqing Qi, Wanli Ouyang, Bowen Zhou:
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling. CoRR abs/2502.06703 (2025)
[i12]Runze Liu, Chenjia Bai, Jiafei Lyu, Shengjie Sun, Yali Du, Xiu Li:
VLP: Vision-Language Preference Learning for Embodied Manipulation. CoRR abs/2502.11918 (2025)
[i11]Jian Zhao, Runze Liu, Kaiyan Zhang, Zhimu Zhou, Junqi Gao, Dong Li, Jiafei Lyu, Zhouyi Qian, Biqing Qi, Xiu Li, Bowen Zhou:
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning. CoRR abs/2504.00891 (2025)
[i10]Jiakang Wang, Runze Liu, Fuzheng Zhang, Xiu Li, Guorui Zhou:
Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR. CoRR abs/2507.15778 (2025)
[i9]Sihang Zeng, Kai Tian, Kaiyan Zhang, Yuru wang, Junqi Gao, Runze Liu, Sa Yang, Jingxuan Li, Xinwei Long, Jiaheng Ma, Biqing Qi, Bowen Zhou:
ReviewRL: Towards Automated Scientific Review with RL. CoRR abs/2508.10308 (2025)
[i8]Kaiyan Zhang, Yuxin Zuo, Bingxiang He, Youbang Sun, Runze Liu, Che Jiang, Yuchen Fan, Kai Tian, Guoli Jia, Pengfei Li, Yu Fu, Xingtai Lv, Yuchen Zhang, Sihang Zeng, Shang Qu, Haozhan Li, Shijie Wang, Yuru Wang, Xinwei Long, Fangfu Liu, Xiang Xu, Jiaze Ma, Xuekai Zhu, Ermo Hua, Yihao Liu, Zonglin Li, Huayu Chen, Xiaoye Qu, Yafu Li, Weize Chen, Zhenzhao Yuan, Junqi Gao, Dong Li, Zhiyuan Ma, Ganqu Cui, Zhiyuan Liu, Biqing Qi, Ning Ding, Bowen Zhou:
A Survey of Reinforcement Learning for Large Reasoning Models. CoRR abs/2509.08827 (2025)
[i7]Runze Liu, Jiakang Wang, Yuling Shi, Zhihui Xie, Chenxin An, Kaiyan Zhang, Jian Zhao, Xiaodong Gu, Lei Lin, Wenping Hu, Xiu Li, Fuzheng Zhang, Guorui Zhou, Kun Gai:
Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models. CoRR abs/2509.26628 (2025)
[i6]Jiakang Wang, Runze Liu, Lei Lin, Wenping Hu, Xiu Li, Fuzheng Zhang, Guorui Zhou, Kun Gai:
ASPO: Asymmetric Importance Sampling Policy Optimization. CoRR abs/2510.06062 (2025)
[i5]Shengjie Sun, Jiafei Lyu, Runze Liu, Mengbei Yan, Bo Liu, Deheng Ye, Xiu Li:
PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning. CoRR abs/2511.13765 (2025)- 2024
[c4]Shengjie Sun, Jiafei Lyu, Lu Li, Jiazhe Guo, Mengbei Yan, Runze Liu, Xiu Li:
Enhancing Visual Generalization in Reinforcement Learning with Cycling Augmentation. ICANN (4) 2024: 397-411
[c3]Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu:
SEABO: A Simple Search-Based Method for Offline Imitation Learning. ICLR 2024
[c2]Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li:
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation. ICML 2024: 30946-30964
[i4]Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu:
SEABO: A Simple Search-Based Method for Offline Imitation Learning. CoRR abs/2402.03807 (2024)
[i3]Shengjie Sun, Runze Liu, Jiafei Lyu, Jingwen Yang, Liangpeng Zhang, Xiu Li:
A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning. CoRR abs/2410.14660 (2024)
[i2]Fengshuo Bai, Runze Liu, Yali Du, Ying Wen, Yaodong Yang:
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors. CoRR abs/2412.10713 (2024)- 2023
[i1]Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li:
Zero-shot Preference Learning for Offline RL via Optimal Transport. CoRR abs/2306.03615 (2023)- 2022
[c1]Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang:
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning. NeurIPS 2022
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-03-27 21:06 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







