default search action

combined dblp search
author search
venue search
publication search

ask others

Runze Liu 0002

> Home > Persons

Person information

affiliation: Tsinghua University, Shenzhen, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2026
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhaoLZZGLLQQLZ26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhaoLZZGLLQQLZ26
Jian Zhao, Runze Liu, Kaiyan Zhang, Zhimu Zhou, Junqi Gao, Dong Li, Jiafei Lyu, Zhouyi Qian, Biqing Qi, Xiu Li, Bowen Zhou:
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning. AAAI 2026: 34932-34940
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2602-11800
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2602-11800
Jiafei Lyu, Jingwen Yang, Zhongjian Qiao, Runze Liu, Zeyuan Liu, Deheng Ye, Zongqing Lu, Xiu Li:
Temporal Difference Learning with Constrained Initial Representations. CoRR abs/2602.11800 (2026)
2025
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/kbs/SunLLYZL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/kbs/SunLLYZL25
Shengjie Sun, Runze Liu, Jiafei Lyu, Jingwen Yang, Liangpeng Zhang, Xiu Li:
A large language model-driven reward design framework via dynamic feedback for reinforcement learning. Knowl. Based Syst. 326: 114065 (2025)
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/Bai000025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Bai000025
Fengshuo Bai, Runze Liu, Yali Du, Ying Wen, Yaodong Yang:
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors. AAAI 2025: 15453-15461
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZengTZWGLYLLMQZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZengTZWGLYLLMQZ25
Sihang Zeng, Kai Tian, Kaiyan Zhang, Yuru Wang, Junqi Gao, Runze Liu, Sa Yang, Jingxuan Li, Xinwei Long, Jiaheng Ma, Biqing Qi, Bowen Zhou:
ReviewRL: Towards Automated Scientific Review with RL. EMNLP 2025: 16931-16943
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/emnlp/LiuBLSDL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/LiuBLSDL25
Runze Liu, Chenjia Bai, Jiafei Lyu, Shengjie Sun, Yali Du, Xiu Li:
VLP: Vision-Language Preference Learning for Embodied Manipulation. EMNLP 2025: 28428-28444
[c5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LyuYQ0MYY0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LyuYQ0MYY0025
Jiafei Lyu, Mengbei Yan, Zhongjian Qiao, Runze Liu, Xiaoteng Ma, Deheng Ye, Jingwen Yang, Zongqing Lu, Xiu Li:
Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint. ICLR 2025
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-06703
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-06703
Runze Liu, Junqi Gao, Jian Zhao, Kaiyan Zhang, Xiu Li, Biqing Qi, Wanli Ouyang, Bowen Zhou:
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling. CoRR abs/2502.06703 (2025)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-11918
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-11918
Runze Liu, Chenjia Bai, Jiafei Lyu, Shengjie Sun, Yali Du, Xiu Li:
VLP: Vision-Language Preference Learning for Embodied Manipulation. CoRR abs/2502.11918 (2025)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-00891
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-00891
Jian Zhao, Runze Liu, Kaiyan Zhang, Zhimu Zhou, Junqi Gao, Dong Li, Jiafei Lyu, Zhouyi Qian, Biqing Qi, Xiu Li, Bowen Zhou:
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning. CoRR abs/2504.00891 (2025)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-15778
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-15778
Jiakang Wang, Runze Liu, Fuzheng Zhang, Xiu Li, Guorui Zhou:
Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR. CoRR abs/2507.15778 (2025)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2508-10308
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-10308
Sihang Zeng, Kai Tian, Kaiyan Zhang, Yuru wang, Junqi Gao, Runze Liu, Sa Yang, Jingxuan Li, Xinwei Long, Jiaheng Ma, Biqing Qi, Bowen Zhou:
ReviewRL: Towards Automated Scientific Review with RL. CoRR abs/2508.10308 (2025)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-08827
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-08827
Kaiyan Zhang, Yuxin Zuo, Bingxiang He, Youbang Sun, Runze Liu, Che Jiang, Yuchen Fan, Kai Tian, Guoli Jia, Pengfei Li, Yu Fu, Xingtai Lv, Yuchen Zhang, Sihang Zeng, Shang Qu, Haozhan Li, Shijie Wang, Yuru Wang, Xinwei Long, Fangfu Liu, Xiang Xu, Jiaze Ma, Xuekai Zhu, Ermo Hua, Yihao Liu, Zonglin Li, Huayu Chen, Xiaoye Qu, Yafu Li, Weize Chen, Zhenzhao Yuan, Junqi Gao, Dong Li, Zhiyuan Ma, Ganqu Cui, Zhiyuan Liu, Biqing Qi, Ning Ding, Bowen Zhou:
A Survey of Reinforcement Learning for Large Reasoning Models. CoRR abs/2509.08827 (2025)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-26628
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-26628
Runze Liu, Jiakang Wang, Yuling Shi, Zhihui Xie, Chenxin An, Kaiyan Zhang, Jian Zhao, Xiaodong Gu, Lei Lin, Wenping Hu, Xiu Li, Fuzheng Zhang, Guorui Zhou, Kun Gai:
Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models. CoRR abs/2509.26628 (2025)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-06062
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-06062
Jiakang Wang, Runze Liu, Lei Lin, Wenping Hu, Xiu Li, Fuzheng Zhang, Guorui Zhou, Kun Gai:
ASPO: Asymmetric Importance Sampling Policy Optimization. CoRR abs/2510.06062 (2025)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-13765
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-13765
Shengjie Sun, Jiafei Lyu, Runze Liu, Mengbei Yan, Bo Liu, Deheng Ye, Xiu Li:
PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning. CoRR abs/2511.13765 (2025)
2024
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icann/SunLLGYLL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icann/SunLLGYLL24
Shengjie Sun, Jiafei Lyu, Lu Li, Jiazhe Guo, Mengbei Yan, Runze Liu, Xiu Li:
Enhancing Visual Generalization in Reinforcement Learning with Cycling Augmentation. ICANN (4) 2024: 397-411
[c3]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LyuMWL0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LyuMWL0L24
Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu:
SEABO: A Simple Search-Based Method for Offline Imitation Learning. ICLR 2024
[c2]
- view
- export record
  dblp key:
  - conf/icml/Liu0BL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Liu0BL024
Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li:
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation. ICML 2024: 30946-30964
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-03807
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-03807
Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu:
SEABO: A Simple Search-Based Method for Offline Imitation Learning. CoRR abs/2402.03807 (2024)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-14660
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-14660
Shengjie Sun, Runze Liu, Jiafei Lyu, Jingwen Yang, Liangpeng Zhang, Xiu Li:
A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning. CoRR abs/2410.14660 (2024)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-10713
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-10713
Fengshuo Bai, Runze Liu, Yali Du, Ying Wen, Yaodong Yang:
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors. CoRR abs/2412.10713 (2024)
2023
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-03615
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-03615
Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li:
Zero-shot Preference Learning for Offline RL via Optimal Transport. CoRR abs/2306.03615 (2023)
2022
[c1]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiuBD022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuBD022
Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang:
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning. NeurIPS 2022

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.