Zizhuo Zhang 

Ph.D. @ TMLR Group, HKBU

Trustworthy Machine Learning and Reasoning (TMLR) Group,
Department of Computer Science, Hong Kong Baptist University (HKBU)

[Google Scholar] [Github] [Xiaohongshu] [LinkedIn]
E-mail: [email protected] or [email protected]

About Me

I am a Ph.D. student at Trustworthy Machine Learning and Reasoning (TMLR) group in the Department of Computer Science, Hong Kong Baptist University (HKBU), advised by Prof. Bo Han and Prof. Jiangchao Yao. Recently, my research focuses on Large Language Model (LLM) Post-training, including reinforcement learning (RL), LLM preference alignment, and LLM unlearning. Previously, I also gained research experience in Recommender System and AI4Science, including news recommendation and molecular docking. I received my MPhil Eng. degree and Bachelor Eng. degree from Huazhong University of Science and Technology (HUST), advised by Prof. Bang Wang. I am always open to discussions and possible collaborations. Please feel free to email me if you would like to chat.

Selected Research Work

(* indicates equal contribution. Full list can be found on my Google Scholar .)

Prompt4NR
Prompt Learning for News Recommendation
Zizhuo Zhang, Bang Wang
SIGIR 2023 (Full paper, oral). [paper]. [code]
FABFlex
Fast and Accurate Blind Flexible Docking
Zizhuo Zhang, Lijun Wu, Kaiyuan Gao, Jiangchao Yao, Tao Qin, Bo Han
ICLR 2025. [paper]. [code]. [slides]
Co-rewarding
Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models
Zizhuo Zhang*, Jianing Zhu*, Xinmu Ge*, Zihua Zhao*, Zhanke Zhou, Xuan Li, Xiao Feng, Jiangchao Yao, Bo Han
Arxiv:2508.00410. NeurIPS 2025 Workshop MATH-AI. [paper]. [code]
LossDiff-IRM
Towards Understanding Valuable Preference Data for Large Language Model Alignment
Zizhuo Zhang, Qizhou Wang, Shanshan Ye, Jianing Zhu, Jiangchao Yao, Bo Han, Masashi Sugiyama
Arxiv:2510.13212. [paper].

Education

Experience

Honors & Awards

Competitions

Academic Services

Teaching Assistant

(Assistance in following courses)