[Google Scholar] [Github] [Twitter] [LinkedIn]
I am currently a third-year Ph.D. Candidate at Institute for AI, School of Intelligence Science and Technology, Peking University. Specifically, I am in the team of PAIR-Lab led by Prof. Yaodong Yang. The long-term goal of my research is to build a strong and human-like AI system. To this end, my research focuses on AI Alignment, Reinforcement Learning, and Multi-Agent System. In particular, I am currently quite interested in investigating the complete closed-loop process for LLM alignment, which includes exploring AI for finding human consensus, RL for improving LLM's instruction following, and test-time alignment algorithms. I welcome more friends to discuss these topics with me ☺️.