Research Interests
- machine learning
- computer vision
- foundation models
- multimodal agents
- embodied AI
Prof. Kaiyang Zhou is an Assistant Professor in the Department of Computer Science at Hong Kong Baptist University. His research interests include machine learning, computer vision, and multimodality. He has published an edited book on Large Vision-Language Models and more than 50 journal and conference papers in top-tier venues, such as TPAMI, TIP, IJCV, CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML, and AAAI. His work has been cited over 18,000 times. He is an associate editor for the International Journal of Computer Vision and regularly serves as an area chair for prestigious conferences such as CVPR, ECCV, NeurIPS, ICML, and ICLR. Before joining HKBU, he was a postdoc at Nanyang Technological University, working with Prof. Ziwei Liu and Prof. Chen Change Loy. He received his PhD in Computer Science from the University of Surrey, under the supervision of Prof. Tao Xiang.
News
- Dec 2025 Invited to serve as area chair of ECCV 2026.
- Nov 2025 Invited to serve as area chair of ICML 2026.
- Sep 2025 Our edited book Large Vision-Language Models is online.
- Aug 2025 Invited to serve as area chair of ICLR 2026.
- Aug 2025 Invited to serve as area chair of CVPR 2026.
- Jul 2025 Invited to serve as area chair of AAAI 2026.
Research
I am interested in developing ML models that can learn effectively from limited supervision, generalize across domains, and understand and interact with the world through multiple modalities. My recent work focuses on multimodal foundation models, particularly in the areas of reasoning, grounding, safety, efficiency, and video understanding.
Team
I am recruiting motivated PhD students and research assistants interested in LLMs, VLMs, agents, and robotics. Ideal candidates have a strong background in ML/CV/NLP, solid coding skills, and prior research experience. If you are passionate about doing cutting-edge AI research with us, please email me your CV, transcripts, relevant publications or projects, and a research statement (if any).
PhD Students
- Jiaer Xia (2024 - Present)
- Sifeng Shang (2024 - Present)
- Jiayi Zhou (2025 - Present)
- Chenyu Lin (2025 - Present)
Research Assistants
- Linchao Pan (2025 - Present)
- Haichen He (2025 - Present)
Alumni
- Yu Tong (RA 2025)
- Bingkui Tong (RA 2024-25, now PhD at MBZUAI)
Teaching
Services
- Associate Editor, International Journal of Computer Vision (IJCV) (2023 - Present)
- Guest Editor, IJCV Special Issue on Visual Domain Generalization in Real-World Applications (2024)
- Guest Editor, IJCV Special Issue on The Promises and Dangers of Large Vision Models (2023)
- Area Chair, International Conference on Machine Learning (ICML) (2025, 2026)
- Area Chair, International Conference on Learning Representations (ICLR) (2025, 2026)
- Area Chair, Neural Information Processing Systems (NeurIPS) (2024, 2025)
- Area Chair, Computer Vision and Pattern Recognition (CVPR) (2024, 2026)
- Area Chair, European Conference on Computer Vision (ECCV) (2024, 2026)
- Area Chair, AAAI Conference on Artificial Intelligence (AAAI) (2023 - 2026)
- Area Chair, British Machine Vision Conference (BMVC) (2022, 2024)
- Organizer, CVPR 2025 Workshop on Domain Generalization
- Organizer, ECCV 2024 Workshop on Green Foundation Models
- Organizer, CVPR 2024 Workshop on Prompting in Vision
- Organizer, CVPR 2023 Tutorial on Prompting in Vision
- Organizer, ICLR 2023 Workshop on What Do We Need for Successful Domain Generalization
- Organizer, The AI Talks