Publications

Submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2026

CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning

Penghui Yang*, Long Xing*, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Yibin Wang, Yujie Zhou, Jiazi Bu, Jianze Liang, Qidong Huang, Jiaqi Wang, Feng Wu, and Dahua Lin

A unified RLVR framework for dense image and video captioning, where caption quality is optimized through verifiable downstream question-answering rewards.