Guanzhong He, Zhen Yang, Jinxin Liu, Bin Xu, Lei Hou, Juanzi Li: WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection. CoRR abs/2510.18798 (2025)