


Xuanjing Huang 0001
Person information
- affiliation: Fudan University, School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai, China
2020 – today
- 2025
[j37]Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Qi Zhang, Tao Gui:
The rise and potential of large language model based agents: a survey. Sci. China Inf. Sci. 68(2) (2025)
[j36]Xuanjing Huang, Shihan Dou, Zhangyue Yin:
The dual-edged sword: artificial intelligence's evolving role in academic peer review. Sci. China Inf. Sci. 68(11) (2025)
[j35]Changze Lv, Tianlong Li, Wenhao Liu, Yufei Gu, Jianhan Xu, Cenyuan Zhang, Muling Wu, Xiaoqing Zheng, Xuanjing Huang:
SpikeCLIP: A contrastive language-image pretrained spiking neural network. Neural Networks 188: 107475 (2025)
[j34]Yuxin Wang, Xiannian Hu, Quan Gan, Xuanjing Huang, Xipeng Qiu, David Wipf:
Efficient Link Prediction via GNN Layers Induced by Negative Sampling. IEEE Trans. Knowl. Data Eng. 37(1): 253-264 (2025)
[c395]Shihan Dou, Yan Liu, Enyu Zhou, Songyang Gao, Tianlong Li, Limao Xiong, Xin Zhao, Haoxiang Jia, Junjie Ye, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning. AAAI 2025: 23805-23813
[c394]Jianing He, Qi Zhang, Hongyun Zhang, Xuanjing Huang, Usman Naseem, Duoqian Miao:
COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism. AAAI 2025: 24023-24031
[c393]Shengbin Yue, Siyuan Wang, Wei Chen, Xuanjing Huang, Zhongyu Wei:
Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks. AAAI 2025: 25796-25804
[c392]Wei Tang, Yixin Cao, Yang Deng, Jiahao Ying, Bo Wang, Yizhe Yang, Yuyue Zhao, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang, Yong Liao:
EvoWiki: Evaluating LLMs on Evolving Knowledge. ACL (1) 2025: 948-964
[c391]Ming Zhang, Yuhui Wang, Yujiong Shen, Tingyi Yang, Changhao Jiang, Yilong Wu, Shihan Dou, Qinhao Chen, Zhiheng Xi, Zhihao Zhang, Yi Dong, Zhen Wang, Zhihui Fei, Mingyang Wan, Tao Liang, Guojun Ma, Qi Zhang, Tao Gui, Xuanjing Huang:
PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts. ACL (Findings) 2025: 2626-2649
[c390]Junjie Ye, Zhengyin Du, Xuesong Yao, Weijian Lin, Yufei Xu, Zehui Chen, Zaiyuan Wang, Sining Zhu, Zhiheng Xi, Siyu Yuan, Tao Gui, Qi Zhang, Xuanjing Huang, Jiecao Chen:
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use. ACL (1) 2025: 2995-3021
[c389]Zhangyue Yin, Qiushi Sun, Zhiyuan Zeng, Qinyuan Cheng, Xipeng Qiu, Xuanjing Huang:
Dynamic and Generalizable Process Reward Modeling. ACL (1) 2025: 4203-4233
[c388]Shihan Dou, Jiayi Chen, Chenhao Huang, Feng Chen, Wei Chengzhi, Huiyuan Zheng, Shichun Liu, Yan Liu, Chenxiao Liu, Chao Xin, Lin Yan, Zongzhang Zhang, Tao Gui, Qi Zhang, Xuanjing Huang:
Lost in the Context: Insufficient and Distracted Attention to Contexts in Preference Modeling. ACL (1) 2025: 5710-5728
[c387]Wenhao Liu, Siyu An, Junru Lu, Muling Wu, Tianlong Li, Xiaohua Wang, Changze Lv, Xiaoqing Zheng, Di Yin, Xing Sun, Xuanjing Huang:
Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing. ACL (Findings) 2025: 5983-6005
[c386]Xuhao Hu, Dongrui Liu, Hao Li, Xuanjing Huang, Jing Shao:
VLSBench: Unveiling Visual Leakage in Multimodal Safety. ACL (1) 2025: 8285-8316
[c385]Kaimin Wang, Yuanzhe Shen, Changze Lv, Xiaoqing Zheng, Xuanjing Huang:
TripTailor: A Real-World Benchmark for Personalized Travel Planning. ACL (Findings) 2025: 9705-9723
[c384]Wenxiang Chen, Wei He, Zhiheng Xi, Honglin Guo, Boyang Hong, Jiazheng Zhang, Nijun Li, Tao Gui, Yun Li, Qi Zhang, Xuanjing Huang:
Better Process Supervision with Bi-directional Rewarding Signals. ACL (Findings) 2025: 14471-14485
[c383]Ruicheng Yin, Xuan Gao, Changze Lv, Xiaohua Wang, Xiaoqing Zheng, Xuanjing Huang:
Improving Continual Pre-training Through Seamless Data Packing. ACL (Findings) 2025: 15014-15032
[c382]Yuming Yang, Yang Nan, Junjie Ye, Shihan Dou, Xiao Wang, Shuo Li, Huijie Lv, Tao Gui, Qi Zhang, Xuanjing Huang:
Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric. ACL (1) 2025: 18530-18549
[c381]Shujun Liu, Xiaoyu Shen, Yuhang Lai, Siyuan Wang, Shengbin Yue, Zengfeng Huang, Xuanjing Huang, Zhongyu Wei:
HAF-RM: A Hybrid Alignment Framework for Reward Model Training. ACL (1) 2025: 18874-18893
[c380]Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo, Junzhe Wang, Xin Guo, Dingwen Yang, Chenyang Liao, Wei He, Songyang Gao, Lu Chen, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
AgentGym: Evaluating and Training Large Language Model-based Agents across Diverse Environments. ACL (1) 2025: 27914-27961
[c379]Siyuan Wang, Dianyi Wang, Chengxing Zhou, Zejun Li, Zhihao Fan, Xuanjing Huang, Zhongyu Wei:
Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference. ACL (1) 2025: 30715-30727
[c378]Yuxin Wang, Botian Jiang, Yiran Guo, Quan Gan, David Wipf, Xuanjing Huang, Xipeng Qiu:
Prior-Fitted Networks Scale to Larger Datasets When Treated as Weak Learners. AISTATS 2025: 1090-1098
[c377]Xiawei Liu, Shiyue Yang, Xinnong Zhang, Haoyu Kuang, Libo Sun, Yihang Yang, Siming Chen, Xuanjing Huang, Zhongyu Wei:
AI-Press: A Multi-Agent News Generating and Feedback Simulation System Powered by Large Language Models. COLING (Demonstrations) 2025: 63-82
[c376]Junjie Ye, Guanyu Li, Songyang Gao, Caishuang Huang, Yilong Wu, Sixian Li, Xiaoran Fan, Shihan Dou, Tao Ji, Qi Zhang, Tao Gui, Xuanjing Huang:
ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios. COLING 2025: 156-187
[c375]Tianlong Li, Zhenghua Wang, Wenhao Liu, Muling Wu, Shihan Dou, Changze Lv, Xiaohua Wang, Xiaoqing Zheng, Xuanjing Huang:
Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective. COLING 2025: 3158-3178
[c374]Siyuan Wang, Zhuohan Long, Zhihao Fan, Xuanjing Huang, Zhongyu Wei:
Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation. COLING 2025: 3310-3328
[c373]Yuming Yang, Wantong Zhao, Caishuang Huang, Junjie Ye, Xiao Wang, Huiyuan Zheng, Yang Nan, Yuran Wang, Xueying Xu, Kaixin Huang, Yunke Zhang, Tao Gui, Qi Zhang, Xuanjing Huang:
Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition. COLING 2025: 10902-10923
[c372]Yunfan Shao, Linyang Li, Yichuan Ma, Peiji Li, Demin Song, Qinyuan Cheng, Shimin Li, Xiaonan Li, Pengyu Wang, Qipeng Guo, Hang Yan, Xipeng Qiu, Xuanjing Huang, Dahua Lin:
Case2Code: Scalable Synthetic Data for Code Generation. COLING 2025: 11056-11069
[c371]Yongting Zhang, Lu Chen, Guodong Zheng, Yifeng Gao, Rui Zheng, Jinlan Fu, Zhenfei Yin, Senjie Jin, Yu Qiao, Xuanjing Huang, Feng Zhao, Tao Gui, Jing Shao:
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models. CVPR 2025: 19867-19878
[c370]Shuo Li, Tao Ji, Xiaoran Fan, Linsheng Lu, Leyi Yang, Yuming Yang, Zhiheng Xi, Rui Zheng, Yuran Wang, Xiaohui Zhao, Tao Gui, Qi Zhang, Xuanjing Huang:
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs. ICLR 2025
[c369]Enyu Zhou, Guodong Zheng, Binghai Wang, Zhiheng Xi, Shihan Dou, Rong Bao, Wei Shen, Limao Xiong, Jessica Fan, Yurong Mou, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
RMB: Comprehensively benchmarking reward models in LLM alignment. ICLR 2025
[c368]Changze Lv, Jingwen Xu, Yiyang Lu, Xiaohua Wang, Zhenghua Wang, Zhibo Xu, Di Yu, Xin Du, Xiaoqing Zheng, Xuanjing Huang:
Dendritic Localized Learning: Toward Biologically Plausible Algorithm. ICML 2025
[c367]Siyin Wang, Xingsong Ye, Qinyuan Cheng, Junwen Duan, Shimin Li, Jinlan Fu, Xipeng Qiu, Xuanjing Huang:
Safe Inputs but Unsafe Output: Benchmarking Cross-modality Safety Alignment of Large Vision-Language Models. NAACL (Findings) 2025: 3563-3605
[c366]Zejun Li, Ruipu Luo, Jiwen Zhang, Minghui Qiu, Xuanjing Huang, Zhongyu Wei:
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models. NAACL (Long Papers) 2025: 3769-3798
[c365]Zhijie Bao, Qingyun Liu, Xuanjing Huang, Zhongyu Wei:
SFMSS: Service Flow aware Medical Scenario Simulation for Conversational Data Generation. NAACL (Findings) 2025: 4586-4604
[c364]Xinyi Mou, Jingcong Liang, Jiayu Lin, Xinnong Zhang, Xiawei Liu, Shiyue Yang, Rong Ye, Lei Chen, Haoyu Kuang, Xuanjing Huang, Zhongyu Wei:
AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive Scenarios. NAACL (Long Papers) 2025: 4975-5001
[c363]Shengbin Yue, Ting Huang, Zheng Jia, Siyuan Wang, Shujun Liu, Yun Song, Xuanjing Huang, Zhongyu Wei:
Multi-Agent Simulator Drives Language Models for Legal Intensive Interaction. NAACL (Findings) 2025: 6537-6570
[c362]Yiwen Ding, Zhiheng Xi, Wei He, Lizhuoyuan Lizhuoyuan, Yitao Zhai, Shi Xiaowei, Xunliang Cai, Tao Gui, Qi Zhang, Xuanjing Huang:
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling. NAACL (Long Papers) 2025: 10627-10646
[i323]Junjie Ye, Zhengyin Du, Xuesong Yao, Weijian Lin, Yufei Xu, Zehui Chen, Zaiyuan Wang, Sining Zhu, Zhiheng Xi, Siyu Yuan, Tao Gui, Qi Zhang, Xuanjing Huang, Jiecao Chen:
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use. CoRR abs/2501.02506 (2025)
[i322]Changze Lv, Jingwen Xu, Yiyang Lu, Xiaohua Wang, Zhenghua Wang, Zhibo Xu, Di Yu, Xin Du, Xiaoqing Zheng, Xuanjing Huang:
Dendritic Localized Learning: Toward Biologically Plausible Algorithm. CoRR abs/2501.09976 (2025)
[i321]Ningyu Xu, Qi Zhang, Chao Du, Qiang Luo, Xipeng Qiu, Xuanjing Huang, Menghan Zhang:
Human-like conceptual representations emerge from language prediction. CoRR abs/2501.12547 (2025)
[i320]Yuhong Sun, Zhangyue Yin, Xuanjing Huang, Xipeng Qiu, Hui Zhao:
Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework. CoRR abs/2501.15581 (2025)
[i319]Changze Lv, Yansen Wang, Dongqi Han, Yifei Shen, Xiaoqing Zheng, Xuanjing Huang, Dongsheng Li:
Toward Relative Positional Encoding in Spiking Transformers. CoRR abs/2501.16745 (2025)
[i318]Changhao Jiang, Ming Zhang, Junjie Ye, Xiaoran Fan, Yifei Cao, Jiajun Sun, Zhiheng Xi, Shihan Dou, Yi Dong, Yujiong Shen, Jingqi Tong, Zhen Wang, Tao Liang, Zhihui Fei, Mingyang Wan, Guojun Ma, Qi Zhang, Tao Gui, Xuanjing Huang:
Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training. CoRR abs/2502.04066 (2025)
[i317]Shengbin Yue, Ting Huang, Zheng Jia, Siyuan Wang, Shujun Liu, Yun Song, Xuanjing Huang, Zhongyu Wei:
Multi-Agent Simulator Drives Language Models for Legal Intensive Interaction. CoRR abs/2502.06882 (2025)
[i316]Xin Zhou, Yiwen Guo, Ruotian Ma, Tao Gui, Qi Zhang, Xuanjing Huang:
Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models. CoRR abs/2502.08922 (2025)
[i315]Zhuohan Long, Siyuan Wang, Shujun Liu, Yuhang Lai, Xuanjing Huang, Zhongyu Wei:
How Jailbreak Defenses Work and Ensemble? A Mechanistic Investigation. CoRR abs/2502.14486 (2025)
[i314]Qin Zhu, Fei Huang, Runyu Peng, Keming Lu, Bowen Yu, Qinyuan Cheng, Xipeng Qiu, Xuanjing Huang, Junyang Lin:
AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models. CoRR abs/2502.16906 (2025)
[i313]Xiaoran Liu, Ruixiao Li, Mianqiu Huang, Zhigeng Liu, Yuerong Song, Qipeng Guo, Siyang He, Qiqi Wang, Linlin Li, Qun Liu, Yaqian Zhou, Xuanjing Huang, Xipeng Qiu:
Thus Spake Long-Context Large Language Model. CoRR abs/2502.17129 (2025)
[i312]Yuming Yang, Yang Nan, Junjie Ye, Shihan Dou, Xiao Wang, Shuo Li, Huijie Lv, Mingqi Wu, Tao Gui, Qi Zhang, Xuanjing Huang:
Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric. CoRR abs/2502.17184 (2025)
[i311]Lida Zhao, Shihan Dou, Yutao Hu, Yueming Wu, Jiahui Wu, Chengwei Liu, Lyuye Zhang, Yi Liu, Jun Sun, Xuanjing Huang, Yang Liu:
Detecting Essence Code Clones via Information Theoretic Analysis. CoRR abs/2502.19219 (2025)
[i310]Yuxin Wang, Botian Jiang, Yiran Guo, Quan Gan, David Wipf, Xuanjing Huang, Xipeng Qiu:
Prior-Fitted Networks Scale to Larger Datasets When Treated as Weak Learners. CoRR abs/2503.01256 (2025)
[i309]Yuhao Zhou, Sirui Song, Boyang Liu, Zhiheng Xi, Senjie Jin, Xiaoran Fan, Zhihao Zhang, Wei Li, Xuanjing Huang:
EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection. CoRR abs/2503.01586 (2025)
[i308]Zhenghua Wang, Yiran Ding, Changze Lv, Zhibo Xu, Tianlong Li, Tianyuan Shi, Xiaoqing Zheng, Xuanjing Huang:
Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling. CoRR abs/2503.04355 (2025)
[i307]Wenxiang Chen, Wei He, Zhiheng Xi, Honglin Guo, Boyang Hong, Jiazheng Zhang, Rui Zheng, Nijun Li, Tao Gui, Yun Li, Qi Zhang, Xuanjing Huang:
Better Process Supervision with Bi-directional Rewarding Signals. CoRR abs/2503.04618 (2025)
[i306]Yixin Wu, Feiran Zhang, Tianyuan Shi, Ruicheng Yin, Zhenghua Wang, Zhenliang Gan, Xiaohua Wang, Changze Lv, Xiaoqing Zheng, Xuanjing Huang:
Explainable Synthetic Image Detection through Diffusion Timestep Ensembling. CoRR abs/2503.06201 (2025)
[i305]Ming Zhang, Yuhui Wang, Yujiong Shen, Tingyi Yang, Changhao Jiang, Yilong Wu, Shihan Dou, Qinhao Chen, Zhiheng Xi, Zhihao Zhang, Yi Dong, Zhen Wang, Zhihui Fei, Mingyang Wan, Tao Liang, Guojun Ma, Qi Zhang, Tao Gui, Xuanjing Huang:
PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts. CoRR abs/2503.06706 (2025)
[i304]Shuo Li, Jiajun Sun, Guodong Zheng, Xiaoran Fan, Yujiong Shen, Yi Lu, Zhiheng Xi, Yuming Yang, Wenming Tan, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang:
Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations. CoRR abs/2503.14895 (2025)
[i303]Yuxin Wang, Yiran Guo, Yining Zheng, Zhangyue Yin, Shuo Chen, Jie Yang, Jiajun Chen, Xuanjing Huang, Xipeng Qiu:
FamilyTool: A Multi-hop Personalized Tool Use Benchmark. CoRR abs/2504.06766 (2025)
[i302]Yixin Cao, Jiahao Ying, Yaoning Wang, Xipeng Qiu, Xuanjing Huang, Yu-Gang Jiang:
Revisiting LLM Evaluation through Mechanism Interpretability: a New Metric and Model Utility Law. CoRR abs/2504.07440 (2025)
[i301]Xinnong Zhang, Jiayu Lin, Xinyi Mou, Shiyue Yang, Xiawei Liu, Libo Sun, Hanjia Lyu, Yihang Yang, Weihong Qi, Yue Chen, Guanying Li, Ling Yan, Yao Hu, Siming Chen, Yu Wang, Xuanjing Huang, Jiebo Luo, Shiping Tang, Libo Wu, Baohua Zhou, Zhongyu Wei:
SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users. CoRR abs/2504.10157 (2025)
[i300]Shihan Dou, Muling Wu, Jingwen Xu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
Improving RL Exploration for LLM Reasoning through Retrospective Replay. CoRR abs/2504.14363 (2025)
[i299]Yixin Cao, Shibo Hong, Xinze Li, Jiahao Ying, Yubo Ma, Haiyuan Liang, Yantao Liu, Zijun Yao, Xiaozhi Wang, Dan Huang, Wenxuan Zhang, Lifu Huang, Muhao Chen, Lei Hou, Qianru Sun, Xingjun Ma, Zuxuan Wu, Min-Yen Kan, David Lo, Qi Zhang, Heng Ji, Jing Jiang, Juanzi Li, Aixin Sun, Xuanjing Huang, Tat-Seng Chua, Yu-Gang Jiang:
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks. CoRR abs/2504.18838 (2025)
[i298]Yi Lu, Wanxu Zhao, Xin Zhou, Chenxin An, Chenglong Wang, Shuo Li, Yuming Yang, Jun Zhao, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang:
Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation. CoRR abs/2504.18857 (2025)
[i297]Xinyi Mou, Chen Qian, Wei Liu, Xuanjing Huang, Zhongyu Wei:
EcoLANG: Efficient and Effective Agent Communication Language Induction for Social Simulation. CoRR abs/2505.06904 (2025)
[i296]Junjie Ye, Caishuang Huang, Zhuohan Chen, Wenjie Fu, Chenyuan Yang, Leyi Yang, Yilong Wu, Peng Wang, Meng Zhou, Xiaolong Yang, Tao Gui, Qi Zhang, Zhongchao Shi, Jianping Fan, Xuanjing Huang:
A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models. CoRR abs/2505.07591 (2025)
[i295]Binghai Wang, Runji Lin, Keming Lu, Le Yu, Zhenru Zhang, Fei Huang, Chujie Zheng, Kai Dang, Yang Fan, Xingzhang Ren, An Yang, Binyuan Hui, Dayiheng Liu, Tao Gui, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang, Bowen Yu, Jingren Zhou, Junyang Lin:
WorldPM: Scaling Human Preference Modeling. CoRR abs/2505.10527 (2025)
[i294]Jingqi Tong, Jixin Tang, Hangcheng Li, Yurong Mou, Ming Zhang, Jun Zhao, Yanbo Wen, Fan Song, Jiahao Zhan, Yuyang Lu, Chaoran Tao, Zhiyuan Guo, Jizhou Yu, Tianhao Cheng, Changhao Jiang, Zhen Wang, Tao Liang, Zhihui Fei, Mingyang Wan, Guojun Ma, Weifeng Ge, Guanhua Chen, Tao Gui, Xipeng Qiu, Qi Zhang, Xuanjing Huang:
Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning. CoRR abs/2505.13886 (2025)
[i293]Jianxiang Zang, Meiling Ning, Yongda Wei, Shihan Dou, Jiazheng Zhang, Nijia Mo, Binhong Li, Tao Gui, Qi Zhang, Xuanjing Huang:
Compression Hacking: A Supplementary Perspective on Informatics Metric of Language Models from Geometric Distortion. CoRR abs/2505.17793 (2025)
[i292]Xu Zheng, Chenfei Liao, Yuqian Fu, Kaiyu Lei, Yuanhuiyi Lyu, Lutao Jiang, Bin Ren, Jialei Chen, Jiawen Wang, Chengxin Li, Linfeng Zhang, Danda Pani Paudel, Xuanjing Huang, Yu-Gang Jiang, Nicu Sebe, Dacheng Tao, Luc Van Gool, Xuming Hu:
MLLMs are Deeply Affected by Modality Bias. CoRR abs/2505.18657 (2025)
[i291]Wenhao Liu, Zhengkang Guo, Mingchen Xie, Jingwen Xu, Zisu Huang, Muzhao Tian, Jianhan Xu, Muling Wu, Xiaohua Wang, Changze Lv, He-Da Wang, Hu Yao, Xiaoqing Zheng, Xuanjing Huang:
RECAST: Strengthening LLMs' Complex Instruction Following with Constraint-Verifiable Data. CoRR abs/2505.19030 (2025)
[i290]Ruicheng Yin, Xuan Gao, Changze Lv, Xiaohua Wang, Xiaoqing Zheng, Xuanjing Huang:
Improving Continual Pre-training Through Seamless Data Packing. CoRR abs/2505.22018 (2025)
[i289]Shihan Dou, Ming Zhang, Chenhao Huang, Jiayi Chen, Feng Chen, Shichun Liu, Yan Liu, Chenxiao Liu, Cheng Zhong, Zongzhang Zhang, Tao Gui, Chao Xin, Wei Chengzhi, Lin Yan, Qi Zhang, Yonghui Wu, Xuanjing Huang:
EvaLearn: Quantifying the Learning Capability and Efficiency of LLMs via Sequential Problem Solving. CoRR abs/2506.02672 (2025)
[i288]Kaiyan Chang, Mingzhi Chen, Yunji Chen, Zhirong Chen, Dongrui Fan, Junfeng Gong, Nan Guo, Yinhe Han, Qinfen Hao, Shuo Hou, Xuanjing Huang, Pengwei Jin, Changxin Ke, Cangyuan Li, Guangli Li, Huawei Li, Kuan Li, Naipeng Li, Shengwen Liang, Cheng Liu, Hongwei Liu, Jiahua Liu, Junliang Lv, Jianan Mu, Jin Qin, Bin Sun, Chenxi Wang, Duo Wang, Mingjun Wang, Ying Wang, Chenggang Wu, Peiyang Wu, Teng Wu, Xiao Xiao, Mengyao Xie, Chenwei Xiong, Ruiyuan Xu, Mingyu Yan, Xiaochun Ye, Kuai Yu, Rui Zhang, Shuoming Zhang, Jiacheng Zhao:
Large Processor Chip Model. CoRR abs/2506.02929 (2025)
[i287]Muling Wu, Qi Qian, Wenhao Liu, Xiaohua Wang, Zisu Huang, Di Liang, LI Miao, Shihan Dou, Changze Lv, Zhenghua Wang, Zhibo Xu, Lina Chen, Tianlong Li, Xiaoqing Zheng, Xuanjing Huang:
Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning. CoRR abs/2506.04065 (2025)
[i286]Ming Zhang, Yujiong Shen, Zelin Li, Huayu Sha, Binze Hu, Yuhui Wang, Chenhao Huang, Shichun Liu, Jingqi Tong, Changhao Jiang, Mingxu Chai, Zhiheng Xi, Shihan Dou, Tao Gui, Qi Zhang, Xuanjing Huang:
LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation. CoRR abs/2506.04078 (2025)
[i285]Tianjiao Li, Mengran Yu, Chenyu Shi, Yanjun Zhao, Xiaojing Liu, Qiang Zhang, Qi Zhang, Xuanjing Huang, Jiayin Wang:
RIVAL: Reinforcement Learning with Iterative and Adversarial Optimization for Machine Translation. CoRR abs/2506.05070 (2025)
[i284]Yu Li, Xingyu Qiu, Yuqian Fu, Jie Chen, Tianwen Qian, Xu Zheng, Danda Pani Paudel, Yanwei Fu, Xuanjing Huang, Luc Van Gool, Yu-Gang Jiang:
Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection. CoRR abs/2506.05872 (2025)
[i283]Xiaoran Fan, Zhichao Sun, Yangfan Gao, Jingfei Xiong, Hang Yan, Yifei Cao, Jiajun Sun, Shuo Li, Zhihao Zhang, Zhiheng Xi, Yuhao Zhou, Senjie Jin, Changhao Jiang, Junjie Ye, Ming Zhang, Rui Zheng, Zhenhua Han, Yunke Zhang, Demei Yan, Shaokang Dong, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang:
Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction. CoRR abs/2506.12537 (2025)
[i282]Zhihao Zhang, Qiaole Dong, Qi Zhang, Jun Zhao, Enyu Zhou, Zhiheng Xi, Senjie Jin, Xiaoran Fan, Yuhao Zhou, Yanwei Fu, Tao Ji, Tao Gui, Xuanjing Huang:
Reinforcement Fine-Tuning Enables MLLMs Learning Novel Tasks Stably. CoRR abs/2506.23508 (2025)
[i281]Zhiheng Xi, Guanyu Li, Yutao Fan, Honglin Guo, Yufang Liu, Xiaoran Fan, Jiaqi Liu, Jingchao Ding, Wangmeng Zuo, Zhenfei Yin, Lei Bai, Tao Ji, Tao Gui, Qi Zhang, Philip Torr, Xuanjing Huang:
BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset. CoRR abs/2507.03483 (2025)
[i280]Shihan Dou, Shichun Liu, Yuming Yang, Yicheng Zou, Yunhua Zhou, Shuhao Xing, Chenhao Huang, Qiming Ge, Demin Song, Haijun Lv, Songyang Gao, Chengqi Lv, Enyu Zhou, Honglin Guo, Zhiheng Xi, Wenwei Zhang, Qipeng Guo, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Tao Gui, Kai Chen:
Pre-Trained Policy Discriminators are General Reward Models. CoRR abs/2507.05197 (2025)
[i279]Zhangyue Yin, Qiushi Sun, Zhiyuan Zeng, Qinyuan Cheng, Xipeng Qiu, Xuanjing Huang:
Dynamic and Generalizable Process Reward Modeling. CoRR abs/2507.17849 (2025)
[i278]Yuanzhe Shen, Kaimin Wang, Changze Lv, Xiaoqing Zheng, Xuanjing Huang:
TripTailor: A Real-World Benchmark for Personalized Travel Planning. CoRR abs/2508.01432 (2025)
[i277]Changhao Jiang, Jiajun Sun, Yifei Cao, Jiabao Zhuang, Hui Li, Xiaoran Fan, Ming Zhang, Junjie Ye, Shihan Dou, Zhiheng Xi, Jingqi Tong, Yilong Wu, Baoyu Fan, Zhen Wang, Tao Liang, Zhihui Fei, Mingyang Wan, Guojun Ma, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang:
SpeechRole: A Large-Scale Dataset and Benchmark for Evaluating Speech Role-Playing Agents. CoRR abs/2508.02013 (2025)
[i276]Liujian Tang, Shaokang Dong, Yijia Huang, Minqi Xiang, Hongtao Ruan, Bin Wang, Shuo Li, Zhiheng Xi, Zhihui Cao, Hailiang Pang, Heng Kong, He Yang, Mingxu Chai, Zhilin Gao, Xingyu Liu, Yingnan Fu, Jiaming Liu, Xuanjing Huang, Yu-Gang Jiang, Tao Gui, Qi Zhang, Kang Wang, Yunke Zhang, Yuran Wang:
MagicGUI: A Foundational Mobile GUI Agent with Scalable Data Pipeline and Reinforcement Fine-tuning. CoRR abs/2508.03700 (2025)
[i275]Ming Zhang, Yujiong Shen, Jingyi Deng, Yuhui Wang, Yue Zhang, Junzhe Wang, Shichun Liu, Shihan Dou, Huayu Sha, Qiyuan Peng, Changhao Jiang, Jingqi Tong, Yilong Wu, Zhihao Zhang, Mingqi Wu, Zhiheng Xi, Mingxu Chai, Tao Liang, Zhihui Fei, Zhen Wang, Mingyang Wan, Guojun Ma, Tao Gui, Qi Zhang, Xuanjing Huang:
LLMEval-3: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models. CoRR abs/2508.05452 (2025)
[i274]Junjie Ye, Changhao Jiang, Zhengyin Du, Yufei Xu, Xuesong Yao, Zhiheng Xi, Xiaoran Fan, Qi Zhang, Xuanjing Huang, Jiecao Chen:
Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments. CoRR abs/2508.08791 (2025)
[i273]Dianyi Wang, Siyuan Wang, Zejun Li, Yikun Wang, Yitong Li, Duyu Tang, Xiaoyu Shen, Xuanjing Huang, Zhongyu Wei:
MoIIE: Mixture of Intra- and Inter-Modality Experts for Large Vision Language Models. CoRR abs/2508.09779 (2025)
[i272]Zhibo Xu, Jianhao Zhu, Jingwen Xu, Changze Lv, Zisu Huang, Xiaohua Wang, Muling Wu, Qi Qian, Xiaoqing Zheng, Xuanjing Huang:
Enhancing Model Privacy in Federated Learning with Random Masking and Quantization. CoRR abs/2508.18911 (2025)
[i271]Yuanzhe Shen, Zisu Huang, Zhengkang Guo, Yide Liu, Guanxu Chen, Ruicheng Yin, Xiaoqing Zheng, Xuanjing Huang:
IntentionReasoner: Facilitating Adaptive LLM Safeguards through Intent Reasoning and Selective Query Refinement. CoRR abs/2508.20151 (2025)
[i270]Jie Yang, Jiajun Chen, Zhangyue Yin, Shuo Chen, Yuxin Wang, Yiran Guo, Yuan Li, Yining Zheng, Xuanjing Huang, Xipeng Qiu:
VehicleWorld: A Highly Integrated Multi-Device Environment for Intelligent Vehicle Interaction. CoRR abs/2509.06736 (2025)
[i269]Zhiheng Xi, Jixuan Huang, Chenyang Liao, Baodai Huang, Honglin Guo, Jiaqi Liu, Rui Zheng, Junjie Ye, Jiazheng Zhang, Wenxiang Chen, Wei He, Yiwen Ding, Guanyu Li, Zehui Chen, Zhengyin Du, Xuesong Yao, Yufei Xu, Jiecao Chen, Tao Gui, Zuxuan Wu, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang:
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning. CoRR abs/2509.08755 (2025)
[i268]Junjie Ye, Yuming Yang, Yang Nan, Shuo Li, Qi Zhang, Tao Gui, Xuanjing Huang, Peng Wang, Zhongchao Shi, Jianping Fan:
Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels. CoRR abs/2509.16596 (2025)
[i267]Changze Lv, Yifei Wang, Yanxun Zhang, Yiyang Lu, Jingwen Xu, Di Yu, Xin Du, Xuanjing Huang, Xiaoqing Zheng:
Biologically Plausible Learning via Bidirectional Spike-Based Distillation. CoRR abs/2509.20284 (2025)
[i266]Boyang Liu, Yifan Hu, Senjie Jin, Shihan Dou, Gonglei Shi, Jie Shao, Tao Gui, Xuanjing Huang:
Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization. CoRR abs/2509.21871 (2025)
[i265]Hui Li, Changhao Jiang, Hongyu Wang, Ming Zhang, Jiajun Sun, Zhixiong Yang, Yifei Cao, Shihan Dou, Xiaoran Fan, Baoyu Fan, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang:
MDAR: A Multi-scene Dynamic Audio Reasoning Benchmark. CoRR abs/2509.22461 (2025)
[i264]Xingjian Zhao, Zhe Xu, Qinyuan Cheng, Zhaoye Fei, Luozhijie Jin, Yang Wang, Hanfu Chen, Yaozhou Jiang, Qinghui Gao, Ke Chen, Ruixiao Li, Mingshu Chen, Ruiming Wang, Wenbo Zhang, Yiyang Zhang, Donghua Yu, Yang Gao, Xiaogui Yang, Yitian Gong, Yuanfan Xu, Yaqian Zhou, Xuanjing Huang, Xipeng Qiu:
MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance. CoRR abs/2510.00499 (2025)
[i263]Yifei Cao, Changhao Jiang, Jiabao Zhuang, Jiajun Sun, Ming Zhang, Zhiheng Xi, Hui Li, Shihan Dou, Yuran Wang, Yunke Zhang, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang:
From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling. CoRR abs/2510.00743 (2025)
[i262]Yuanzhe Shen, Yide Liu, Zisu Huang, Ruicheng Yin, Xiaoqing Zheng, Xuanjing Huang:
SATER: A Self-Aware and Token-Efficient Approach to Routing and Cascading. CoRR abs/2510.05164 (2025)
[i261]Zhangyue Yin, Qiushi Sun, Zhiyuan Zeng, Zhiyuan Yu, Qipeng Guo, Xuanjing Huang, Xipeng Qiu:
ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models. CoRR abs/2510.06014 (2025)
[i260]Deheng Zhang, Yuqian Fu, Runyi Yang, Yang Miao, Tianwen Qian, Xu Zheng, Guolei Sun, Ajad Chhatkuli, Xuanjing Huang, Yu-Gang Jiang, Luc Van Gool, Danda Pani Paudel:
EgoNight: Towards Egocentric Vision Understanding at Night with a Challenging Benchmark. CoRR abs/2510.06218 (2025)
[i259]Yi Lu, Jianing Wang, Linsen Guo, Wei He, Hongyin Tang, Tao Gui, Xuanjing Huang, Xuezhi Cao, Wei Wang, Xunliang Cai:
R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth? CoRR abs/2510.08189 (2025)
[i258]Xuhao Hu, Peng Wang, Xiaoya Lu, Dongrui Liu, Xuanjing Huang, Jing Shao:
LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions. CoRR abs/2510.08211 (2025)
[i257]Zhiheng Xi, Xin Guo, Yang Nan, Enyu Zhou, Junrui Shen, Wenxiang Chen, Jiaqi Liu, Jixuan Huang, Zhihao Zhang, Honglin Guo, Xun Deng, Zhikai Lei, Miao Zheng, Guoteng Wang, Shuo Zhang, Peng Sun, Rui Zheng, Hang Yan, Tao Gui, Qi Zhang, Xuanjing Huang:
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping. CoRR abs/2510.18927 (2025)
[i256]Zhiheng Xi, Jixuan Huang, Xin Guo, Boyang Hong, Dingwen Yang, Xiaoran Fan, Shuo Li, Zehui Chen, Junjie Ye, Siyu Yuan, Zhengyin Du, Xuesong Yao, Yufei Xu, Jiecao Chen, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning. CoRR abs/2510.24320 (2025)
[i255]Senjie Jin, Lu Chen, Zhiheng Xi, Yuhui Wang, Sirui Song, Yuhao Zhou, Xinbo Zhang, Peng Sun, Hong Lu, Tao Gui, Qi Zhang, Xuanjing Huang:
Parrot: A Training Pipeline Enhances Both Program CoT and Natural Language CoT for Reasoning. CoRR abs/2510.25310 (2025)
[i254]Xin Guo, Zhiheng Xi, Yiwen Ding, Yitao Zhai, Xiaowei Shi, Xunliang Cai, Tao Gui, Qi Zhang, Xuanjing Huang:
Counteracting Matthew Effect in Self-Improvement of LVLMs through Head-Tail Re-balancing. CoRR abs/2510.26474 (2025)
[i253]Jingqi Tong, Yurong Mou, Hangcheng Li, Mingzhe Li, Yongzhuo Yang, Ming Zhang, Qiguang Chen, Tianyi Liang, Xiaomeng Hu, Yining Zheng, Xinchi Chen, Jun Zhao, Xuanjing Huang, Xipeng Qiu:
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm. CoRR abs/2511.04570 (2025)
[i252]Zhiheng Xi, Chenyang Liao, Guanyu Li, Yajie Yang, Wenxiang Chen, Zhihao Zhang, Binghai Wang, Senjie Jin, Yuhao Zhou, Jian Guan, Wei Wu, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang:
AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress. CoRR abs/2511.08325 (2025)
[i251]Yuxuan Cai, Lu Chen, Qiaoling Chen, Yuyang Ding, Liwen Fan, Wenjie Fu, Yufei Gao, Honglin Guo, Pinxue Guo, Zhenhua Han, Zhengfu He, Hanglei Hu, Kai Hu, Shengjia Hua, Tianyu Huai, Baodai Huang, Li Ji, Zhen Jiang, Zhikai Lei, Bufan Li, Jiahang Lin, Lizhi Lin, Jinxiu Liu, Shichun Liu, Ziming Liu, Yuchen Ni, Pengfang Qian, Yujiong Shen, Qingyun Shi, Wentao Shu, Peng Sun, Yiran Suo, Tian Tang, Boyu Tian, Guoteng Wang, Junzhe Wang, Peixin Wang, Zhiheng Xi, Hang Yan, Jie Yang, Zhixiong Yang, Tianchu Yao, Guangze Ye, Qianxi Yu, Shuo Zhang, Xinyue Zhang, Yiqi Zhang, Jiarong Zhao, Miao Zheng, Rui Zheng, Enyu Zhou, Jiazheng Zhou, Maosen Zhou, Yuhao Zhou, Tao Gui, Yining Zheng, Xinchi Chen, Jie Zhou, Siyuan Feng, Qin Chen, Liang He, Qi Zhang, Xuanjing Huang, Xipeng Qiu:
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction. CoRR abs/2512.04987 (2025)
[i250]Shuo Li, Jiajun Sun, Zhihao Zhang, Xiaoran Fan, Senjie Jin, Hui Li, Yuming Yang, Junjie Ye, Lixing Shen, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang:
The Role of Entropy in Visual Grounding: Analysis and Optimization. CoRR abs/2512.06726 (2025)
[i249]Yuyang Hu, Shichun Liu, Yanwei Yue, Guibin Zhang, Boyang Liu, Fangyi Zhu, Jiahang Lin, Honglin Guo, Shihan Dou, Zhiheng Xi, Senjie Jin, Jiejun Tan, Yanbin Yin, Jiongnan Liu, Zeyu Zhang, Zhongxiang Sun, Yutao Zhu, Hao Sun, Boci Peng, Zhenrong Cheng, Xuanbo Fan, Jiaxin Guo, Xinlei Yu, Zhenhong Zhou, Zewen Hu, Jiahao Huo, Junhao Wang, Yuwei Niu, Yu Wang, Zhenfei Yin, Xiaobin Hu, Yue Liao, Qiankun Li, Kun Wang, Wangchunshu Zhou, Yixin Liu, Dawei Cheng, Qi Zhang, Tao Gui, Shirui Pan, Yan Zhang, Philip Torr, Zhicheng Dou, Ji-Rong Wen, Xuanjing Huang, Yu-Gang Jiang, Shuicheng Yan:
Memory in the Age of AI Agents. CoRR abs/2512.13564 (2025)
[i248]Yuxin Wang, Shicheng Fang, Bo Wang, Qi Luo, Xuanjing Huang, Yining Zheng, Xipeng Qiu:
Multi-hop Reasoning via Early Knowledge Alignment. CoRR abs/2512.20144 (2025)
- 2024
[j33]Mianxin Liu, Weiguo Hu, Jinru Ding, Jie Xu, Xiaoyang Li, Lifeng Zhu, Zhian Bai, Xiaoming Shi, Benyou Wang, Haitao Song, Pengfei Liu, Xiaofan Zhang, Shanshan Wang

