Zinan Tang (唐撓ζ₯  in Chinese) is a senior undergraduate student majoring in Computer Science (CS) at the School of Computer Science (Pilot Software Engineering School, SCS), Beijing University of Posts and Telecommunication (BUPT), expecting his B.S. degree in 2026. He has been admitted to the Department of Automation (DA) at Tsinghua University (THU), where he will pursue his master’s degree in Big Data under the supervision of Dr. Biqing Huang. He currently is an intern of Ant Group.

His research interests are on Data-centric LLMs (e.g., AI4data, data4AI, SFT, post-training, pre-training, reasoning).

πŸ”₯ News

  • 2025.12: πŸ”₯πŸ”₯ The technical report of OpenDataArena is released!
  • 2025.09: πŸ”₯πŸ”₯ ScaleDiff is released on arXiv!
  • 2025.08: πŸŽ‰πŸŽ‰ Middo is accepted by EMNLP 2025 (Main). Thanks for all collaborators!
  • 2025.07: πŸ”₯πŸ”₯ REST is released on arXiv!
  • 2025.05: πŸŽ‰πŸŽ‰ MTRbench(fka. Big Escape Benchmark) is accepted by ACL 2025 (Workshop GEM$^2$). Thanks for all collaborators!

πŸ“ Publications

βœ‹ (Co) First-authored Publications

EMNLP 2025 (Main)
sym

Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Zinan Tang, Xin Gao, Zhuoshi Pan, Qizhi Pei, Mengzhang Cai, Jiang Wu, Conghui He, Lijun Wu

Project | | Hugging Face

ACL 2025 (Workshop GEM$^2$)
sym

MTRBench: A Multimodal Reasoning Benchmark from Reality Shows (fka. Big Escape Benchmark: Evaluating Human-Like Reasoning in Language Models via Real-World Escape Room Challenges)
Zinan Tang, QiYao Sun, Zhuoshi Pan, Qizhi Pei, Xin Gao, Mengyuan Sun, Honglin Lin, Mengzhang Cai, Yu Li, Chenlin Ming, Jiang Wu, Conghui He, Lijun Wu

πŸ“° Technical Reports

arXiv 2025
sym

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value
Mengzhang Cai, Xin Gao, Yu Li, Honglin Lin, Zheng Liu, Zhuoshi Pan, Qizhi Pei, Xiaoran Shang, Mengyuan Sun, Zinan Tang, Xiaoyang Wang, Zhanping Zhong, Yun Zhu, Dahua Lin, Conghui He, Lijun Wu

Project | | Hugging Face

🀝 Co-authored Publications

arXiv 2025
sym

ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning
Qizhi Pei, Zhuoshi Pan, Honglin Lin, Xin Gao, Yu Li, Zinan Tang, Conghui He, Rui Yan, Lijun Wu

Project | | Hugging Face

arXiv 2025
sym

REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once
Zhuoshi Pan, Qizhi Pei, Yu Li, Qiyao Sun, Zinan Tang, H. Vicky Zhao, Conghui He, Lijun Wu

Project |

ACL 2025 (Main)
sym

David’s Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis
Xin Gao, Qizhi Pei, Zinan Tang, Yu Li, Honglin Lin, Jiang Wu, Lijun Wu, Conghui He

Project | | Hugging Face

ACL 2025 (Findings)
sym

LEMMA: Learning from Errors for MatheMatical Advancement in LLMs
Zhuoshi Pan, Yu Li, Honglin Lin, Qizhi Pei, Zinan Tang, Wei Wu, Chenlin Ming, H. Vicky Zhao, Conghui He, Lijun Wu

Project | | Hugging Face

πŸŽ– Honors and Awards

  • 2025, National Scholarship, Ministry of Education, PRC
  • 2024, Second Prize Scholarship, BUPT
  • 2023, National Scholarship, Ministry of Education, PRC

πŸ“– Educations

  • 2026.09 - 2028.06 (Expected), master’s student in DA, THU, major in Big Data.
  • 2022.09 - 2026.06, undergraduate student in SCS, BUPT, major in CS.

πŸ’» Internships

πŸ”— Link Exchange

Honglin Lin、Qizhi Pei、Xiaoyang Wang、Yu Li、Zhuoshi Pan