Lijun Wu is a Researcher at Shanghai AI Laboratory. Previously, he was a Research Scientist at ByteDance and a Senior Researcher at Microsoft Research. He received his Ph.D. from Sun Yat-sen University (SYSU), where he was a member of the joint Ph.D. program between SYSU and MSRA, advised by Dr. Tie-Yan Liu and Prof. Jianhuang Lai.

His research interests are AI/LLMs (e.g., data-centric intelligence, SFT/RL) and AI4Science (e.g., LLM4Science, scientific reasoning). His work has been published in top conferences and journals such as Nature Communications, Nature Machine Intelligence, TPAMI, NeurIPS, ICML, ICLR, ACL, and KDD, with more than 8,500 citations. He has served as AC/SPC for top conferences, e.g., ICLR, NeurIPS, ACL, EMNLP, NAACL, AAAI, and IJCAI.

He has received numerous prestigious awards, including the 2018 MSRA Ph.D. Fellowship. He secured 8 championships in the WMT2019 Competition. He led his team to develop the BioT5 series of multimodal biomolecular models, winning 1st and 2nd place in the ACL 2024 Language+Molecule Shared Task. In 2025, he guided students to secure 2nd place in the NeurIPS 2025 CURE-Bench Internal Reasoning Competition. Many of his research innovations have been translated into practical products. Notably, his R-Drop algorithm was deployed in Microsoft Translator across more than 20 translation tasks and is widely used in business scenarios at companies such as Meituan. His CT4Rec model was applied to Tencent News recommendation products. Furthermore, he participated in the development of the world's first Chinese-English translation system to achieve human parity in 2018.

📄 Download CV (PDF)

We are hiring AI researchers working on LLM/MLLM and AI4Science. Contact me if you are interested!

🔥 News

  • 2025.9 🎉 Caco is accepted by NeurIPS-2025; it aims to scale reasoning data through code-assisted verification.
  • 2025.8 🎉 3 papers are accepted by EMNLP-2025; topics cover math reasoning and advanced data synthesis. Check CFT, MetaLadder, Middo.
  • 2025.8 Invited to serve as Area Chair for ICLR-2026.
  • 2025.7 μFormer is accepted by Nature Machine Intelligence!
  • 2025.7 Invited to serve as Area Chair for NeurIPS-2025 workshop AI4Science and SEA.
  • 2025.7 Invited to serve as Area Chair for AAAI-2026.
  • 2025.6 CovDocker is accepted by KDD-2025.
  • 2025.5 6 papers are accepted by ACL-2025; topics cover math reasoning, data synthesis, and LLM benchmarks. Check Mathfusion, GRA, Lemma, CipherBank.
  • 2025.3 Invited to serve as Area Chair for NeurIPS-2025.
  • 2025.3 NatureLM, a large scientific foundation model, is released.

💻 Open-source Projects

  • OpenDataArena, a fair, open, and transparent arena for data value benchmarking.
  • InternVL, a series of leading VLM models developed by Shanghai AI Laboratory.

📃 Surveys/Repos

πŸ“ Selected Publications

⭐️ LLM/MLLMs

🔬 AI4Science

⌨️ AI

🎖 Honors and Awards

  • 2nd place in Internal Reasoning Track of CURE-Bench@NeurIPS2025, 2025
  • 1st place in Text2Molecule and 2nd place in Molecule2Text on Language+Molecule@ACL2024 shared task, 2024
  • Runner up of OGB-LSC @ KDD cup, 2021, Solution
  • Outstanding Graduate Awards of SYSU, 2020
  • Outstanding Reviewer of EMNLP, 2019
  • 1st Place of WMT 2019 in 5 translation directions: En->De, De->En, De->Fr, Fr->De and Ru->En, 2019
  • Microsoft Research Asia Ph.D. Fellowship, 2018
  • Graduate Student National Scholarship, 2018
  • Stars of Tomorrow Internship Award of Microsoft Research Asia, 2018
  • Outstanding Undergraduate Awards of SYSU, 2015
  • 1st Place of Global IBM/IEEE Smarter Planet Challenge, 2013
  • Undergraduate Student National Scholarship, 2012, 2013
  • First Class Scholarship of SYSU, 2012, 2013, 2014

📖 Experience

  • 2025.08-Now, Young Scientist, Shanghai Artificial Intelligence Laboratory
  • 2024.05-2024.08, Research Scientist, ByteDance
  • 2022.07-2024.05, Senior Researcher, MSR AI4Science
  • 2020.06-2022.07, Senior Researcher, MSRA
  • 2014.07-2020.06, Research Intern, MSRA

💬 Academic Services

  • AC: ICLR-26, NeurIPS-25, ACL-21/22/23/24/25, EMNLP-23/24/25, NAACL-22/23/24/25, EACL-24, COLING-23, ARR-21/22/23/24/25
  • SPC: AAAI-22/23/24/25/26, IJCAI-21
  • Conference reviewers: ICLR, ICML, NeurIPS, AAAI, IJCAI, ACL, CVPR, EMNLP, KDD, NAACL, COLING, EACL, AACL
  • Journal reviewers: TPAMI, TASLP, KBS, Neurocomputing, CSL