Shikai Qiu 裘释凯
I’m a fourth-year PhD student in Computer Science at NYU Courant, working with Andrew Gordon Wilson. I’m interested in the science of scaling neural networks. You can reach me at [email protected].
I’m supported by the Two Sigma PhD Fellowship. Previously, I was a Student Researcher at Google Research with Nikunj Saunshi and Elan Rosenfeld (2025), working on looped transformers for LLM reasoning, and at Google DeepMind with Jeffrey Pennington and Atish Agarwala (2024), where our work on scaling collapse received an ICML Oral. I also interned at Amazon AWS (2023) and Meta AI (2022).
I studied Physics and Computer Science at UC Berkeley, where I worked with Jennifer Listgarten on equivariant neural networks for drug discovery, and Haichen Wang and Ben Nachman on deep learning for high energy physics and search for physics beyond the Standard Model via the ATLAS Collaboration at CERN, which received the 2025 Breakthrough Prize in Fundamental Physics.
Selected Publications
* Equal contribution
- Advances in Neural Information Processing Systems (NeurIPS) 2025
- International Conference on Machine Learning (ICML) 2024 (Oral)
- International Conference on Machine Learning (ICML) 2024
- Advances in Neural Information Processing Systems 2023
- International Conference on Machine Learning (ICML) 2023
- The European Physical Journal C 2023