Weiduo Yuan

I'm a 2nd-year master student at University of Southern California in Los Angeles. Before that, I received my bachelor degree at ShanghaiTech University majoring in computer science and technology.

I will graduate from the USC in December 2026 and am currently seeking PhD positions for Spring/Fall 2027.

profile photo

Research Interests

My research interests generally lie in the following areas:

  • Latent action pretraining
  • Vision-Language-Action models
  • World action models

I'm open to collaboration or discussion about these topics. Feel free to reach out to me via email!

Research Experiences

University of Southern California [Aug 2025 - Present]: Research Internship. Advisior: Professor Yue Wang.

University of California, Riverside [Jan 2025 - Aug 2025]: Research Internship. Advisior: Professor Hang Qiu.

News

Aug 2025: I've started my research internship at Physical Superintelligence(PSI) lab advised by Yue Wang!

Aug 2025: Our paper BEVCalib has been accepted by CoRL 2025!

Jan 2025: I've started my research internship at CISL in UC Riverside advised by Hang Qiu!

Jan 2025: I joined USC in 25 Spring as a CS master student!

Jun 2024: I got my banchelor degree at ShanghaiTech University!

May 2023: I finished one-semester visit in UC Berkeley!

Publications (* means equal contribution)

DreamPlan DreamPlan: Efficient Reinforcement Fine-Tuning of Vision-Language Planners via Video World Models
Emily Yue-Ting Jia*, Weiduo Yuan*, Tianheng Shi, Vitor Guizilini, Jiageng Mao, Yue Wang
Arxiv Preprint
project page / arxiv / paper / cite
@misc{jia2026dreamplanefficientreinforcementfinetuning,
      title={DreamPlan: Efficient Reinforcement Fine-Tuning of Vision-Language Planners via Video World Models}, 
      author={Emily Yue-Ting Jia and Weiduo Yuan and Tianheng Shi and Vitor Guizilini and Jiageng Mao and Yue Wang},
      year={2026},
      eprint={2603.16860},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2603.16860}, 
}

PSI-Zero Ψ₀: An Open Foundation Model Towards Universal Humanoid Loco-Manipulation
Psi-Zero team, Weiduo Yuan
Arxiv Preprint
project page / code / arxiv / paper / cite / GitHub stars
@misc{wei2026psi0,
  title={$\Psi_0$: An Open Foundation Model Towards Universal Humanoid Loco-Manipulation}, 
  author={Songlin Wei and Hongyi Jing and Boqian Li and Zhenyu Zhao and Jiageng Mao and Zhenhao Ni and Sicheng He and Jie Liu and Xiawei Liu and Kaidi Kang and Sheng Zang and Weiduo Yuan and Marco Pavone and Di Huang and Yue Wang},
  year={2026},
  eprint={2603.12263},
  archivePrefix={arXiv},
  primaryClass={cs.RO},
  url={https://arxiv.org/abs/2603.12263}, 
}

ICLR ICLR: In-Context Imitation Learning with Visual Reasoning
Toan Nguyen, Weiduo Yuan, Songlin Wei, Hui Li, Daniel Seita, Yue Wang
Arxiv Preprint
project page / arxiv / paper / cite
@misc{nguyen2026iclrincontextimitationlearning,
      title={ICLR: In-Context Imitation Learning with Visual Reasoning}, 
      author={Toan Nguyen and Weiduo Yuan and Songlin Wei and Hui Li and Daniel Seita and Yue Wang},
      year={2026},
      eprint={2603.07530},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2603.07530}, 
}

LRM Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models
Yanru Wu, Weiduo Yuan, Ang Qi, Vitor Guizilini, Jiageng Mao, Yue Wang
Arxiv Preprint
project page / arxiv / paper / cite
@misc{wu2026largerewardmodelsgeneralizable,
      title={Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models}, 
      author={Yanru Wu and Weiduo Yuan and Ang Qi and Vitor Guizilini and Jiageng Mao and Yue Wang},
      year={2026},
      eprint={2603.16065},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2603.16065}, 
}

The Earth Simulator The Earth Simulator: Street View World Modeling with 3D Gaussian Memory and Camera Control
Peilin Cai, Weiduo Yuan, Sicheng He, Cho-Ying Wu, David Paz, Hengyuan Zhang, Yuliang Guo, Xinyu Huang, Liu Ren, Jiageng Mao, Yue Wang
In submission
BEVCalib BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird's-Eye View Representations
Weiduo Yuan*, Jerry Li*, Justin Yue, Divyank Shah, Konstantinos Karydis, Hang Qiu
CoRL 2025
project page / code / arxiv / paper / cite / GitHub stars
@misc{yuan2025bevcaliblidarcameracalibrationgeometryguided,
      title={BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird's-Eye View Representations}, 
      author={Weiduo Yuan and Jerry Li and Justin Yue and Divyank Shah and Konstantinos Karydis and Hang Qiu},
      year={2025},
      eprint={2506.02587},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2506.02587}, 
}

Awards

Nov 2021: I got silver medal in the 46th International Collegiate Programming Contest (ICPC) Asia Jinan Regional Contest!

Apr 2021: I got gold medal(top 3%) in the 45th International Collegiate Programming Contest (ICPC) Asia Kunming Regional Contest!