News
Education
Seoul National University (SNU)
Sep. 2023 - Present, CSE, Integrated M.S. and Ph.D. (Adviser: Jaesik Park)
Pohang University of Science and Technology (POSTECH)
Feb. 2019 - Feb. 2023, CSE, B.S. (Summa Cum Laude)
Experience
Carnegie Mellon University (CMU)
Oct. 2025 - Jan. 2026, Robotics Institute, Visiting Researcher (Host: Jun-Yan Zhu)
Adobe Research
June 2025 - Sep. 2025, Research Scientist Intern
Working with Xun Huang, Zhengqi Li, Richard Zhang, Jun-Yan Zhu, and Eli Shechtman on fast and interactive video generative models
Publications
MotionStream: Real-Time Video Generation with Interactive Motion Controls
arXiv 2025 - [Paper | Project Page | Code | ]
Joonghyuk Shin, Zhengqi Li, Richard Zhang, Jun-Yan Zhu, Jaesik Park, Eli Shechtman, Xun Huang
MotionStream is a streaming (causal, real-time, and long-duration) video generation system with motion controls, operating at ~30 FPS on a single H100 GPU, unlocking new possibilities for interactive content generation.
JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching
arXiv 2025 - [Paper | Project Page | ]
Mingi Kwon*, Joonghyuk Shin*, Jaeseok Jung, Jaesik Park†, Youngjung Uh†
We present a unified framework that jointly generates synchronized facial motion and speech using flow matching and MM-DiT, enabling diverse audio-visual synthesis tasks within a single model.
Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing
ICCV 2025 - [Paper | Project Page | Code | ]
Joonghyuk Shin, Alchan Hwang, Yujin Kim, Daneul Kim, Jaesik Park
We perform a systematic analysis of MM-DiT's bidirectional attention mechanism and introduce a robust prompt-based editing method working across diverse MM-DiT models (SD3 series and Flux).
InstantDrag: Improving Interactivity in Drag-based Image Editing
SIGGRAPH Asia 2024 - [Paper | Project Page | Code (230+) | ]
Joonghyuk Shin, Daehyeon Choi, Jaesik Park
We present InstantDrag, an optimization-free pipeline for fast, interactive drag-based image editing that requires only an image and drag instruction as input, learning from real-world video datasets.
Fill-Up: Balancing Long-Tailed Data with Generative Models
arXiv 2023 - [Paper | Project Page | ]
Joonghyuk Shin, Minguk Kang, Jaesik Park
We propose a two-stage method for long-tailed (LT) recognition using textual-inverted tokens to synthesize images, achieving SOTA results on standard benchmarks when trained from scratch.
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis
TPAMI 2023 - [Paper | Code (3500+) | ]
Minguk Kang, Joonghyuk Shin, Jaesik Park
We present StudioGAN, a comprehensive library for GANs that reproduces over 30 popular models, providing extensive benchmarks and a fair evaluation protocol for image synthesis tasks.
Personal
I am a big fan of baseball. I played for POSTECH baseball team (Tachyons) for 5 years, as a captain and a catcher.I love animals. I live with a dog named Poby. I also like Pokemon, travelling, and FIFA video games.
Last updated on Nov, 2025 · with Face Looker