Generative Models for Computer Vision

4th Workshop on Generative Models
for Computer Vision

CVPR 2026

8:45am - 5:00pm, Thursday, June 4th, 2026 Room 205, Denver Convention Center, Denver, Colorado

Overview

Recent advances in generative modeling leveraging generative adversarial networks, auto-regressive models, neural fields and diffusion models have enabled the synthesis of near photorealistic images, drastically increasing the visibility and popularity of generative modeling across the computer vision research community. However, these impressive advances in generative modeling have not yet found wide adoption in computer vision for visual recognition tasks. In this workshop, we aim to bring together researchers from the fields of image synthesis and computer vision to facilitate discussions and progress at the intersection of those two subfields. We investigate the question: "How can visual recognition benefit from the advances in generative image modeling?". We invite a diverse set of experts to discuss their recent research results and future directions for generative modeling and computer vision, with a particular focus on the intersection between image synthesis and visual recognition. We hope this workshop will lay the foundation for future development of generative models for computer vision tasks.

Invited Speakers

Schedule

4th of June, 2026
8:45	Opening
9:00	Efstratios Gavves: Physical World Models & Agents: Generative Vision for Embodied Intelligence
9:40	Sherry Yang: Improving Physical Agents in a Generative World Model
10:20	Coffee Break
10:40	Yilun Du: Visual Scene Understanding through Inverse Generative Modeling
11:20	Chelsea Finn: Emergent Physical Generalization
12:00	Lunch
13:00	Posters
14:00	Hila Chefer: Is Scale All You Need? A Case for Native Generative-Representation Learning
14:40	Matthias Niessner: 3D Generative Models
15:20	Coffee Break
15:40	Alan Yuille: TBD
16:20	Georgia Gkioxari: Beyond Image and Language: Building 3D Perception Systems
17:00	Closing

Covered Topics

Submission site:

OpenReview

Author kit:

CVPR Author KIT

short papers

long papers

non-archival

Not

Advances in generative image models
Inversion of generative image models
Training computer vision with realistic synthetic images
Benchmarking computer vision with generative models
Analysis-by-synthesis / render-and-compare approaches for visual recognition
Self-supervised learning with generative models
Adversarial attacks and defenses with generative models
Out-of-distribution generalization and detection with generative models
Ethical considerations in generative modeling, dataset and model biases

Important Dates

Event	Date (Anywhere on Earth)
Workshop paper submission deadline	May 10, 2026
Decisions	May 20, 2026

Accepted Papers

Re-Depth Anything: Test-Time Depth Refinement via Self-Supervised Re-lighting [Paper]
Best PaperFreeOrbit4D: Training-Free Arbitrary Camera Redirection for Monocular Videos via Foreground-Complete 4D Reconstruction [Paper]
Selective Timestep Weighting and Advantage-Based Replay for Sample-Efficient Diffusion RLHF [Paper]
Learning from Semantic Dictionaries: Discriminative Codebook Contrastive Learning for Unified Visual Representation and Generation [Paper]
MatLat: Material Latent Space for PBR Texture Generation [Paper]
Scale Space Diffusion [Paper]
Novel View Extrapolation with Video Diffusion Priors [Paper]
Are Image Generators Zero-shot Perceivers? A Rigorous Evaluation [Paper]
Best PaperRISE: Single Static Radar-based Indoor Scene Understanding [Paper]
Less Is More: Training-Free Acceleration of Identity-Preserved Generation [Paper]
Guidance for Low-Level Perceptual Editing in Unconditional Diffusion Models [Paper]
RareCrafter: Controllable Generative Augmentation for Rare Object Detection in Driving Scenes [Paper]
Early Commitment Without Revision: Diagnosing Sampling Instability in Binary Masked Generative Models [Paper]
SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation [Paper]
Do Safety-Aligned Vision-Language Models Degrade Differently Under Common Image Corruptions? [Paper]
Do Synthetic Images Help Small-Scale Visual Classification? A Preliminary CIFAR-10 Study with Classical, Deep, and Generative Augmentations [Paper]
Breaking Spurious Correlations via Generative Randomization and Cross-Variant Self-Supervised Learning [Paper]
Post-Training Strains Diffusion Models: A Stability Analysis of Distillation, Robustness, and Unlearning [Paper]
Personalized Generative Models for Contextual Debiasing [Paper]
SIGMA-GEN: Structure and Identity Guided Multi-subject Assembly for Image Generation [Paper]
Beyond Flat Latent: Scale-Aware Disentanglement in Hierarchical Latent Spaces [Paper]
ATHENA: Adaptive Test-Time Steering for Improving Count Fidelity in Diffusion Models [Paper]
Generative Transformer for Auto Computer-aided Design [Paper]
$\text{PartConcepts}$: A Unified Mechanism for Fine-Grained Part Localization and Generation [Paper]
Anchored Generation: Controllable Scene Variation via 3D-Aware Object Latents [Paper]
Let’s Fix Step-by-Step: Iterative Refinement for Compositional Image Generation [Paper]
Factored Generative Models through Mechanism Diversity [Paper]
Best PaperA Simple and Effective Zero-Shot Framework for Dynamic 3D World Modeling [Paper]

Organizers

Overview

Invited Speakers

Chelsea Finn

Yilun Du

Efstratios Gavves

Sherry Yang

Georgia Gkioxari

Matthias Niessner

Hila Chefer

Alan Yuille

Schedule

Covered Topics

Important Dates

Accepted Papers

Organizers

Adam Kortylewski

Fangneng Zhan

Jieneng Chen

Christian Theobalt

Alan Yuille