📌 ICML 2025 Oral (Top 1.0%)
MGD³ presents a novel approach to dataset distillation that leverages pre-trained diffusion models without any fine-tuning. The method improves the diversity and representativeness of synthetic datasets through a three-stage process:
- Mode Discovery: Identifies distinct data modes within each class.
- Mode Guidance: Steers the diffusion process toward the discovered modes.
- Stop Guidance: Transitions to unguided diffusion to prevent artifacts.
Together, these stages produce representative, diverse synthetic datasets suitable for training downstream models.
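To make the three stages concrete, here is a minimal sketch, assuming modes are found by k-means over per-class feature embeddings and that guidance is applied as a feature-space gradient. `denoise_step`, `extract_features`, and the constants `K_MODES`, `STOP_STEP`, and `GUIDANCE_SCALE` are hypothetical placeholders, not this repository's API:

```python
# Minimal sketch of the three-stage pipeline. All names here
# (extract_features, denoise_step, K_MODES, STOP_STEP, GUIDANCE_SCALE)
# are illustrative placeholders, not the repository's actual API.
import torch
from sklearn.cluster import KMeans

K_MODES = 4           # assumed number of modes discovered per class
STOP_STEP = 10        # assumed timestep below which guidance is switched off
GUIDANCE_SCALE = 2.0  # assumed strength of the mode-guidance term


def discover_modes(class_features: torch.Tensor) -> torch.Tensor:
    """Stage 1 -- Mode Discovery: cluster one class's feature embeddings
    and treat the cluster centroids as that class's modes."""
    kmeans = KMeans(n_clusters=K_MODES, n_init=10).fit(class_features.numpy())
    return torch.from_numpy(kmeans.cluster_centers_).float()


def sample_with_mode_guidance(denoise_step, extract_features, mode, num_steps=50):
    """Stages 2 and 3 -- Mode Guidance followed by Stop Guidance."""
    x = torch.randn(1, 3, 256, 256)  # start the reverse process from noise
    for t in reversed(range(num_steps)):
        if t > STOP_STEP:
            # Stage 2 -- Mode Guidance: nudge the sample toward the target
            # mode by descending a feature-space distance to its centroid.
            x_in = x.detach().requires_grad_(True)
            dist = (extract_features(x_in) - mode).pow(2).sum()
            grad = torch.autograd.grad(dist, x_in)[0]
            with torch.no_grad():
                x = denoise_step(x_in, t) - GUIDANCE_SCALE * grad
        else:
            # Stage 3 -- Stop Guidance: finish with unguided denoising so
            # the final steps do not accumulate guidance artifacts.
            with torch.no_grad():
                x = denoise_step(x, t)
    return x
```

The key design point is the switch at `STOP_STEP`: guidance shapes the sample toward its mode early in the reverse process, while the final unguided steps preserve image fidelity.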
For more details, visualizations, and supplementary materials, visit the Project Page.
- No Fine-Tuning Required: Uses pre-trained diffusion models directly (see the example after this list).
- Enhanced Diversity: Achieves superior intra-class diversity compared to existing methods.
- Scalability: Demonstrates effectiveness on large-scale datasets like ImageNet-1K.
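As an illustration of the no-fine-tuning point, an off-the-shelf class-conditional diffusion model can be loaded and sampled directly through the stock `diffusers` API. The checkpoint and sampling settings below are examples, not this repository's configuration:

```python
# Example of sampling a pre-trained, class-conditional diffusion model
# with no fine-tuning. The checkpoint and settings are illustrative
# choices, not this repository's configuration.
import torch
from diffusers import DiTPipeline

pipe = DiTPipeline.from_pretrained("facebook/DiT-XL-2-256", torch_dtype=torch.float16)
pipe = pipe.to("cuda")

# Map human-readable ImageNet class names to the checkpoint's label ids.
class_ids = pipe.get_label_ids(["golden retriever", "tench"])
images = pipe(class_labels=class_ids, num_inference_steps=50).images
images[0].save("sample.png")
```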
- Clone the repository:

```bash
git clone https://github.com/jachansantiago/mode_guidance.git
cd mode_guidance
```

- Set up the environment:

```bash
conda create -n modeguidance python=3.8
conda activate modeguidance
pip install -r requirements.txt
```

- For text-to-image distillation, install our modified diffusers library:

```bash
pip install -e diffusers
```

To run the code on the ImageNette dataset:

```bash
./scripts/nette.sh
```

This project builds upon the following repositories: