Dynamic Sliding Block (DSB) is a training-free block scheduling method.
DSB Cache is a training-free KV-cache scheme tailored to DSB for diffusion LLMs, further demonstrating the advantages of DSB.
- A better semi-autoregressive paradigm.
- DSB-tailored KV cache.
- A training-free, plug-and-play method that improves the quality–speed trade-off.
- Fast inference support for the Dream and LLaDA models.
- Full evaluation provided.
Dynamic Sliding Block (DSB) is a training-free decoding schedule for diffusion LLMs. Instead of using fixed blocks, it keeps an active block that slides forward and can change its size during inference. This lets the model decode easy/high-confidence tokens earlier (especially near block boundaries) and wait on low-confidence tokens until more context is available—improving the quality–speed trade-off.
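The scheduling idea above can be illustrated with a small sketch. This is a toy illustration, not the repo's actual implementation: the function name, the fixed confidence list, and the `threshold`/`max_block` parameters are all assumptions standing in for the per-step confidences a real diffusion LLM would produce.

```python
def dsb_schedule(confidences, threshold=0.7, max_block=4):
    """Return the order in which positions get decoded under a toy
    dynamic-sliding-block policy.

    `confidences` is a fixed per-position score list standing in for model
    confidence. Positions inside the active block that clear `threshold`
    are decoded early; the block slides forward once its left edge is done.
    """
    n = len(confidences)
    decoded = [False] * n
    order = []
    start = 0
    while start < n:
        # The active block can grow up to max_block, never past the end.
        end = min(start + max_block, n)
        block = range(start, end)
        # Decode every remaining position in the block whose confidence
        # clears the threshold (easy/high-confidence tokens first).
        picked = [i for i in block
                  if not decoded[i] and confidences[i] >= threshold]
        if not picked:
            # No confident token left: fall back to the single most
            # confident remaining position so decoding always progresses.
            picked = [max((i for i in block if not decoded[i]),
                          key=lambda i: confidences[i])]
        for i in picked:
            decoded[i] = True
            order.append(i)
        # Slide the block forward past the fully decoded prefix.
        while start < n and decoded[start]:
            start += 1
    return order
```

For example, `dsb_schedule([0.9, 0.2, 0.8, 0.95, 0.3], threshold=0.7, max_block=3)` returns `[0, 2, 3, 1, 4]`: the two confident tokens in the first block are decoded immediately, while the low-confidence token at position 1 is deferred until its neighbors provide context.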
DSB Cache is a training-free KV-cache design built for DSB. Sliding blocks can make newly exposed boundary tokens have unstable (transient) KV states, which hurts caching. To fix this, DSB Cache refreshes a small prefix window before the active block together with the block at every step, while caching the rest. It also does periodic global refreshes to keep the cache consistent—boosting throughput with minimal quality drop.
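The refresh policy described above can be sketched as a per-step mask over the sequence. This is a hypothetical illustration of the idea, not the repo's code: the function name and the `prefix_window`/`global_every` parameters are assumptions.

```python
def kv_refresh_mask(seq_len, block_start, block_end, step,
                    prefix_window=2, global_every=8):
    """Return a boolean mask per position: True = recompute KV this step,
    False = reuse the cached KV.

    Each step refreshes the active block [block_start, block_end) plus a
    small prefix window just before it, whose boundary tokens have the
    most transient KV states. Every `global_every` steps, a full global
    refresh keeps the rest of the cache consistent.
    """
    if step % global_every == 0:
        return [True] * seq_len  # periodic global refresh
    lo = max(0, block_start - prefix_window)
    return [lo <= i < block_end for i in range(seq_len)]
```

On non-refresh steps only a handful of positions are recomputed, which is where the throughput gain comes from; the periodic global refresh bounds how stale the cached states can get.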
```shell
pip install -r requirements.txt
pip install -r requirements-lock.txt
```

We provide the eval scripts for the main experiments; you can reproduce the results directly. For example:
```shell
cd llada
bash eval_instruct.sh
```

The main results were obtained on an NVIDIA H200 140G GPU. We evaluate two variants of DSB, DSB (const.) and DSB (greedy), demonstrating the consistent improvement of our method.
If this work helps your research, please consider citing it:
```bibtex
@misc{dsb,
      title={DSB: Dynamic Sliding Block Scheduling for Diffusion LLMs},
      author={Lizhuo Luo and Shenggui Li and Yonggang Wen and Tianwei Zhang},
      year={2026},
      eprint={2602.05992},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2602.05992},
}
```

We would like to thank the authors of LLaDA, Dream, and Fast-dLLM for their excellent work and open-source contributions.


