Mengyu Yang, Yanming Yang, Chenyi Xu, Chenxi Song, Yufan Zuo, Tong Zhao, Ruibo Li, Chi Zhang
AGI Lab,Westlake University
Diffusion models have achieved impressive generative quality across modalities like 2D images, videos, and 3D shapes, but their inference remains computationally expensive due to the iterative denoising process. While recent caching-based methods effectively reuse redundant computations to speed up 2D and video generation, directly applying these techniques to 3D diffusion models can severely disrupt geometric consistency. In 3D synthesis, even minor numerical errors in cached latent features accumulate, causing structural artifacts and topological inconsistencies. To overcome this limitation, we propose Fast3Dcache, a training-free geometry-aware caching framework that accelerates 3D diffusion inference while preserving geometric fidelity. Our method introduces a Predictive Caching Scheduler Constraint (PCSC) to dynamically determine cache quotas according to voxel stabilization patterns and a Spatiotemporal Stability Criterion (SSC) to select stable features for reuse based on velocity magnitude and acceleration criterion. Comprehensive experiments show that Fast3Dcache accelerates inference significantly, achieving up to a 27.12% speed-up and a 54.8% reduction in FLOPs, with minimal degradation in geometric quality as measured by Chamfer Distance (2.48%) and F-Score (1.95%).
Our approach is motivated by the observation of a Three-Phase Stabilization Pattern in voxel occupancy during the denoising process.
Instead of a fixed caching ratio, PCSC dynamically adjusts the caching budget over timesteps. It leverages the log-linear decay pattern of dynamic voxels to predict how many tokens can be safely cached at each step without harming the geometry structure.
To determine which specific tokens to cache, SSC evaluates voxel stability from two perspectives:
- Velocity Magnitude: Reflects the intensity of feature updates.
- Acceleration Criterion: Quantify the potential error incurred by approximating the current velocity with the previous step.
By jointly considering these metrics, SSC identifies regions that have converged and can be safely reused.
To set up the environment, please follow the steps below:
# 1. Clone the repository
git clone https://github.com/Westlake-AGILab/Fast3Dcache.git
cd Fast3Dcache
# 2. Create a virtual environment (ref: TRELLIS)
. ./setup.sh --new-env --basic --xformers --flash-attn --diffoctreerast --spconv --mipgaussian --kaolin --nvdiffrast# Vanilla TRELLIS / fast3Dcache inference
cd fast3Dcache
sh inference.sh
# If you want to change tau, please click into selection.py# 1. throughput
cd evaluation
sh throughput.sh
# 2. FLOPs
sh flops.sh
#3. CD / F-Score
sh geometry.shIf you find our work useful for your research, please consider citing:
@misc{yang2025fast3dcachetrainingfree3dgeometry,
title={Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration},
author={Mengyu Yang and Yanming Yang and Chenyi Xu and Chenxi Song and Yufan Zuo and Tong Zhao and Ruibo Li and Chi Zhang},
year={2025},
eprint={2511.22533},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2511.22533},
}