WorldCache: Accelerating World Models for Free via Heterogeneous Token Caching

Chuanguang Yang; Dingrui Wang; Guoxin Fan; Haotong Qin; Libo Huang; Longlong Liao; Michele Magno; Mingqiang Wu; Weilun Feng; Xiangqi Li

arxiv: 2603.06331 · v2 · pith:SAQS6LFQnew · submitted 2026-03-06 · 💻 cs.CV

WorldCache: Accelerating World Models for Free via Heterogeneous Token Caching

Weilun Feng , Guoxin Fan , Haotong Qin , Mingqiang Wu , Yuqi Li , Xiangqi Li , Zhulin An , Libo Huang

show 5 more authors

Dingrui Wang Longlong Liao Michele Magno Yongjun Xu Chuanguang Yang

This is my paper

classification 💻 cs.CV

keywords worldmodelsworldcachetokencachingdiffusiontextbftokens

0 comments

read the original abstract

Diffusion-based world models have shown strong potential for unified world simulation, but the iterative denoising remains too costly for interactive use and long-horizon rollouts. While feature caching can accelerate inference without training, we find that policies designed for single-modal diffusion transfer poorly to world models due to two world-model-specific obstacles: \emph{token heterogeneity} from multi-modal coupling and spatial variation, and \emph{non-uniform temporal dynamics} where a small set of hard tokens drives error growth, making uniform skipping either unstable or overly conservative. We propose \textbf{WorldCache}, a caching framework tailored to diffusion world models. We introduce \textit{Curvature-guided Heterogeneous Token Prediction}, which uses a physics-grounded curvature score to estimate token predictability and applies a Hermite-guided damped predictor for chaotic tokens with abrupt direction changes. We also design \textit{Chaotic-prioritized Adaptive Skipping}, which accumulates a curvature-normalized, dimensionless drift signal and recomputes only when bottleneck tokens begin to drift. Experiments on diffusion world models show that WorldCache delivers up to \textbf{3.7$\times$} end-to-end speedups while maintaining \textbf{98\%} rollout quality, demonstrating the vast advantages and practicality of WorldCache in resource-constrained scenarios. Our code is released in https://github.com/FofGofx/WorldCache.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Video Generation Models as World Models: Efficient Paradigms, Architectures and Algorithms
eess.IV 2026-03 unverdicted novelty 6.0

Video generation models can function as world simulators if efficiency gaps in spatiotemporal modeling are bridged via organized paradigms, architectures, and algorithms.