Stable Virtual Camera: Generative View Synthesis with Diffusion Models

Jensen Zhou, Hang Gao, Vikram Voleti, Aaryaman Vasishta, Chun-Han Yao, Mark Boss + 2 more · 2025 · 2025 IEEE/CVF International Conference on Computer Vision (ICCV) · DOI 10.1109/iccv51701.2025.01153

1 Pith paper cites this work, alongside 5 external citations. Polarity classification is still indexing.

5 external citations · Crossref

representative citing papers

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

cs.CV · 2026-05-14 · unverdicted · novelty 5.0

SANA-WM is a 2.6B-parameter efficient world model that synthesizes minute-scale 720p videos with 6-DoF camera control, trained on 213K public clips in 15 days on 64 H100s and runnable on single GPUs at 36x higher throughput than prior open baselines.

citing papers explorer

Showing 1 of 1 citing paper.

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

cs.CV · 2026-05-14 · unverdicted · none · ref 63

