Video diffu- sion models

Jonathan Ho, Tim Salimans, Alexey Gritsenko, William Chan, Mohammad Norouzi, David J Fleet · 2022

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers

cs.AI · 2026-01-09 · unverdicted · novelty 7.0

DiTs use either a two-stage cross-attention circuit or text-token fusion circuit for spatial relations depending on the text encoder, achieving near-perfect in-domain accuracy but differing out-of-domain robustness.

RoboTAG: End-to-end Robot Configuration Estimation via Topological Alignment Graph

cs.RO · 2025-11-11 · unverdicted · novelty 5.0

RoboTAG estimates robot poses from monocular images via a topological alignment graph with 2D-3D co-evolution and consistency supervision to alleviate reliance on labeled data.

citing papers explorer

Showing 2 of 2 citing papers.

Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers cs.AI · 2026-01-09 · unverdicted · none · ref 15
DiTs use either a two-stage cross-attention circuit or text-token fusion circuit for spatial relations depending on the text encoder, achieving near-perfect in-domain accuracy but differing out-of-domain robustness.
RoboTAG: End-to-end Robot Configuration Estimation via Topological Alignment Graph cs.RO · 2025-11-11 · unverdicted · none · ref 12
RoboTAG estimates robot poses from monocular images via a topological alignment graph with 2D-3D co-evolution and consistency supervision to alleviate reliance on labeled data.

Video diffu- sion models

fields

years

verdicts

representative citing papers

citing papers explorer