Alignhuman: Improving motion and fidelity via timestep- segment preference optimization for audio-driven human an- imation,

· 2025 · arXiv 2506.11144

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

EchoTorrent: Towards Swift, Sustained, and Streaming Multi-Modal Video Generation

cs.CV · 2026-02-14 · unverdicted · novelty 4.0

EchoTorrent combines multi-teacher distillation, adaptive CFG calibration, hybrid long-tail forcing, and VAE decoder refinement to enable few-pass autoregressive streaming video generation with improved temporal consistency and audio-lip sync.

citing papers explorer

Showing 1 of 1 citing paper.

EchoTorrent: Towards Swift, Sustained, and Streaming Multi-Modal Video Generation cs.CV · 2026-02-14 · unverdicted · none · ref 75
EchoTorrent combines multi-teacher distillation, adaptive CFG calibration, hybrid long-tail forcing, and VAE decoder refinement to enable few-pass autoregressive streaming video generation with improved temporal consistency and audio-lip sync.

Alignhuman: Improving motion and fidelity via timestep- segment preference optimization for audio-driven human an- imation,

fields

years

verdicts

representative citing papers

citing papers explorer