Chatanyone: Stylized real-time portrait video generation with hierarchical motion diffusion model

Jinwei Qi, Chaonan Ji, Sheng Xu, Peng Zhang, Bang Zhang, Liefeng Bo · 2025 · arXiv 2503.21144

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Real-Time Generation of Streamable Talking Portrait Video with Reference-Guided Deep Compression VAEs

cs.CV · 2026-06-01 · unverdicted · novelty 6.0

A causal VAE with variable reference guidance and a Rectified Flow Transformer enables real-time streamable high-quality talking portrait video generation from audio and images.

citing papers explorer

Showing 1 of 1 citing paper.

Real-Time Generation of Streamable Talking Portrait Video with Reference-Guided Deep Compression VAEs cs.CV · 2026-06-01 · unverdicted · none · ref 37
A causal VAE with variable reference guidance and a Rectified Flow Transformer enables real-time streamable high-quality talking portrait video generation from audio and images.

Chatanyone: Stylized real-time portrait video generation with hierarchical motion diffusion model

fields

years

verdicts

representative citing papers

citing papers explorer