Fixtalk: Taming identity leakage for high-quality talking head generation in extreme cases. CoRR, abs/2507.01390

Shuai Tan, Bill Gong, Bin Ji, Ye Pan · 2025 · arXiv 2507.01390

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Archon: A Unified Multimodal Model for Holistic Digital Human Generation

cs.CV · 2026-05-28 · unverdicted · novelty 5.0

Archon unifies seven modalities via modality-specific tokenizers and an autoregressive backbone pretrained on 72 tasks, plus a 4x-efficient video reparameterization and stepwise 'Thinking in Modality' procedure, and reports superior or comparable results on digital-human tasks.

PortraitDirector: A Hierarchical Disentanglement Framework for Controllable and Real-time Facial Reenactment

cs.CV · 2026-04-21 · unverdicted · novelty 5.0

PortraitDirector uses hierarchical disentanglement of spatial physical motions and semantic emotions to deliver controllable, high-fidelity real-time facial reenactment at 20 FPS.

citing papers explorer

Archon: A Unified Multimodal Model for Holistic Digital Human Generation · cs.CV · 2026-05-28 · unverdicted · none · ref 43
PortraitDirector: A Hierarchical Disentanglement Framework for Controllable and Real-time Facial Reenactment · cs.CV · 2026-04-21 · unverdicted · none · ref 44
