9 Openvid-1m: A large-scale high-quality dataset for text-to-video generation

Kepan Nan, Rui Xie, Penghao Zhou, Tiehan Fan, Zhenheng Yang, Zhijie Chen, Xiang Li, Jian Yang, Ying Tai · 2025

1 Pith paper cites this work. Polarity classification is still indexing.

citation-role summary

dataset 1

citation-polarity summary

use dataset 1

representative citing papers

Improving Human Image Animation via Semantic Representation Alignment

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

SemanticREPA aligns structure representations from video latents with depth features and ID representations with face recognition features to supervise diffusion models, yielding more coherent and consistent human animations.

citing papers explorer

Improving Human Image Animation via Semantic Representation Alignment

cs.CV · 2026-05-11 · unverdicted · none · ref 35
