InThe Twelfth International Conference on Learning Representations, ICLR 2024, Vienna, Austria, May 7-11

Internvid: A large-scale video-text dataset for multimodal understanding, generation · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

MAVEN A Multi-Agent Framework for Multicultural Text-to-Video Generation

cs.CV · 2026-05-16 · unverdicted · novelty 6.0

MAVEN is a multi-agent prompt refinement framework that improves cultural fidelity in text-to-video generation, demonstrated on a new benchmark of 243 prompts and 972 videos across Chinese, American, and Romanian cultures.

citing papers explorer

Showing 1 of 1 citing paper.

MAVEN A Multi-Agent Framework for Multicultural Text-to-Video Generation cs.CV · 2026-05-16 · unverdicted · none · ref 16
MAVEN is a multi-agent prompt refinement framework that improves cultural fidelity in text-to-video generation, demonstrated on a new benchmark of 243 prompts and 972 videos across Chinese, American, and Romanian cultures.

InThe Twelfth International Conference on Learning Representations, ICLR 2024, Vienna, Austria, May 7-11

fields

years

verdicts

representative citing papers

citing papers explorer