CogView3: Finer and faster text-to-image generation via relay diffusion

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

baseline 1 · dataset 1

citation-polarity summary

fields

cs.CV 5

verdicts

UNVERDICTED 5

polarities

baseline 2

representative citing papers

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

cs.CV · 2024-08-12 · unverdicted · novelty 6.0

CogVideoX generates coherent 10-second text-to-video outputs at high resolution using a 3D VAE, expert adaptive LayerNorm transformer, progressive training, and a custom data pipeline, claiming state-of-the-art results.
