pith. sign in

The unrea- sonable effectiveness of deep features as a perceptual metric

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

citation-role summary

method 2

citation-polarity summary

fields

cs.CV 2

years

2024 1 2022 1

roles

method 2

polarities

use method 2

representative citing papers

LTX-Video: Realtime Video Latent Diffusion

cs.CV · 2024-12-30 · conditional · novelty 6.0

LTX-Video integrates Video-VAE and transformer for 1:192 latent compression and real-time video diffusion by moving patchifying to the VAE and letting the decoder finish denoising in pixel space.

citing papers explorer

Showing 2 of 2 citing papers.

  • LTX-Video: Realtime Video Latent Diffusion cs.CV · 2024-12-30 · conditional · none · ref 16

    LTX-Video integrates Video-VAE and transformer for 1:192 latent compression and real-time video diffusion by moving patchifying to the VAE and letting the decoder finish denoising in pixel space.

  • Scaling Autoregressive Models for Content-Rich Text-to-Image Generation cs.CV · 2022-06-22 · unverdicted · none · ref 81

    Scaling an autoregressive Transformer to 20B parameters for text-to-image generation using image token sequences achieves new SOTA zero-shot FID of 7.23 and fine-tuned FID of 3.22 on MS-COCO.