VideoGPT: Video generation using VQ-V AE and transformers.arXiv preprint arXiv:2104.10540, 2021

Wilson Yan, Yunzhi Zhang, Pieter Abbeel, Aravind Srinivas · 2021 · arXiv 2104.10540

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Adaptive Tokenisation Via Temporal Redundancy Masking And Latent Inpainting

cs.CV · 2026-06-04 · unverdicted · novelty 6.0

A parameter-free approach drops redundant video tokens via temporal L1 differences in frozen latent space and reconstructs them with LIT, yielding 31x speedup over ElasticTok-CV on TokenBench and DAVIS.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Adaptive Tokenisation Via Temporal Redundancy Masking And Latent Inpainting cs.CV · 2026-06-04 · unverdicted · none · ref 30
A parameter-free approach drops redundant video tokens via temporal L1 differences in frozen latent space and reconstructs them with LIT, yielding 31x speedup over ElasticTok-CV on TokenBench and DAVIS.

VideoGPT: Video generation using VQ-V AE and transformers.arXiv preprint arXiv:2104.10540, 2021

fields

years

verdicts

representative citing papers

citing papers explorer