VideoMAE: Masked autoencoders are data-efficient learners for self-supervised video pre-training. NeurIPS, 2022

Zhan Tong, Yibing Song, Jue Wang, Limin Wang · 2022

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

cs.CV · 2025-12-15 · unverdicted · novelty 7.0

RVM uses recurrent computation inside a masked autoencoder to learn video representations that match or exceed prior video and image models on classification, tracking, and dense spatial tasks with up to 30x better parameter efficiency.

citing papers explorer

Recurrent Video Masked Autoencoders · cs.CV · 2025-12-15 · unverdicted · none · ref 67
