Masked Autoencoders in Computer Vision: A Comprehensive Survey,

· 2023 · arXiv 2023.332338

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Gaussian Process Prior Variational Autoencoder for Endoscopic Videos

cs.CV · 2026-06-18 · unverdicted · novelty 6.0

GPVAE replaces the standard VAE latent prior with a temporal Gaussian process prior, combined with endoscopy-specific encoders and specular masking, to achieve up to 26.1% lower image reconstruction RMSE on the C3VDv2 colonoscopy dataset.

Clustering Guided Domain-Specific Pretrained Foundation Model for Very High-Resolution Arctic Remote Sensing

cs.CV · 2026-05-28 · unverdicted · novelty 5.0

Affinity-propagation clustering of Arctic VHSR imagery enables MAE pretraining of a ViT-Large encoder that outperforms ImageNet and Prithvi-EO-2.0 baselines by 5-15 percentage points in mean F1 on four downstream Arctic detection and segmentation tasks.

citing papers explorer

Showing 2 of 2 citing papers.

Gaussian Process Prior Variational Autoencoder for Endoscopic Videos cs.CV · 2026-06-18 · unverdicted · none · ref 27
GPVAE replaces the standard VAE latent prior with a temporal Gaussian process prior, combined with endoscopy-specific encoders and specular masking, to achieve up to 26.1% lower image reconstruction RMSE on the C3VDv2 colonoscopy dataset.
Clustering Guided Domain-Specific Pretrained Foundation Model for Very High-Resolution Arctic Remote Sensing cs.CV · 2026-05-28 · unverdicted · none · ref 14
Affinity-propagation clustering of Arctic VHSR imagery enables MAE pretraining of a ViT-Large encoder that outperforms ImageNet and Prithvi-EO-2.0 baselines by 5-15 percentage points in mean F1 on four downstream Arctic detection and segmentation tasks.

Masked Autoencoders in Computer Vision: A Comprehensive Survey,

fields

years

verdicts

representative citing papers

citing papers explorer