A sequential-to-global SSL method based on DINO pretrains iterative foveal-inspired vision transformers to achieve competitive ImageNet-1K performance with constant compute regardless of input resolution.
In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3verdicts
UNVERDICTED 3representative citing papers
Neural networks trained via supervised contrastive learning yield feature attributions that are more faithful, less complex, and more continuous than those from cross-entropy trained networks.
T-DuMpRa fuses classifier outputs with cosine-matched multi-prototypes from a teacher model via conservative gating, yielding 0.21-2.69% gains on skin lesion datasets across five backbones.
citing papers explorer
-
Self-supervised pretraining for an iterative image size agnostic vision transformer
A sequential-to-global SSL method based on DINO pretrains iterative foveal-inspired vision transformers to achieve competitive ImageNet-1K performance with constant compute regardless of input resolution.
-
On the Properties of Feature Attribution for Supervised Contrastive Learning
Neural networks trained via supervised contrastive learning yield feature attributions that are more faithful, less complex, and more continuous than those from cross-entropy trained networks.
-
T-DuMpRa: Teacher-guided Dual-path Multi-prototype Retrieval Augmented framework for fine-grained medical image classification
T-DuMpRa fuses classifier outputs with cosine-matched multi-prototypes from a teacher model via conservative gating, yielding 0.21-2.69% gains on skin lesion datasets across five backbones.