Proposes Ratio 1-2 metric for teacher selection in knowledge distillation for fine-grained image recognition, validated across 1000+ experiments showing 18% better selection and up to 17% student accuracy gains.
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2years
2026 2representative citing papers
Large-scale experiments demonstrate that data-aware augmentations applied only during training allow fine-grained image models to reach high accuracy without using discriminative crops at inference, lowering costs.
citing papers explorer
-
How to Choose Your Teacher for Fine Grained Image Recognition
Proposes Ratio 1-2 metric for teacher selection in knowledge distillation for fine-grained image recognition, validated across 1000+ experiments showing 18% better selection and up to 17% student accuracy gains.
-
A Large-Scale Study on the Accuracy vs Cost Trade-offs of Training and Evaluation Settings in Fine-Grained Image Recognition
Large-scale experiments demonstrate that data-aware augmentations applied only during training allow fine-grained image models to reach high accuracy without using discriminative crops at inference, lowering costs.