Text distillation from BioCLIP-2 into BioLingual creates audio-image alignment for bird species retrieval without any audio-image training pairs.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.SD 1years
2026 1verdicts
CONDITIONAL 1representative citing papers
citing papers explorer
-
Audio-to-Image Bird Species Retrieval without Audio-Image Pairs via Text Distillation
Text distillation from BioCLIP-2 into BioLingual creates audio-image alignment for bird species retrieval without any audio-image training pairs.