Dolph2Vec is the first species-specific self-supervised model for dolphin vocalizations, trained on longitudinal recordings from five dolphins, that outperforms general baselines on signature whistle classification and detection while producing embeddings aligned with known whistle categories.
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
dataset 1polarities
use dataset 1representative citing papers
A recurrent-depth architecture enables language models to improve reasoning performance by iterating computation in latent space, achieving gains equivalent to much larger models on benchmarks.
The survey identifies a key tension in multilingual vision-language models between language neutrality via contrastive learning and cultural awareness via diverse data, with most benchmarks relying on translation-based evaluation.
citing papers explorer
-
Multilingual Vision-Language Models, A Survey
The survey identifies a key tension in multilingual vision-language models between language neutrality via contrastive learning and cultural awareness via diverse data, with most benchmarks relying on translation-based evaluation.