preprint arXiv:2010.10504 , year=

· 2010 · arXiv 2010.10504

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

cs.CV · 2023-09-28 · unverdicted · novelty 6.0

Adding register tokens to Vision Transformers eliminates high-norm background artifacts and raises state-of-the-art performance on dense visual prediction tasks.

Adopting State-of-the-Art Pretrained Audio Representations for Music Recommender Systems

cs.IR · 2026-04-25 · unverdicted · novelty 5.0

Pretrained audio models show large performance gaps between standard MIR tasks and music recommendation in both hot and cold-start settings.

ViP-VL: Vietnamese Self-supervised Speech Pretraining Model with Vector-Quantization Learning

cs.SD · 2026-06-09 · unverdicted · novelty 4.0

ViP-VL achieves claimed state-of-the-art results on Vietnamese ASR, emotion recognition, dialect classification, and speaker verification via vector-quantization self-supervised pretraining on 17k hours with 8x subsampling modifications.

Responsible ASR: Overcoming Challenges of Foundational Models in Narrow-Band and Low-Resource Settings

cs.SD · 2026-06-17 · unverdicted · novelty 3.0

Evaluation of open-source and commercial ASR models on narrow-band Hindi and Indian English shows poor zero-shot results and inconsistent fine-tuning benefits tied to pretraining exposure.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Adopting State-of-the-Art Pretrained Audio Representations for Music Recommender Systems cs.IR · 2026-04-25 · unverdicted · none · ref 104
Pretrained audio models show large performance gaps between standard MIR tasks and music recommendation in both hot and cold-start settings.
ViP-VL: Vietnamese Self-supervised Speech Pretraining Model with Vector-Quantization Learning cs.SD · 2026-06-09 · unverdicted · none · ref 28
ViP-VL achieves claimed state-of-the-art results on Vietnamese ASR, emotion recognition, dialect classification, and speaker verification via vector-quantization self-supervised pretraining on 17k hours with 8x subsampling modifications.
Responsible ASR: Overcoming Challenges of Foundational Models in Narrow-Band and Low-Resource Settings cs.SD · 2026-06-17 · unverdicted · none · ref 32
Evaluation of open-source and commercial ASR models on narrow-band Hindi and Indian English shows poor zero-shot results and inconsistent fine-tuning benefits tied to pretraining exposure.

preprint arXiv:2010.10504 , year=

fields

years

verdicts

representative citing papers

citing papers explorer