SSL speech representations outperform hand-crafted features at lower cognitive hierarchy levels but reverse for MCI classification, with greater response freedom in tasks linked to performance dilution at higher levels and structured tasks showing the opposite pattern.
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors,
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CL 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
Audio language models are benchmarked on five semantic and paralinguistic reasoning tasks to reveal limitations in handling spoken audio evidence, accent variation, and domain shifts.
citing papers explorer
-
Beyond Binary: Speech Representations Across the Cognitive Score Hierarchy
SSL speech representations outperform hand-crafted features at lower cognitive hierarchy levels but reverse for MCI classification, with greater response freedom in tasks linked to performance dilution at higher levels and structured tasks showing the opposite pattern.
-
Afrispeech Semantics: Evaluating Audio Semantic Reasoning in Spoken Language Models Across Domains and Accents
Audio language models are benchmarked on five semantic and paralinguistic reasoning tasks to reveal limitations in handling spoken audio evidence, accent variation, and domain shifts.