Who is Speaking or Who is Depressed? A Controlled Study of Speaker Leakage in Speech-Based Depression Detection

· 2026 · eess.AS · arXiv 2604.14354

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

This study investigates whether speech-based depression detection models learn depression-related acoustic biomarkers or instead rely on speaker identity cues. Using the DAIC-WOZ dataset, we propose a data-splitting strategy that controls speaker overlap between training and test sets while keeping the training size constant, and evaluate three models of varying complexity. Results show that speaker overlap significantly boosts performance, whereas accuracy drops sharply on unseen speakers. Even with a Domain-Adversarial Neural Network, a substantial performance gap remains. These findings indicate that depression-related features extracted by current speech models are highly entangled with speaker identity. Conventional evaluation protocols may therefore overestimate generalization and clinical utility, highlighting the need for strictly speaker-independent evaluation.

representative citing papers

Who is Speaking or Who is Depressed? A Controlled Study of Speaker Leakage in Speech-Based Depression Detection

eess.AS · 2026-04-15 · unverdicted · novelty 5.0

Speech-based depression detection models primarily learn speaker identity rather than depression biomarkers, with performance dropping sharply on unseen speakers even under adversarial training.

citing papers explorer

Showing 1 of 1 citing paper.

Who is Speaking or Who is Depressed? A Controlled Study of Speaker Leakage in Speech-Based Depression Detection eess.AS · 2026-04-15 · unverdicted · none · ref 1 · internal anchor
Speech-based depression detection models primarily learn speaker identity rather than depression biomarkers, with performance dropping sharply on unseen speakers even under adversarial training.

Who is Speaking or Who is Depressed? A Controlled Study of Speaker Leakage in Speech-Based Depression Detection

fields

years

verdicts

representative citing papers

citing papers explorer