Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech

Jia-Hong Yuan; Rui Feng; Yin-Long Liu; Zhen-Hua Ling

arxiv: 2406.07410 · v1 · pith:XXBTMJDVnew · submitted 2024-06-11 · 📡 eess.AS

Clever Hans Effect Found in Automatic Detection of Alzheimer's Disease through Speech

Yin-Long Liu , Rui Feng , Jia-Hong Yuan , Zhen-Hua Ling This is my paper

classification 📡 eess.AS

keywords detectionpittrecordingsaccuracyalzheimeraudioclevercorpus

0 comments

read the original abstract

We uncover an underlying bias present in the audio recordings produced from the picture description task of the Pitt corpus, the largest publicly accessible database for Alzheimer's Disease (AD) detection research. Even by solely utilizing the silent segments of these audio recordings, we achieve nearly 100% accuracy in AD detection. However, employing the same methods to other datasets and preprocessed Pitt recordings results in typical levels (approximately 80%) of AD detection accuracy. These results demonstrate a Clever Hans effect in AD detection on the Pitt corpus. Our findings emphasize the crucial importance of maintaining vigilance regarding inherent biases in datasets utilized for training deep learning models, and highlight the necessity for a better understanding of the models' performance.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

SpurAudio: A Benchmark for Studying Shortcut Learning in Few-Shot Audio Classification
cs.CV 2026-05 unverdicted novelty 7.0

SpurAudio benchmark shows state-of-the-art few-shot audio classifiers suffer large performance drops when background correlations are disrupted, even in large pretrained models.