Detecting cognitive decline using speech only: The ADReSSo Challenge

Brian MacWhinney; Davida Fromm; Fasih Haider; Saturnino Luz; Sofia de la Fuente

arxiv: 2104.09356 · v1 · pith:26OTCVZ7new · submitted 2021-03-23 · 📡 eess.AS · cs.CL· cs.LG· cs.SD

Detecting cognitive decline using speech only: The ADReSSo Challenge

Saturnino Luz , Fasih Haider , Sofia de la Fuente , Davida Fromm , Brian MacWhinney This is my paper

classification 📡 eess.AS cs.CLcs.LGcs.SD

keywords predictioncognitivechallengedeclinetaskaccuracyadressobaseline

0 comments

read the original abstract

Building on the success of the ADReSS Challenge at Interspeech 2020, which attracted the participation of 34 teams from across the world, the ADReSSo Challenge targets three difficult automatic prediction problems of societal and medical relevance, namely: detection of Alzheimer's Dementia, inference of cognitive testing scores, and prediction of cognitive decline. This paper presents these prediction tasks in detail, describes the datasets used, and reports the results of the baseline classification and regression models we developed for each task. A combination of acoustic and linguistic features extracted directly from audio recordings, without human intervention, yielded a baseline accuracy of 78.87% for the AD classification task, an MMSE prediction root mean squared (RMSE) error of 5.28, and 68.75% accuracy for the cognitive decline prediction task.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Language-Based Digital Twins for Elderly Cognitive Assistance
cs.AI 2026-06 unverdicted novelty 5.0

Introduces LLM-powered digital twins for elderly speech simulation, evaluated with multi-head cVAE on I-CONECT data for identity preservation and MoCA score prediction, outperforming GPT baselines.
From Objectives to Applications: Aligning Architectural Biases in Audio Self-Supervised Learning
eess.AS 2026-07 unverdicted novelty 3.0

A survey that organizes audio SSL into five objective paradigms, relates their demands to architectural biases, and interprets downstream applications as tests of generalization.