pith. sign in

arxiv: 2104.09356 · v1 · pith:26OTCVZ7new · submitted 2021-03-23 · 📡 eess.AS · cs.CL· cs.LG· cs.SD

Detecting cognitive decline using speech only: The ADReSSo Challenge

classification 📡 eess.AS cs.CLcs.LGcs.SD
keywords predictioncognitivechallengedeclinetaskaccuracyadressobaseline
0
0 comments X
read the original abstract

Building on the success of the ADReSS Challenge at Interspeech 2020, which attracted the participation of 34 teams from across the world, the ADReSSo Challenge targets three difficult automatic prediction problems of societal and medical relevance, namely: detection of Alzheimer's Dementia, inference of cognitive testing scores, and prediction of cognitive decline. This paper presents these prediction tasks in detail, describes the datasets used, and reports the results of the baseline classification and regression models we developed for each task. A combination of acoustic and linguistic features extracted directly from audio recordings, without human intervention, yielded a baseline accuracy of 78.87% for the AD classification task, an MMSE prediction root mean squared (RMSE) error of 5.28, and 68.75% accuracy for the cognitive decline prediction task.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Language-Based Digital Twins for Elderly Cognitive Assistance

    cs.AI 2026-06 unverdicted novelty 5.0

    Introduces LLM-powered digital twins for elderly speech simulation, evaluated with multi-head cVAE on I-CONECT data for identity preservation and MoCA score prediction, outperforming GPT baselines.

  2. From Objectives to Applications: Aligning Architectural Biases in Audio Self-Supervised Learning

    eess.AS 2026-07 unverdicted novelty 3.0

    A survey that organizes audio SSL into five objective paradigms, relates their demands to architectural biases, and interprets downstream applications as tests of generalization.