PROCESS-2: A Benchmark Speech Corpus for Early Cognitive Impairment Detection

Bahman Mirheidari; Caitlin H. Illingworth; Daniel Blackburn; Fritz Peters; Heidi Christensen; Hend Elghazaly; Labhpreet Kaur; Madhurananda Pahar; Sophie Young; Wing-Zin Leung

arxiv: 2605.14888 · v1 · pith:SHG2GHFUnew · submitted 2026-05-14 · 💻 cs.SD · cs.LG

PROCESS-2: A Benchmark Speech Corpus for Early Cognitive Impairment Detection

Madhurananda Pahar , Caitlin H. Illingworth , Bahman Mirheidari , Hend Elghazaly , Fritz Peters , Sophie Young , Wing-Zin Leung , Labhpreet Kaur

show 2 more authors

Daniel Blackburn Heidi Christensen

This is my paper

classification 💻 cs.SD cs.LG

keywords cognitiveassessmentprocess-2speechimpairmentbenchmarkclinicallycollected

0 comments

read the original abstract

Speech-based analysis offers a scalable and non-invasive approach for detecting cognitive decline, yet progress has been constrained by the limited availability of clinically validated datasets collected under realistic conditions. We introduce PROCESS-2, a large-scale speech dataset designed to support research on automatic assessment of cognitive impairment from spontaneous and task-oriented speech. The dataset comprises recordings from 200 healthy controls, 150 mild cognitive impairment, and 50 dementia diagnoses collected using the CognoMemory digital assessment platform. Each participant completed a single assessment session, including picture description and verbal fluency tasks, accompanied by manually verified transcripts and participant-level metadata. PROCESS-2 contains approximately 21 hours of speech audio with predefined train/test partitions. Comprehensive technical validation evaluated demographic balance, clinical consistency, recording stability, embedding-space structure, and reproducible baseline modelling performance, demonstrating clinically meaningful group separation and stable performance across modelling approaches while preserving real-world conversational variability. PROCESS-2 is released under controlled access via Hugging Face to enable responsible reuse while protecting participant privacy, providing a reproducible benchmark resource for speech-based cognitive assessment research.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Multimodal Framework for Dementia Detection via Linguistic and Acoustic Representation Learning
cs.SD 2026-05 unverdicted novelty 4.0

Multimodal framework fuses HuBERT acoustic and BERT linguistic embeddings with attention and MINE mutual-information maximization for dementia detection on ADReSS and PROCESS-2 datasets.