ChildVox aggregates 17 child-centered audio datasets into a multi-task benchmark to evaluate foundation models on physiological sounds, vocalizations, syllables, and speech recognition across childhood.
Huper: A human-inspired framework for phonetic perception,
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.SD 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
A LoRA-tuned LLM integrates four complementary speech views to detect dementia, reaching 90.14% F1 on ADReSSo with ablation support for each view's contribution.
citing papers explorer
-
ChildVox: A Speech, Audio, and Large Audio-Language Model Benchmark in Understanding and Characterizing Sound across Childhood
ChildVox aggregates 17 child-centered audio datasets into a multi-task benchmark to evaluate foundation models on physiological sounds, vocalizations, syllables, and speech recognition across childhood.
-
LoRA-Tuned Large Language Models for Dementia Detection via Multi-View Speech-Derived Features
A LoRA-tuned LLM integrates four complementary speech views to detect dementia, reaching 90.14% F1 on ADReSSo with ablation support for each view's contribution.