A Unified and Reproducible Experimentation Framework for Speech Understanding

Chenghao Wang; Duo Ma; Guanyu Chen; Hanqi Li; Haoran Wang; Haoyu Li; Hui Zhang; Jiang Li; Jiaqi Guo; Jing Peng

arxiv: 2605.30899 · v1 · pith:CAN5UIRKnew · submitted 2026-05-29 · 📡 eess.AS · cs.AI · cs.SD

Jing Peng, Junhao Du, Chenghao Wang, Hanqi Li, Yi Yang, Yixuan Wang, Xiaoyu Gu, Guanyu Chen, Yucheng Wang, Jiang Li, Zhangjie Zhao, Haoran Wang, Wenming Tu, Haoyu Li, Duo Ma, Lirong Qian, Yu Xi, Wen Wen, Jiaqi Guo, Hui Zhang, Shuai Fan, Wenbin Jiang, Shuai Wang, Kai Yu

classification 📡 eess.AS · cs.AI · cs.SD

keywords: speech, sure, pipelines, training, unified, across, deployment-oriented, evaluation

Speech foundation models and Speech LLMs have advanced speech understanding, yet deployment-oriented model selection is hindered by non-comparable evaluations caused by mismatched post-processing, and by training results that are hard to reproduce across data scales and pipelines. We present SURE, a unified experimentation framework that standardizes prediction formats, normalization, and scoring. SURE evaluates strong systems across paradigms, from conventional pipelines to Speech LLMs, on representative tasks under realistic acoustic and linguistic stressors. Beyond evaluation, SURE introduces an agent-assisted training conversion flow that maps paper and code into versioned, runnable training pipelines under a unified protocol on matched open-data subsets. Overall, SURE improves comparability and reproducibility for deployment-oriented evaluation.
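
To illustrate why mismatched post-processing makes scores non-comparable, the following is a minimal sketch of word error rate (WER) computed with and without a shared text normalizer. It is not SURE's API: the function names `normalize`, `edit_distance`, and `wer`, and the normalization rules themselves (lowercasing, stripping ASCII punctuation), are assumptions chosen for illustration only.

```python
import string


def normalize(text: str) -> list[str]:
    """Hypothetical shared normalizer (an assumption, not SURE's rules):
    lowercase, drop ASCII punctuation, split on whitespace."""
    text = text.lower()
    text = text.translate(str.maketrans("", "", string.punctuation))
    return text.split()


def edit_distance(ref: list[str], hyp: list[str]) -> int:
    """Word-level Levenshtein distance using a rolling row."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(ref: str, hyp: str, normalizer=normalize) -> float:
    """WER = word-level edit distance / number of reference words."""
    ref_tokens = normalizer(ref)
    hyp_tokens = normalizer(hyp)
    if not ref_tokens:
        raise ValueError("reference is empty after normalization")
    return edit_distance(ref_tokens, hyp_tokens) / len(ref_tokens)


if __name__ == "__main__":
    reference = "Hello, world!"
    hypothesis = "hello world"
    # Same transcripts, different post-processing, different scores.
    print(wer(reference, hypothesis))                        # shared normalizer
    print(wer(reference, hypothesis, normalizer=str.split))  # raw whitespace split
```

With the shared normalizer the two transcripts match exactly, while a raw whitespace split counts both words as errors. Applying one normalizer to every system's output before scoring removes this source of disagreement; note that this simple rule also strips apostrophes and hyphens, which a real protocol would likely handle more carefully.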
