A Finetuned SpeechLLM for Joint Multi-Granular L2 Assessment and Natural-Language Rationales

· 2026 · cs.CL · arXiv 2606.09470

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Automated L2 speech assessment can assign proficiency labels, but often lacks interpretability. We propose a rubric-guided SpeechLLM for multi-aspect, multi-granular assessment, trained with a hybrid objective combining supervised fine-tuning and Bounded Direct Preference Optimization. The model jointly predicts ordinal labels at the sentence-level (accuracy, fluency, prosody), word/phoneme-level accuracy, and generates a natural-language rationale in the same response. On SpeechOcean762, our approach matches or outperforms single-granularity models while remaining competitive with prior approaches. We analyze rationale reliability along two axes: self-consistency with model predictions and alignment with ground-truth labels, using sentiment consistency (plausibility) and mention-based agreement (faithfulness). Rationales are plausible at the sentence level, but faithfulness degrades at the word/phoneme level: references are sparse and weakly aligned with token-level labels.

representative citing papers

A Finetuned SpeechLLM for Joint Multi-Granular L2 Assessment and Natural-Language Rationales

cs.CL · 2026-06-08 · unverdicted · novelty 6.0

A rubric-guided SpeechLLM jointly predicts multi-granular L2 proficiency labels and generates natural-language rationales using hybrid SFT and Bounded DPO, matching prior performance on SpeechOcean762 with plausible sentence-level rationales but weaker faithfulness at word/phoneme levels.

citing papers explorer

Showing 1 of 1 citing paper.

A Finetuned SpeechLLM for Joint Multi-Granular L2 Assessment and Natural-Language Rationales cs.CL · 2026-06-08 · unverdicted · none · ref 1 · internal anchor
A rubric-guided SpeechLLM jointly predicts multi-granular L2 proficiency labels and generates natural-language rationales using hybrid SFT and Bounded DPO, matching prior performance on SpeechOcean762 with plausible sentence-level rationales but weaker faithfulness at word/phoneme levels.

A Finetuned SpeechLLM for Joint Multi-Granular L2 Assessment and Natural-Language Rationales

fields

years

verdicts

representative citing papers

citing papers explorer