Efficient Scaling for LLM-based ASR

2025 · arXiv 2508.04096

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Towards Building Speech Large Language Models for Multitask Understanding in Low-Resource Languages

cs.SD · 2025-09-18 · unverdicted · novelty 5.0

Introduces XLSR-Thai encoder, U-Align alignment, and Thai-SUP data pipeline to enable multitask speech understanding SLLMs for Thai.

Rethinking Speech-LLM Integration for ASR: Effective Joint Speech-Text Training by Interleaving

cs.CL · 2026-07-02 · unverdicted · novelty 4.0

JSTIP interleaves speech and text sequences during pretraining on 38k hours of ASR data, improving entity accuracy over ASR-only and simple joint-training baselines while matching performance from domain text.
