Longcat-audio-codec: An audio tokenizer and detokenizer solution designed for speech large language models, 2025. URL https://arxiv

Zhao, X · 2025 · arXiv 2510.15227

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Unlocking Speech-Text Compositional Powers: Instruction-Following Speech Language Models without Instruction Tuning

cs.CL · 2026-07-02 · unverdicted · novelty 7.0

SpeechCombine produces instruction-following SLMs via speech pre-training followed by direct weight combination with the text LLM instruction delta, without any speech instruction tuning.

citing papers explorer


Unlocking Speech-Text Compositional Powers: Instruction-Following Speech Language Models without Instruction Tuning · cs.CL · 2026-07-02 · unverdicted · none · ref 37
