Lexical acoustic coding lets LLMs transmit audio waveforms as editable natural-language sentences that another LLM can parse and reconstruct into sound.
and McVicar, Matt and Battenberg, Eric and Nieto, Oriol , title =
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
background 1
citation-polarity summary
years
2026 4
roles
background 1
polarities
background 1
representative citing papers
TTS data augmentation and LLM error correction together cut relative WER by 40-50% on ASR models for oral cancer speech.
citing papers explorer
- Communicating Sound Through Natural Language
Lexical acoustic coding lets LLMs transmit audio waveforms as editable natural-language sentences that another LLM can parse and reconstruct into sound.
- Improving Automatic Speech Recognition for Speakers Treated for Oral Cancer using Data Augmentation and LLM Error Correction
TTS data augmentation and LLM error correction together cut relative WER by 40-50% on ASR models for oral cancer speech.
- Voice "Cloning" is Style Transfer
- Hardware-Software Co-Design of Scalable, Energy-Efficient Analog Recurrent Computations