SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation

11 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary: background 3

citation-polarity summary

years: 2026 (9) · 2025 (2)

representative citing papers

FlexiSLM: A Dynamic and Controllable Frame Rate Spoken Language Model

cs.SD · 2026-06-30 · unverdicted · novelty 7.0

FlexiSLM is the first spoken language model supporting dynamic and controllable frame rates on speech input and output, outperforming fixed-rate 7B models at high quality and enabling faster inference at lower rates like 6.25 Hz.

Communicating Sound Through Natural Language

cs.LG · 2026-05-09 · unverdicted · novelty 6.0

Lexical acoustic coding lets LLMs transmit audio waveforms as editable natural-language sentences that another LLM can parse and reconstruct into sound.

Adaptive Turn-Taking for Real-time Multi-Party Voice Agents

eess.AS · 2026-06-11 · unverdicted · novelty 5.0 · 2 refs

ModeratorLM conditions a streaming speech LLM on assigned roles for adaptive turn-taking in multi-party settings, reporting over 40% higher precision and 70% higher recall than non-role baselines on real meetings and a new synthetic dataset.
