pith. sign in

Soundstream: An end-to-end neural audio codec

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

years

2026 7 2025 1

roles

background 2

polarities

background 2

clear filters

representative citing papers

ENSEMBITS: an alphabet of protein conformational ensembles

cs.LG · 2026-05-13 · unverdicted · novelty 8.0 · 2 refs

Ensembits is the first tokenizer of protein conformational ensembles that outperforms static tokenizers on RMSF prediction and matches them on function and mutation tasks while using less pretraining data.

Codec-Robust Attacks on Audio LLMs

cs.SD · 2026-05-19 · unverdicted · novelty 7.0 · 2 refs

CodecAttack perturbs audio in codec latent space with multi-bitrate EoT to achieve 85.5% average ASR on Opus-compressed Audio LLMs versus under 26% for waveform baselines, with transfer to MP3 and AAC.

FAST: Efficient Action Tokenization for Vision-Language-Action Models

cs.RO · 2025-01-16 · unverdicted · novelty 6.0

FAST applies discrete cosine transform to robot action sequences for efficient tokenization, enabling autoregressive VLAs to succeed on high-frequency dexterous tasks and scale to 10k hours of data while matching diffusion VLA performance with up to 5x faster training.

Woosh: A Sound Effects Foundation Model

cs.SD · 2026-04-02 · accept · novelty 5.0

Woosh is a new publicly released foundation model optimized for high-quality sound effect generation from text or video, showing competitive or better results than open alternatives like Stable Audio Open.

Telephony Voice Agent for Banking Services

cs.HC · 2026-06-27 · unverdicted · novelty 2.0

Implementation of a telephony voice agent for banking services using Dialogflow CX supporting queries, authentication, and live agent handoff.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Telephony Voice Agent for Banking Services cs.HC · 2026-06-27 · unverdicted · none · ref 22

    Implementation of a telephony voice agent for banking services using Dialogflow CX supporting queries, authentication, and live agent handoff.