pith. sign in

arxiv: 2606.03803 · v2 · pith:DEJ2FEPBnew · submitted 2026-06-02 · 💻 cs.SD · cs.AI· eess.AS

LiveBand: Live Accompaniment Generation in the Audio Domain

classification 💻 cs.SD cs.AIeess.AS
keywords audiocausalaccompanimentlivebandfuturegenerationgeneratorinference
0
0 comments X
read the original abstract

We present LiveBand, a real-time system that generates high-fidelity music accompaniments to live audio input, respecting strict causal constraints. Our method trains a causal transformer generator in the continuous latent space of a pre-trained causal audio autoencoder, using adversarial sequence-level supervision from a discriminator. At each timestep, the generator receives only the causally available mix context and Gaussian noise, and predicts accompaniment latents without access to future mix frames or ground-truth target latents. Training is performed in a single parallel forward pass under causal masking, while streaming inference proceeds autoregressively with a rolling attention state. The model's training and inference computations are matched by design, eliminating teacher forcing and the associated exposure bias. On a multi-instrument music accompaniment benchmark, LiveBand improves over prior work on objective measures of audio quality, beat alignment, and mix adherence, while enabling real-time streaming generation without lookahead into the future on consumer hardware.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.