Problem Formulation Given a vocal waveformx∈R fsT of durationTseconds at sample ratef s, we model the conditional distributionP(y| x)over instrumental waveformsy

PROPOSED METHOD 3

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

HAFM: Hierarchical Autoregressive Foundation Model for Music Accompaniment Generation

cs.SD · 2026-04-10 · unverdicted · novelty 5.0

HAFM uses a hierarchical autoregressive model with dual-rate HuBERT and EnCodec tokens to generate coherent instrumental music from vocals, achieving FAD 2.08 on MUSDB18 while matching prior systems with fewer parameters.

citing papers explorer

Showing 1 of 1 citing paper.

HAFM: Hierarchical Autoregressive Foundation Model for Music Accompaniment Generation cs.SD · 2026-04-10 · unverdicted · none · ref 3
HAFM uses a hierarchical autoregressive model with dual-rate HuBERT and EnCodec tokens to generate coherent instrumental music from vocals, achieving FAD 2.08 on MUSDB18 while matching prior systems with fewer parameters.

Problem Formulation Given a vocal waveformx∈R fsT of durationTseconds at sample ratef s, we model the conditional distributionP(y| x)over instrumental waveformsy

fields

years

verdicts

representative citing papers

citing papers explorer