MixtureTT performs direct per-stem timbre transfer on polyphonic mixtures via a shared diffusion transformer, outperforming single-stem baselines on SATB choral data while eliminating cascaded separation errors.
Denoising diffusion probabilistic models
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years: 2026 (2)
verdicts: UNVERDICTED (2)
roles: method (1)
polarities: use method (1)
representative citing papers
Schrödinger Bridge-based generative semantic communication (SBGSC) enables direct optimal distribution transport from semantics to images, cutting hallucinations and achieving 38% better FID, 49.3% better SSIM, and over 8x faster inference than prior GSC methods.
citing papers explorer
- Remix the Timbre: Diffusion-Based Style Transfer Across Polyphonic Stems
MixtureTT performs direct per-stem timbre transfer on polyphonic mixtures via a shared diffusion transformer, outperforming single-stem baselines on SATB choral data while eliminating cascaded separation errors.
- Optimally Bridging Semantics and Data: Generative Semantic Communication via Schrödinger Bridge
Schrödinger Bridge-based generative semantic communication (SBGSC) enables direct optimal distribution transport from semantics to images, cutting hallucinations and achieving 38% better FID, 49.3% better SSIM, and over 8x faster inference than prior GSC methods.