pith. sign in

arxiv: 2403.14623 · v5 · pith:2B5573G4new · submitted 2024-03-21 · 💻 cs.LG · cs.CV

Simplified Diffusion Schr\"odinger Bridge

classification 💻 cs.LG cs.CV
keywords bridgediffusiongenerativeodingerperformanceschrsgmssimplified
0
0 comments X
read the original abstract

This paper introduces a novel theoretical simplification of the Diffusion Schr\"odinger Bridge (DSB) that facilitates its unification with Score-based Generative Models (SGMs), addressing the limitations of DSB in complex data generation and enabling faster convergence and enhanced performance. By employing SGMs as an initial solution for DSB, our approach capitalizes on the strengths of both frameworks, ensuring a more efficient training process and improving the performance of SGM. We also propose a reparameterization technique that, despite theoretical approximations, practically improves the network's fitting capabilities. Our extensive experimental evaluations confirm the effectiveness of the simplified DSB, demonstrating its significant improvements. We believe the contributions of this work pave the way for advanced generative modeling.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Timage: A Generative Text-in-Image Paradigm for Fine-Tuning Vision-Language Models

    cs.CV 2026-06 unverdicted novelty 7.0

    Timage generates text query overlays on images via Constrained Schrödinger Bridge to boost fine-grained spatial reasoning in vision-language models, outperforming larger systems on VMCBench with a 7B backbone.