← back to paper
arxiv: 2606.11025 · 2 revisions
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models