pith. sign in

A simple and effective reinforcement learning method for text-to-image diffusion fine-tuning

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

fields

cs.CV 5

years

2026 3 2025 2

roles

background 2

polarities

background 2

representative citing papers

Flow-GRPO: Training Flow Matching Models via Online RL

cs.CV · 2025-05-08 · unverdicted · novelty 8.0

Flow-GRPO is the first online RL method for flow matching models, raising GenEval accuracy from 63% to 95% and text-rendering accuracy from 59% to 92% with little reward hacking.

citing papers explorer

Showing 5 of 5 citing papers.