hub Mixed citations

Coefficients-preserving sampling for reinforcement learning with flow matching. arXiv preprint arXiv:2509.05952

Mixed citation behavior. Most common role is background (57%).

11 Pith papers citing it
Background 57% of classified citations

citation-role summary

background 6 · method 1

years

2026: 8 · 2025: 3

verdicts

UNVERDICTED 11

representative citing papers

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

cs.LG · 2025-09-19 · unverdicted · novelty 7.0

DiffusionNFT performs online RL for diffusion models on the forward process via flow matching and positive-negative contrasts, delivering up to 25x efficiency gains and rapid benchmark improvements over prior reverse-process methods.

MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

cs.AI · 2025-07-29 · unverdicted · novelty 7.0

MixGRPO speeds up GRPO for flow-based image generators by restricting SDE sampling and optimization to a sliding window while using ODE elsewhere, cutting training time by up to 71% with better alignment performance.

A Systematic Post-Train Framework for Video Generation

cs.CV · 2026-04-28 · unverdicted · novelty 5.0

A post-training pipeline for video generation models combines SFT, RLHF with novel GRPO, prompt enhancement, and inference optimization to improve visual quality, temporal coherence, and instruction following.
