pith. sign in

Mixed citations

Training language models to follow instructions with human feedback

Mixed citation behavior. Most common role is background (60%).

8 Pith papers citing it
Background 60% of classified citations

citation-role summary

background 3 method 2

citation-polarity summary

years

2026 5 2025 3

clear filters

representative citing papers

DanceGRPO: Unleashing GRPO on Visual Generation

cs.CV · 2025-05-12 · unverdicted · novelty 6.0

DanceGRPO applies GRPO to visual generation tasks to achieve stable policy optimization across diffusion models, rectified flows, multiple tasks, and diverse reward models, outperforming prior RL methods.

citing papers explorer

Showing 3 of 3 citing papers after filters.