pith. sign in

E-grpo: High entropy steps drive effective reinforcement learning for flow models

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

years

2026 5

verdicts

UNVERDICTED 5

roles

background 2

polarities

background 1 unclear 1

representative citing papers

A Systematic Post-Train Framework for Video Generation

cs.CV · 2026-04-28 · unverdicted · novelty 5.0

A post-training pipeline for video generation models combines SFT, RLHF with novel GRPO, prompt enhancement, and inference optimization to improve visual quality, temporal coherence, and instruction following.

citing papers explorer

Showing 5 of 5 citing papers.