-3 (Minor mismatch):Most relevant elements are preserved, but a few aspects (e.g., background details, lighting consistency) are missing or incorrectly handled

Image Faithfulness(How well are the non-edited parts, key input elements preserved?) -4 (Uses input fully):All relevant elements from the input (background, style, lighting, ide

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

cs.AI · 2026-04-13 · unverdicted · novelty 6.0

RationalRewards recovers rationales from preference data via PARROT to create a critique-first reward model that improves visual generators at both training time through RL and test time through prompt refinement, matching RL fine-tuning performance while using far less data.

citing papers explorer

Showing 1 of 1 citing paper after filters.

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time cs.AI · 2026-04-13 · unverdicted · none · ref 22
RationalRewards recovers rationales from preference data via PARROT to create a critique-first reward model that improves visual generators at both training time through RL and test time through prompt refinement, matching RL fine-tuning performance while using far less data.

-3 (Minor mismatch):Most relevant elements are preserved, but a few aspects (e.g., background details, lighting consistency) are missing or incorrectly handled

fields

years

verdicts

representative citing papers

citing papers explorer