pith. sign in

hub Baseline reference

Editre- ward: A human-aligned reward model for instruction-guided image editing

Baseline reference. 67% of citing Pith papers use this work as a benchmark or comparison.

11 Pith papers citing it
Baseline 67% of classified citations

hub tools

citation-role summary

baseline 3 background 2 dataset 1

citation-polarity summary

years

2026 10 2025 1

verdicts

UNVERDICTED 11

representative citing papers

RewardHarness: Self-Evolving Agentic Post-Training

cs.AI · 2026-05-09 · unverdicted · novelty 7.0

RewardHarness self-evolves a tool-and-skill library from 100 preference examples to reach 47.4% accuracy on image-edit evaluation, beating GPT-5, and yields stronger RL-tuned models.

Image Diffusion Preview with Consistency Solver

cs.LG · 2025-12-15 · unverdicted · novelty 6.0

ConsistencySolver enables high-quality low-step diffusion previews by adapting general linear multistep methods into a lightweight RL-optimized solver, matching multistep DPM-Solver FID with 47% fewer steps and cutting user interaction time by nearly 50%.

citing papers explorer

Showing 11 of 11 citing papers.