pith. sign in

Human-Aligned MLLM judges for fine- grained image editing evaluation: a benchmark, framework, and analysis.arXiv preprint arXiv:2602.13028, 2026

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.AI 1 cs.GR 1

years

2026 2

verdicts

UNVERDICTED 2

representative citing papers

PaintBench: Deterministic Evaluation of Precise Visual Editing

cs.GR · 2026-05-29 · unverdicted · novelty 5.0

PaintBench provides a scalable deterministic benchmark for precise visual editing operations, revealing that even the best of 11 models achieves only 17.1% mIoU and that scores correlate strongly with applied data visualization editing performance.

citing papers explorer

Showing 2 of 2 citing papers.