pith. sign in

hub Baseline reference

Q-bench: A benchmark for general-purpose foundation models on low-level vision

Baseline reference. 57% of citing Pith papers use this work as a benchmark or comparison.

13 Pith papers citing it
Baseline 57% of classified citations

hub tools

citation-role summary

baseline 3 background 2 dataset 2

citation-polarity summary

representative citing papers

Are We on the Right Way for Evaluating Large Vision-Language Models?

cs.CV · 2024-03-29 · conditional · novelty 6.0

Current LVLM benchmarks overestimate capabilities because many questions can be answered without images due to design flaws or data leakage; MMStar is a human-curated set of 1,500 vision-indispensable samples across 6 capabilities and 18 axes with new metrics for leakage and true multi-modal gain.

LLaVA-OneVision: Easy Visual Task Transfer

cs.CV · 2024-08-06 · unverdicted · novelty 5.0

LLaVA-OneVision is the first single open LMM to simultaneously achieve strong performance in single-image, multi-image, and video scenarios with cross-scenario transfer capabilities.

iDiff: Interpretable Difference-aware Framework for Pairwise Image Quality Assessment

cs.CV · 2026-05-19 · unverdicted · novelty 4.0

iDiff is a dual-branch framework with an Answer Model for robust pairwise preference prediction via view decomposition and ensembles, and a Thinking Model for structured rationale generation using templates and answer-aware supervision, winning first place in the NTIRE 2026 RAIM challenge.

citing papers explorer

Showing 13 of 13 citing papers.