Additionally, for the GQA dataset, the rich scene graph information provided by the dataset itself is leveraged as supplementary initial evidence

extracts the location, textual content within the image, robustly handling complex environments

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

VG-CoT: Towards Trustworthy Visual Reasoning via Grounded Chain-of-Thought

cs.CV · 2026-04-23 · unverdicted · novelty 6.0

VG-CoT is a new scalable dataset and three-axis benchmark that improves grounded chain-of-thought reasoning in LVLMs by explicitly tying each reasoning step to visual evidence.

citing papers explorer

Showing 1 of 1 citing paper.

VG-CoT: Towards Trustworthy Visual Reasoning via Grounded Chain-of-Thought

cs.CV · 2026-04-23 · unverdicted · none · ref 7

VG-CoT is a new scalable dataset and three-axis benchmark that improves grounded chain-of-thought reasoning in LVLMs by explicitly tying each reasoning step to visual evidence.
