IVG combines spec-grounded introspection and view-grounded interaction to let VLMs achieve 0.81 QA accuracy on interactive charts, with gains on overlapping elements, using a new benchmark of 500 Plotly figures.
InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 13872–13882
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Beyond Pixels: Introspective and Interactive Grounding for Visualization Agents
IVG combines spec-grounded introspection and view-grounded interaction to let VLMs achieve 0.81 QA accuracy on interactive charts, with gains on overlapping elements, using a new benchmark of 500 Plotly figures.