Are Large Vision Language Models up to the Challenge of Chart Comprehension and Reasoning

Mohammed Saidul Islam, Raian Rahman, Ahmed Masry, Md Tahmid Rahman Laskar, Mir Tafseer Nayeem, Enamul Hoque · 2024 · Findings of the Association for Computational Linguistics: EMNLP 2024 · DOI 10.18653/v1/2024.findings-emnlp.191

3 Pith papers cite this work, alongside 7 external citations. Polarity classification is still indexing.

3 Pith papers citing it

7 external citations · Crossref

open at publisher browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

QCalEval: Benchmarking Vision-Language Models for Quantum Calibration Plot Understanding

quant-ph · 2026-04-28 · unverdicted · novelty 7.0

Introduces QCalEval benchmark showing best zero-shot VLM score of 72.3 on quantum calibration plots, with fine-tuning and in-context learning effects varying by model type.

Beyond Single Plots: A Benchmark for Question Answering on Multi-Charts

cs.CL · 2026-04-23 · unverdicted · novelty 7.0

PolyChartQA is a new mid-scale dataset for multi-chart question answering that reveals a 27.4% accuracy drop for multimodal models on human-authored questions compared to AI-generated ones, plus a modest gain from a proposed prompting method.

Assessing Y-Axis Influence: Bias in Multimodal Language Models on Chart-to-Table Translation

cs.AI · 2026-04-27 · unverdicted · novelty 5.0

Y-axis features such as major tick digit length, number of ticks, value range, and format introduce significant biases in multimodal models during chart-to-table tasks, with y-axis prompting improving performance for some models.

citing papers explorer

Showing 3 of 3 citing papers.

QCalEval: Benchmarking Vision-Language Models for Quantum Calibration Plot Understanding quant-ph · 2026-04-28 · unverdicted · none · ref 34
Introduces QCalEval benchmark showing best zero-shot VLM score of 72.3 on quantum calibration plots, with fine-tuning and in-context learning effects varying by model type.
Beyond Single Plots: A Benchmark for Question Answering on Multi-Charts cs.CL · 2026-04-23 · unverdicted · none · ref 8
PolyChartQA is a new mid-scale dataset for multi-chart question answering that reveals a 27.4% accuracy drop for multimodal models on human-authored questions compared to AI-generated ones, plus a modest gain from a proposed prompting method.
Assessing Y-Axis Influence: Bias in Multimodal Language Models on Chart-to-Table Translation cs.AI · 2026-04-27 · unverdicted · none · ref 6
Y-axis features such as major tick digit length, number of ticks, value range, and format introduce significant biases in multimodal models during chart-to-table tasks, with y-axis prompting improving performance for some models.

Are Large Vision Language Models up to the Challenge of Chart Comprehension and Reasoning

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer