Byungjin Choi, Seongsu Bae, Sunjun Kweon, and Edward Choi

Varco-vision-2 · 2025 · arXiv 2509.10105

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

KMMMU: Evaluation of Massive Multi-discipline Multimodal Understanding in Korean Language and Context

cs.CL · 2026-03-18 · conditional · novelty 7.0

KMMMU benchmark demonstrates that leading multimodal models achieve at most 52.42% accuracy on hard Korean exam questions, highlighting limitations in non-English multimodal understanding.

K-MetBench: A Multi-Dimensional Benchmark for Fine-Grained Evaluation of Expert Reasoning, Locality, and Multimodality in Meteorology

cs.CL · 2026-04-27 · unverdicted · novelty 6.0

K-MetBench shows LLMs have large gaps in interpreting meteorology diagrams and Korean-specific context, with smaller local models beating much larger global ones.


