Byungjin Choi, Seongsu Bae, Sunjun Kweon, and Ed- ward Choi

Varco-vision-2 · 2022 · arXiv 2509.10105

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

KSAFE-MM: A Multimodal Safety Benchmark via Localized Contextualization for Korean Cultural Risks

cs.CL · 2026-05-27 · unverdicted · novelty 7.0

KSAFE-MM is a two-part multimodal safety benchmark for Korean contexts that shows MLLMs are more vulnerable to culturally grounded jailbreaks than generic ones, with a noted safety-over-refusal trade-off.

KMMMU: Evaluation of Massive Multi-discipline Multimodal Understanding in Korean Language and Context

cs.CL · 2026-03-18 · conditional · novelty 7.0

KMMMU benchmark demonstrates that leading multimodal models achieve at most 52.42% accuracy on hard Korean exam questions, highlighting limitations in non-English multimodal understanding.

K-MetBench: A Multi-Dimensional Benchmark for Fine-Grained Evaluation of Expert Reasoning, Locality, and Multimodality in Meteorology

cs.CL · 2026-04-27 · unverdicted · novelty 6.0

K-MetBench shows LLMs have large gaps in interpreting meteorology diagrams and Korean-specific context, with smaller local models beating much larger global ones.

citing papers explorer

Showing 1 of 1 citing paper after filters.

KMMMU: Evaluation of Massive Multi-discipline Multimodal Understanding in Korean Language and Context cs.CL · 2026-03-18 · conditional · none · ref 1
KMMMU benchmark demonstrates that leading multimodal models achieve at most 52.42% accuracy on hard Korean exam questions, highlighting limitations in non-English multimodal understanding.

Byungjin Choi, Seongsu Bae, Sunjun Kweon, and Ed- ward Choi

fields

years

verdicts

representative citing papers

citing papers explorer