MGSM-Pro creates five digit-and-context varied instantiations per MGSM question to expose robustness gaps in multilingual LLM math reasoning, with larger drops in low-resource languages.
,→You should use this as a ,→reference alongside the ,→English template to judge ,→if the Native language ,→template is correct
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
CONDITIONAL 1representative citing papers
citing papers explorer
-
MGSM-Pro: A Simple Strategy for Robust Multilingual Mathematical Reasoning Evaluation
MGSM-Pro creates five digit-and-context varied instantiations per MGSM question to expose robustness gaps in multilingual LLM math reasoning, with larger drops in low-resource languages.