MGSM-Pro creates five digit-and-context varied instantiations per MGSM question to expose robustness gaps in multilingual LLM math reasoning, with larger drops in low-resource languages.
You ,→should make sure the native ,→language template is as ,→similar to the english ,→template as possible
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
CONDITIONAL 1representative citing papers
citing papers explorer
-
MGSM-Pro: A Simple Strategy for Robust Multilingual Mathematical Reasoning Evaluation
MGSM-Pro creates five digit-and-context varied instantiations per MGSM question to expose robustness gaps in multilingual LLM math reasoning, with larger drops in low-resource languages.