Fine-tuning on matched native and English-pivoted multilingual reasoning datasets across six languages reduces the native reasoning gap to 1.9-3.5%; layer swap of English mid-layers largely closes the remaining gap while preserving target-language CoT.
Lucas Bandarkar and Nanyun Peng
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Rethinking the Multilingual Reasoning Gap with Layer Swap
Fine-tuning on matched native and English-pivoted multilingual reasoning datasets across six languages reduces the native reasoning gap to 1.9-3.5%; layer swap of English mid-layers largely closes the remaining gap while preserving target-language CoT.