pith. sign in

hub

MMLU - P ro X : A Multilingual Benchmark for Advanced Large Language Model Evaluation

11 Pith papers cite this work. Polarity classification is still indexing.

11 Pith papers citing it

hub tools

fields

cs.CL 11

years

2026 11

clear filters

representative citing papers

Soft Token Alignment for Cross-Lingual Reasoning

cs.CL · 2026-06-25 · unverdicted · novelty 6.0

SOLAR aligns soft-token probability mixtures across languages in embedding space during SFT and raises multilingual reasoning accuracy by up to 17.7 points over the base model.

DEPART: DEcomposing PARiTy across Multilingual LLMs

cs.CL · 2026-05-27 · unverdicted · novelty 6.0

A Bayesian framework decomposes mLLM variance, showing language features explain 79-92% of language identity variance and that model identity vs. benchmark-model interactions dominate differently for understanding versus reasoning tasks.

citing papers explorer

Showing 11 of 11 citing papers after filters.