pith. sign in

Sai Koneru, Miriam Exel, Matthias Huck, and Jan Niehues

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.CL 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

GAIA-v2-LILT: Multilingual Adaptation of Agent Benchmark beyond Translation

cs.CL · 2026-04-27 · unverdicted · novelty 7.0

A new workflow for multilingual agent benchmark adaptation using functional, cultural, and difficulty alignments improves non-English agent success rates by up to 32.7% over simple machine translation, indicating substantial benchmark-induced measurement error in prior multilingual evaluations.

citing papers explorer

Showing 1 of 1 citing paper.

  • GAIA-v2-LILT: Multilingual Adaptation of Agent Benchmark beyond Translation cs.CL · 2026-04-27 · unverdicted · none · ref 3

    A new workflow for multilingual agent benchmark adaptation using functional, cultural, and difficulty alignments improves non-English agent success rates by up to 32.7% over simple machine translation, indicating substantial benchmark-induced measurement error in prior multilingual evaluations.