pith. sign in

Title resolution pending

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.CL 1 cs.CR 1

years

2026 2

verdicts

UNVERDICTED 2

representative citing papers

GAIA-v2-LILT: Multilingual Adaptation of Agent Benchmark beyond Translation

cs.CL · 2026-04-27 · unverdicted · novelty 7.0

A new workflow for multilingual agent benchmark adaptation using functional, cultural, and difficulty alignments improves non-English agent success rates by up to 32.7% over simple machine translation, indicating substantial benchmark-induced measurement error in prior multilingual evaluations.

citing papers explorer

Showing 2 of 2 citing papers.

  • GAIA-v2-LILT: Multilingual Adaptation of Agent Benchmark beyond Translation cs.CL · 2026-04-27 · unverdicted · none · ref 2

    A new workflow for multilingual agent benchmark adaptation using functional, cultural, and difficulty alignments improves non-English agent success rates by up to 32.7% over simple machine translation, indicating substantial benchmark-induced measurement error in prior multilingual evaluations.

  • AgentShield: Deception-based Compromise Detection for Tool-using LLM Agents cs.CR · 2026-05-10 · unverdicted · none · ref 19

    AgentShield uses layered deception traps in LLM agent tool interfaces to detect indirect prompt injection compromises with 90.7-100% success on commercial models, zero false positives, and cross-lingual transfer without retraining.