CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments

Huang, Kung-Hsiang; Prabhakar, Akshara; Dhawan, Sidharth; Mao, Yixin; Wang, Huan; Savarese, Silvio · 2025 · DOI 10.18653/v1/2025.naacl-long.194

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

SEATauBench: Adapting Tool-Agent-User Evaluation Into Low-Resource Southeast Asian Languages

cs.CL · 2026-06-27 · unverdicted · novelty 7.0

SEATauBench is the first agent benchmark for SEA languages, finding that performance holds for language-only changes but degrades sharply with full domain localization.

What If Prompt Injection Never Left? Exploring Cross-Session Stored Prompt Injection in Agentic Systems

cs.CR · 2026-06-03 · unverdicted · novelty 6.0

Formalizes stored prompt injection in agentic systems, develops a taxonomy and benchmark to show how adversarial prompts can persist across sessions via persistent state artifacts.

citing papers explorer

SEATauBench: Adapting Tool-Agent-User Evaluation Into Low-Resource Southeast Asian Languages

cs.CL · 2026-06-27 · unverdicted · none · ref 33

What If Prompt Injection Never Left? Exploring Cross-Session Stored Prompt Injection in Agentic Systems

cs.CR · 2026-06-03 · unverdicted · none · ref 9
