M ulti WOZ 2.2 : A Dialogue Dataset with Additional Annotation Corrections and State Tracking Baselines

Zang, Xiaoxue, Rastogi, Abhinav, Sunkara, Srinivas, Gupta, Raghav, Zhang, Jianguo, Chen, Jindong · 2020 · DOI 10.18653/v1/2020.nlp4convai-1.13

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

SEATauBench: Adapting Tool-Agent-User Evaluation Into Low-Resource Southeast Asian Languages

cs.CL · 2026-06-27 · unverdicted · novelty 7.0

SEATauBench is the first agent benchmark for SEA languages, finding that performance holds for language-only changes but degrades sharply with full domain localization.

GBC: Gradient-Based Connections for Optimizing Multi-Agent Systems

cs.MA · 2026-06-26 · unverdicted · novelty 5.0

GBC treats multi-agent LLM workflows as differentiable graphs to enable token-level attribution and targeted optimization, with reported gains on MultiWOZ and τ-bench.

citing papers explorer

Showing 1 of 1 citing paper after filters.

SEATauBench: Adapting Tool-Agent-User Evaluation Into Low-Resource Southeast Asian Languages cs.CL · 2026-06-27 · unverdicted · none · ref 28
SEATauBench is the first agent benchmark for SEA languages, finding that performance holds for language-only changes but degrades sharply with full domain localization.

M ulti WOZ 2.2 : A Dialogue Dataset with Additional Annotation Corrections and State Tracking Baselines

fields

years

verdicts

representative citing papers

citing papers explorer