In The Twelfth International Conference on Learning Representations, ICLR 2024, Vienna, Austria, May 7-11

Let’s verify step by step · 2024

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

AgentEval: DAG-Structured Step-Level Evaluation for Agentic Workflows with Error Propagation Tracking

cs.SE · 2026-04-26 · conditional · novelty 7.0

AgentEval evaluates agentic workflows via DAGs with step metrics, a 21-category failure taxonomy, and error propagation tracking, yielding 2.17x higher failure recall than end-to-end methods and strong human agreement.

citing papers explorer

Showing 1 of 1 citing paper.

AgentEval: DAG-Structured Step-Level Evaluation for Agentic Workflows with Error Propagation Tracking

cs.SE · 2026-04-26 · conditional · none · ref 2

AgentEval evaluates agentic workflows via DAGs with step metrics, a 21-category failure taxonomy, and error propagation tracking, yielding 2.17x higher failure recall than end-to-end methods and strong human agreement.
