pith. machine review for the scientific record.
sign in

Patrick D. Watson

Identifiers

No identifiers captured yet.

Papers (1)

  1. Agent-Diff: Benchmarking LLM Agents on Enterprise API Tasks via Code Execution with State-Diff-Based Evaluation cs.SE · 2026 · author #3

Mentions

No mention provenance yet.

Frequent Coauthors