pith. sign in

Yannis Yiming He

Identifiers

No identifiers captured yet.

Papers (3)

  1. SWE Atlas: Benchmarking Coding Agents Beyond Issue Resolution cs.LG · 2026 · author #4
  2. HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help? cs.AI · 2026 · author #8
  3. SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks? cs.SE · 2025 · author #4

Mentions

No mention provenance yet.

Frequent Coauthors