pith. sign in

Honghua Dong

Identifiers

  • name variant Honghua Dong 0.60 · backfill

Papers (4)

  1. Precise Debugging Benchmark: Is Your Model Debugging or Regenerating? cs.SE · 2026 · author #6
  2. $\tau^2$-Bench: Evaluating Conversational Agents in a Dual-Control Environment cs.AI · 2025 · author #2
  3. Identifying the Risks of LM Agents with an LM-Emulated Sandbox cs.AI · 2023 · author #2
  4. Neural Logic Machines cs.AI · 2019 · author #1

Mentions

  • 2604.17338 #6 · arxiv_oai · confidence 0.70 Honghua Dong

Frequent Coauthors