pith. sign in

Terry Yue Zhuo

Identifiers

  • name variant Terry Yue Zhuo 0.60 · backfill

Papers (8)

  1. SWE-Chain: Benchmarking Coding Agents on Chained Release-Level Package Upgrades cs.SE · 2026 · author #7
  2. ShredBench: Evaluating the Semantic Reasoning Capabilities of Multimodal LLMs in Document Reconstruction cs.CV · 2026 · author #6
  3. Coherence Collapse: Diagnosing Why Code Agents Fail After Reaching the Right Code cs.SE · 2026 · author #5
  4. Watermarking LLM Agent Trajectories cs.CR · 2026 · author #3
  5. Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces cs.SE · 2026 · author #65
  6. BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions cs.SE · 2024 · author #1
  7. StarCoder 2 and The Stack v2: The Next Generation cs.SE · 2024 · author #25
  8. StarCoder: may the source be with you! cs.CL · 2023 · author #13

Mentions

  • 2603.24631 #5 · arxiv_oai · confidence 0.70 Terry Yue Zhuo

Frequent Coauthors