Haiyang Shen

Identifiers

  • name variant Haiyang Shen 0.60 · backfill

Papers (8)

  1. EvoCode-Bench: Evaluating Coding Agents in Multi-Turn Iterative Interactions · cs.AI · 2026 · author #1
  2. SGR-Bench: Benchmarking Search Agents on State-Gated Retrieval · cs.AI · 2026 · author #2
  3. MindLoom: Composing Thought Modes for Frontier-Level Reasoning Data Synthesis · cs.AI · 2026 · author #1
  4. DeepWeb-Bench: A Deep Research Benchmark Demanding Massive Cross-Source Evidence and Long-Horizon Derivation · cs.AI · 2026 · author #3
  5. Teaching AI Through Benchmark Construction: QuestBench as a Course-Based Practice for Accountable Knowledge Work · cs.AI · 2026 · author #1
  6. RoadmapBench: Evaluating Long-Horizon Agentic Software Development Across Version Upgrades · cs.SE · 2026 · author #3
  7. ViDR: Grounding Multimodal Deep Research Reports in Source Visual Evidence · cs.CV · 2026 · author #4
  8. Tongyi DeepResearch Technical Report · cs.CL · 2025 · author #29

Mentions

  • 2605.24110 #1 · arxiv_oai · confidence 0.70 Haiyang Shen
  • 2605.22219 #2 · arxiv_oai · confidence 0.70 Haiyang Shen
  • 2605.21630 #1 · arxiv_oai · confidence 0.70 Haiyang Shen
  • 2605.21482 #3 · arxiv_oai · confidence 0.70 Haiyang Shen
  • 2605.21413 #1 · arxiv_oai · confidence 0.70 Haiyang Shen
  • 2510.24701 #29 · arxiv_oai · confidence 0.70 Haiyang Shen
  • 2605.15846 #3 · arxiv_oai · confidence 0.70 Haiyang Shen

Frequent Coauthors