pith. sign in

Siyuan Gan

Identifiers

  • name variant Siyuan Gan 0.60 · backfill

Papers (1)

  1. Thinking-Based Non-Thinking: Solving the Reward Hacking Problem in Training Hybrid Reasoning Models via Reinforcement Learning cs.AI · 2026 · author #1

Mentions

  • 2601.04805 #1 · arxiv_oai · confidence 0.70 Siyuan Gan

Frequent Coauthors