pith. sign in

Chuanyi Sun

Identifiers

  • name variant Chuanyi Sun 0.60 · backfill

Papers (2)

  1. EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms cs.LG · 2026 · author #2
  2. When Does Deep RL Beat Calibrated Baselines? A Benchmark Study on Adaptive Resource Control cs.LG · 2026 · author #2

Mentions

  • 2606.04145 #2 · arxiv_oai · confidence 0.70 Chuanyi Sun
  • 2605.26418 #2 · arxiv_oai · confidence 0.70 Chuanyi Sun

Frequent Coauthors