
Zhancun Mu

Identifiers

  • name variant Zhancun Mu 0.60 · backfill

Papers (3)

  1. Scaling World-Model Reinforcement Learning Through Diffusion Policy Optimization · cs.LG · 2026 · author #3
  2. Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning · cs.LG · 2026 · author #1
  3. Preference Goal Tuning: Post-Training as Latent Control for Frozen Policies · cs.AI · 2024 · author #6

Mentions

  • 2605.26282 #3 · arxiv_oai · confidence 0.70 · Zhancun Mu

Frequent Coauthors