pith. sign in

Yunho Choi

Identifiers

No identifiers captured yet.

Papers (4)

  1. KL for a KL: On-Policy Distillation with Control Variate Baseline cs.LG · 2026 · author #4
  2. Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States cs.LG · 2026 · author #1
  3. Future Policy Approximation for Offline Reinforcement Learning Improves Mathematical Reasoning cs.CL · 2025 · author #2
  4. Text2Action: Generative Adversarial Synthesis from Language to Action cs.LG · 2017 · author #3

Mentions

No mention provenance yet.

Frequent Coauthors