Minjae Oh
Identifiers
No identifiers captured yet.
Papers (4)
- KL for a KL: On-Policy Distillation with Control Variate Baseline cs.LG · 2026 · author #1
- Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States cs.LG · 2026 · author #4
- ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding cs.CL · 2025 · author #2
- Future Policy Approximation for Offline Reinforcement Learning Improves Mathematical Reasoning cs.CL · 2025 · author #1
Mentions
No mention provenance yet.
Frequent Coauthors
- Yohan Jo 4 shared papers
- Yunho Choi 3 shared papers
- Sangjun Song 2 shared papers
- Dongmin Choi 1 shared papers
- Gyubin Choi 1 shared papers
- Jeonghoon Shim 1 shared papers
- Jongwon Lim 1 shared papers
- Seungkyu Lee 1 shared papers
- Sungmin Jo 1 shared papers
- Woojin Ahn 1 shared papers