Minjae Oh

Identifiers

No identifiers captured yet.

Papers (4)

KL for a KL: On-Policy Distillation with Control Variate Baseline · cs.LG · 2026 · author #1
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States · cs.LG · 2026 · author #4
ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding · cs.CL · 2025 · author #2
Future Policy Approximation for Offline Reinforcement Learning Improves Mathematical Reasoning · cs.CL · 2025 · author #1

Mentions

No mention provenance yet.

Frequent Coauthors

Yohan Jo 4 shared papers
Yunho Choi 3 shared papers
Sangjun Song 2 shared papers
Dongmin Choi 1 shared paper
Gyubin Choi 1 shared paper
Jeonghoon Shim 1 shared paper
Jongwon Lim 1 shared paper
Seungkyu Lee 1 shared paper
Sungmin Jo 1 shared paper
Woojin Ahn 1 shared paper