← back to paper
arxiv: 2605.06156 · 2 revisions
Entropy-Regularized Adjoint Matching for Offline Reinforcement Learning