pith. sign in

To this end, we construct an example wheref(r+r′ 2 )>max{f(r),f(r′)}

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Fast Rates for Inverse Reinforcement Learning

cs.LG · 2026-05-14 · unverdicted · novelty 6.0

Entropy-regularized Min-Max-IRL achieves O(n^{-1}) rates for trajectory-level KL divergence and squared parameter error in the Hessian norm under misspecification in Borel MDPs.

citing papers explorer

Showing 1 of 1 citing paper.

  • Fast Rates for Inverse Reinforcement Learning cs.LG · 2026-05-14 · unverdicted · none · ref 14

    Entropy-regularized Min-Max-IRL achieves O(n^{-1}) rates for trajectory-level KL divergence and squared parameter error in the Hessian norm under misspecification in Borel MDPs.