To this end, we construct an example wheref(r+r′ 2 )>max{f(r),f(r′)}

We show the stronger statement thatf(r):= ˆLMLE n (π⋆ r)is not even quasiconvex in general

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Fast Rates for Inverse Reinforcement Learning

cs.LG · 2026-05-14 · unverdicted · novelty 6.0

Entropy-regularized Min-Max-IRL achieves O(n^{-1}) rates for trajectory-level KL divergence and squared parameter error in the Hessian norm under misspecification in Borel MDPs.

citing papers explorer

Showing 1 of 1 citing paper.

Fast Rates for Inverse Reinforcement Learning cs.LG · 2026-05-14 · unverdicted · none · ref 14
Entropy-regularized Min-Max-IRL achieves O(n^{-1}) rates for trajectory-level KL divergence and squared parameter error in the Hessian norm under misspecification in Borel MDPs.

To this end, we construct an example wheref(r+r′ 2 )>max{f(r),f(r′)}

fields

years

verdicts

representative citing papers

citing papers explorer