For the lower bound instances (refer to Assumption 2), we consider that the initial state distribution is uniform over all states

For RBAS MDPs, we consider that the initial state distribution is uniform over good states · 2019

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis

cs.LG · 2022-08-03 · unverdicted · novelty 7.0

TV-AIL achieves a horizon-independent imitation gap of O(min{1, sqrt(|S|/N)}) via stage-coupled dynamic programming analysis on locomotion-abstracted MDPs.

citing papers explorer

Showing 1 of 1 citing paper.

Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis cs.LG · 2022-08-03 · unverdicted · none · ref 16
TV-AIL achieves a horizon-independent imitation gap of O(min{1, sqrt(|S|/N)}) via stage-coupled dynamic programming analysis on locomotion-abstracted MDPs.

For the lower bound instances (refer to Assumption 2), we consider that the initial state distribution is uniform over all states

fields

years

verdicts

representative citing papers

citing papers explorer