The implementation of TV-AIL is based on an existing AIL algorithm DAC [Kostrikov et al., 2019] (https://github.com/google-research/ google-research/tree/master/value_dice)

For MuJoCo tasks, we implement BC according to [Li et al · 2022

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis

cs.LG · 2022-08-03 · unverdicted · novelty 7.0

TV-AIL achieves a horizon-independent imitation gap of O(min{1, sqrt(|S|/N)}) via stage-coupled dynamic programming analysis on locomotion-abstracted MDPs.

citing papers explorer

Showing 1 of 1 citing paper.

Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis cs.LG · 2022-08-03 · unverdicted · none · ref 15
TV-AIL achieves a horizon-independent imitation gap of O(min{1, sqrt(|S|/N)}) via stage-coupled dynamic programming analysis on locomotion-abstracted MDPs.

The implementation of TV-AIL is based on an existing AIL algorithm DAC [Kostrikov et al., 2019] (https://github.com/google-research/ google-research/tree/master/value_dice)

fields

years

verdicts

representative citing papers

citing papers explorer