Risk-Sensitive Generative Adversarial Imitation Learning

Jonathan Lacotte , Mohammad Ghavamzadeh , Yinlam Chow , Marco Pavone

Authors on Pith no claims yet

classification 💻 cs.LG cs.AIstat.ML

keywords imitationlearningrisk-sensitiveadversarialalgorithmsgailgenerativeoptimization

read the original abstract

We study risk-sensitive imitation learning where the agent's goal is to perform at least as well as the expert in terms of a risk profile. We first formulate our risk-sensitive imitation learning setting. We consider the generative adversarial approach to imitation learning (GAIL) and derive an optimization problem for our formulation, which we call it risk-sensitive GAIL (RS-GAIL). We then derive two different versions of our RS-GAIL optimization problem that aim at matching the risk profiles of the agent and the expert w.r.t. Jensen-Shannon (JS) divergence and Wasserstein distance, and develop risk-sensitive generative adversarial imitation learning algorithms based on these optimization problems. We evaluate the performance of our algorithms and compare them with GAIL and the risk-averse imitation learning (RAIL) algorithms in two MuJoCo and two OpenAI classical control tasks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Structure from Strategic Interaction & Uncertainty: Risk Sensitive Games for Robust Preference Learning
cs.GT 2026-05 unverdicted novelty 7.0

Risk-sensitive preference games retain monotonicity via translation-invariant risk measures, enabling convergent self-play algorithms with stability bounds and empirical robustness across data strata.
Structure from Strategic Interaction & Uncertainty: Risk Sensitive Games for Robust Preference Learning
cs.GT 2026-05 unverdicted novelty 6.0

Risk-sensitive preference games using convex risk measures produce policies that are robust across data strata and match or exceed standard Nash learning performance without added cost.