SD2AIL: Adversarial Imitation Learning from Synthetic Demonstrations via Diffusion Models

· 2025 · cs.LG · arXiv 2512.18583

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Adversarial Imitation Learning (AIL) is a dominant framework in imitation learning that infers rewards from expert demonstrations to guide policy optimization. Although providing more expert demonstrations typically leads to improved performance and greater stability, collecting such demonstrations can be challenging in certain scenarios. Inspired by the success of diffusion models in data generation, we propose SD2AIL, which utilizes synthetic demonstrations via diffusion models. We first employ a diffusion model in the discriminator to generate synthetic demonstrations as pseudo-expert data that augment the expert demonstrations. To selectively replay the most valuable demonstrations from the large pool of (pseudo-) expert demonstrations, we further introduce a prioritized expert demonstration replay strategy (PEDR). The experimental results on simulation tasks demonstrate the effectiveness and robustness of our method. In particular, in the Hopper task, our method achieves an average return of 3441, surpassing the state-of-the-art method by 89. Our code will be available at https://github.com/positron-lpc/SD2AIL.

representative citing papers

SD2AIL: Adversarial Imitation Learning from Synthetic Demonstrations via Diffusion Models

cs.LG · 2025-12-21 · conditional · novelty 6.0

SD2AIL generates synthetic expert demonstrations via diffusion models and prioritizes them with PEDR to improve adversarial imitation learning performance on simulation benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

SD2AIL: Adversarial Imitation Learning from Synthetic Demonstrations via Diffusion Models cs.LG · 2025-12-21 · conditional · none · ref 2 · internal anchor
SD2AIL generates synthetic expert demonstrations via diffusion models and prioritizes them with PEDR to improve adversarial imitation learning performance on simulation benchmarks.

SD2AIL: Adversarial Imitation Learning from Synthetic Demonstrations via Diffusion Models

fields

years

verdicts

representative citing papers

citing papers explorer