Primal Wasserstein Imitation Learning

Robert Dadashi, Léonard Hussenot, Matthieu Geist, Olivier Pietquin · 2020 · arXiv 2006.04678

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

Failure Identification in Imitation Learning Via Statistical and Semantic Filtering

cs.RO · 2026-04-15 · unverdicted · novelty 7.0

FIDeL detects failures in imitation learning by building compact nominal representations via optimal transport, applying conformal prediction thresholds, and using VLMs for semantic filtering, outperforming baselines by 5.3% AUROC and 17.38% accuracy on the new BotFails dataset.

TimeRewarder: Learning Dense Reward from Passive Videos via Frame-wise Temporal Distance

cs.AI · 2025-09-30 · unverdicted · novelty 6.0

TimeRewarder derives step-wise progress rewards from frame-wise temporal distances in passive videos and uses them to guide RL, achieving high success rates on Meta-World tasks with fewer interactions than prior methods or hand-designed rewards.

citing papers explorer

Showing 2 of 2 citing papers.

Failure Identification in Imitation Learning Via Statistical and Semantic Filtering cs.RO · 2026-04-15 · unverdicted · none · ref 51
TimeRewarder: Learning Dense Reward from Passive Videos via Frame-wise Temporal Distance cs.AI · 2025-09-30 · unverdicted · none · ref 3
