Counterfactual behavior cloning: Offline imitation learning from imperfect human demonstra- tions.arXiv preprint arXiv:2505.10760, 2025

Shahabedin Sagheb, Dylan P Losey · 2025 · arXiv 2505.10760

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Set-Supervised Diffusion Policy: Learning Action-Chunking Diffusion through Corrections

cs.RO · 2026-06-01 · unverdicted · novelty 6.0

SDP constructs sets of desired action-chunks from human correction pairs and trains diffusion policies to align with those sets, yielding better performance and robustness than standard behavior cloning on robotic tasks.

citing papers explorer

Showing 1 of 1 citing paper.

Set-Supervised Diffusion Policy: Learning Action-Chunking Diffusion through Corrections cs.RO · 2026-06-01 · unverdicted · none · ref 31
SDP constructs sets of desired action-chunks from human correction pairs and trains diffusion policies to align with those sets, yielding better performance and robustness than standard behavior cloning on robotic tasks.

Counterfactual behavior cloning: Offline imitation learning from imperfect human demonstra- tions.arXiv preprint arXiv:2505.10760, 2025

fields

years

verdicts

representative citing papers

citing papers explorer