Shankar Sastry, Jiajun Wu, Koushil Sreenath, Saurabh Gupta, and Xue Bin Peng

Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies , author= · 2024 · arXiv 2410.11825

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Difference-Aware Retrieval Policies for Imitation Learning

cs.RO · 2026-06-08 · conditional · novelty 6.0

DARP reparameterizes imitation learning around local neighborhood structure using k-NN expert states, actions, and relative distance vectors, delivering 15-46% gains over behavior cloning in control and manipulation tasks.

Empowering Multi-Robot Cooperation via Sequential World Models

cs.RO · 2025-09-16 · unverdicted · novelty 6.0

SeqWM introduces sequential autoregressive agent-wise world models for multi-robot MBRL, outperforming baselines in performance and sample efficiency on Bi-DexHands and Multi-Quadruped tasks with physical robot deployment.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Difference-Aware Retrieval Policies for Imitation Learning cs.RO · 2026-06-08 · conditional · none · ref 39
DARP reparameterizes imitation learning around local neighborhood structure using k-NN expert states, actions, and relative distance vectors, delivering 15-46% gains over behavior cloning in control and manipulation tasks.

Shankar Sastry, Jiajun Wu, Koushil Sreenath, Saurabh Gupta, and Xue Bin Peng

fields

years

verdicts

representative citing papers

citing papers explorer