Diffuseloco: Real-time legged locomotion control with diffusion from offline datasets

Huang, X · 2024 · arXiv 2404.19264

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

SMP: Reusable Score-Matching Motion Priors for Physics-Based Character Control

cs.GR · 2025-12-02 · conditional · novelty 7.0

SMP turns pre-trained motion diffusion models into task-agnostic, reusable reward functions via score distillation sampling, enabling style-specific and composable motion priors for humanoid control without retraining per task.

AID: Agent Intent from Diffusion for Multi-Agent Informative Path Planning

cs.RO · 2025-12-02 · conditional · novelty 7.0

AID trains diffusion policies via behavior cloning on existing MAIPP planners followed by RL fine-tuning to achieve faster execution and higher information gain in multi-agent coordination.

NavOL: Navigation Policy with Online Imitation Learning

cs.RO · 2026-05-12 · unverdicted · novelty 6.0

NavOL collects expert trajectory labels online from a global planner during policy rollouts in simulation to train a diffusion navigation policy, mitigating distribution shift and improving performance on visual navigation tasks.

Constraint-Aware Diffusion Priors for High-Fidelity and Versatile Quadruped Locomotion

cs.RO · 2026-05-09 · unverdicted · novelty 6.0 · 2 refs

Diff-CAST replaces GAN discriminators with diffusion-based priors and adds symmetric command conditioning plus constrained RL to enable versatile, drift-free, and hardware-safe quadruped locomotion.

Positive-Only Drifting Policy Optimization

cs.LG · 2026-04-15 · unverdicted · novelty 6.0

PODPO is a likelihood-free generative policy optimization method for online RL that steers actions to high-return regions using only positive-advantage samples and local contrastive drifting.

citing papers explorer

Showing 5 of 5 citing papers.

SMP: Reusable Score-Matching Motion Priors for Physics-Based Character Control cs.GR · 2025-12-02 · conditional · none · ref 1
SMP turns pre-trained motion diffusion models into task-agnostic, reusable reward functions via score distillation sampling, enabling style-specific and composable motion priors for humanoid control without retraining per task.
AID: Agent Intent from Diffusion for Multi-Agent Informative Path Planning cs.RO · 2025-12-02 · conditional · none · ref 11
AID trains diffusion policies via behavior cloning on existing MAIPP planners followed by RL fine-tuning to achieve faster execution and higher information gain in multi-agent coordination.
NavOL: Navigation Policy with Online Imitation Learning cs.RO · 2026-05-12 · unverdicted · none · ref 5
NavOL collects expert trajectory labels online from a global planner during policy rollouts in simulation to train a diffusion navigation policy, mitigating distribution shift and improving performance on visual navigation tasks.
Constraint-Aware Diffusion Priors for High-Fidelity and Versatile Quadruped Locomotion cs.RO · 2026-05-09 · unverdicted · none · ref 11 · 2 links
Diff-CAST replaces GAN discriminators with diffusion-based priors and adds symmetric command conditioning plus constrained RL to enable versatile, drift-free, and hardware-safe quadruped locomotion.
Positive-Only Drifting Policy Optimization cs.LG · 2026-04-15 · unverdicted · none · ref 6
PODPO is a likelihood-free generative policy optimization method for online RL that steers actions to high-return regions using only positive-advantage samples and local contrastive drifting.

Diffuseloco: Real-time legged locomotion control with diffusion from offline datasets

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer