//arxiv.org/abs/1909.06586

Donghyun Kim, Jared Di Carlo, Benjamin Katz, Gerardo Bledt, Sangbae Kim · 2019 · arXiv 1909.06586

10 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 1 · unclear 1

representative citing papers

Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

Recasts sampling-based nonconvex optimization as smoothed gradient descent to obtain non-asymptotic convergence guarantees and introduces the DIDA annealed algorithm that converges to the global optimum.

Neuromorphic Reinforcement Learning for Quadruped Locomotion Control on Uneven Terrain

cs.NE · 2026-05-10 · unverdicted · novelty 6.0

EP-based PPO with CPG and residual policies matches standard PPO performance on 12-DoF quadruped uneven-terrain locomotion while using 4.3 times less GPU memory during training.

Constraint-Enhanced Reinforcement Learning Based on Dynamic Decoupled Spherical Radial Squashing

cs.LG · 2026-05-05 · unverdicted · novelty 6.0

DD-SRad is a new RL constraint technique that adapts per-actuator radii dynamically to achieve zero violations and unconstrained-level task performance on heterogeneous robotic joints.

Trajectory-based actuator identification via differentiable simulation

cs.RO · 2026-04-11 · unverdicted · novelty 6.0

Differentiable simulation enables torque-sensor-free actuator model identification from trajectory data, achieving 1.88x better position tracking than a stand-trained baseline and 46% longer travel in downstream locomotion policies.

Watch Your Step: Learning Semantically-Guided Locomotion in Cluttered Environment

cs.RO · 2026-03-03 · unverdicted · novelty 6.0

SemLoco is a reinforcement learning system that integrates semantic understanding with foothold planning to let legged robots navigate cluttered environments without stepping on sensitive low-lying objects.

A Reconfigured Wheel-Legged Robot for Enhanced Steering and Adaptability

cs.RO · 2025-07-30 · conditional · novelty 6.0

FLORES is a wheel-legged robot with front-leg hip-yaw DoFs replacing hip-roll, paired with a custom RL controller using adapted HIM and tailored rewards for smooth wheeled-to-legged transitions and efficient gaits.

Iteratively Learning Muscle Memory for Legged Robots to Master Adaptive and High Precision Locomotion

cs.RO · 2025-07-18 · unverdicted · novelty 6.0

Integrates iterative learning control with a torque library to enable high-precision adaptive locomotion on bipedal and quadrupedal robots, reducing tracking errors by up to 85% and achieving over 30x faster control rates.

TAG-K: Tail-Averaged Greedy Kaczmarz for Computationally Efficient and Performant Online Inertial Parameter Estimation

cs.RO · 2025-10-06 · unverdicted · novelty 5.0

TAG-K combines greedy randomized Kaczmarz row selection with tail averaging to deliver faster convergence and noise robustness for online inertial parameter estimation in robotics.

Right Model, Right Time: Real-Time Cascaded-Fidelity MPC for Bipedal Walking

cs.RO · 2026-05-06 · unverdicted · novelty 4.0

Multi-phase whole-body MPC for bipedal locomotion uses a detailed model for the near-term part of the horizon and a simplified model for the rest, solved via acados SQP without preselected footsteps and validated in simulation.

Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input

cs.RO · 2026-04-21 · unverdicted · novelty 4.0

Sparsely gated MoE policies double the success rate of a real Unitree Go2 quadruped on large-obstacle parkour versus matched-active-parameter MLP baselines while cutting inference time compared with a scaled-up MLP.

citing papers explorer

Showing 10 of 10 citing papers.

Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing · cs.LG · 2026-05-15 · unverdicted · none · ref 120
Neuromorphic Reinforcement Learning for Quadruped Locomotion Control on Uneven Terrain · cs.NE · 2026-05-10 · unverdicted · none · ref 16
Constraint-Enhanced Reinforcement Learning Based on Dynamic Decoupled Spherical Radial Squashing · cs.LG · 2026-05-05 · unverdicted · none · ref 4
Trajectory-based actuator identification via differentiable simulation · cs.RO · 2026-04-11 · unverdicted · none · ref 2
Watch Your Step: Learning Semantically-Guided Locomotion in Cluttered Environment · cs.RO · 2026-03-03 · unverdicted · none · ref 11
A Reconfigured Wheel-Legged Robot for Enhanced Steering and Adaptability · cs.RO · 2025-07-30 · conditional · none · ref 2
Iteratively Learning Muscle Memory for Legged Robots to Master Adaptive and High Precision Locomotion · cs.RO · 2025-07-18 · unverdicted · none · ref 26
TAG-K: Tail-Averaged Greedy Kaczmarz for Computationally Efficient and Performant Online Inertial Parameter Estimation · cs.RO · 2025-10-06 · unverdicted · none · ref 4
Right Model, Right Time: Real-Time Cascaded-Fidelity MPC for Bipedal Walking · cs.RO · 2026-05-06 · unverdicted · none · ref 2
Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input · cs.RO · 2026-04-21 · unverdicted · none · ref 13
