Rsl-rl: A learning library for robotics research,

· 2025 · arXiv 2509.10771

16 Pith papers cite this work. Polarity classification is still indexing.

16 Pith papers citing it

read on arXiv browse 16 citing papers

citation-role summary

background 1 baseline 1

citation-polarity summary

baseline 1 unclear 1

representative citing papers

Beyond Binary: Sim-to-Real Dexterous Manipulation with Physics-Grounded Contact Representation

cs.RO · 2026-05-27 · unverdicted · novelty 7.0

CoP tactile representation with differentiable calibration enables zero-shot sim-to-real transfer and outperforms binary and raw-taxel baselines on peg-in-hole insertion and ball balancing with a multi-fingered hand.

Betting for Sim-to-Real Performance Evaluation

cs.RO · 2026-04-27 · unverdicted · novelty 7.0

Betting mechanisms can yield provably more accurate and efficient estimates of real-world robot behavior than Monte Carlo sampling under specified conditions, with practical approximations demonstrated on synthetic data and a robotic manipulator task.

HALO: Hybrid Auto-encoded Locomotion with Learned Latent Dynamics, Poincar\'e Maps, and Regions of Attraction

cs.RO · 2026-04-20 · unverdicted · novelty 7.0

HALO learns latent reduced-order models with Poincaré maps for hybrid locomotion dynamics, allowing Lyapunov-based regions of attraction to be lifted from latent space to the full-order system.

Bounded Ratio Reinforcement Learning

cs.LG · 2026-04-20 · conditional · novelty 7.0

BRRL derives an analytic optimal policy for regularized constrained RL that guarantees monotonic improvement and yields the BPO algorithm that matches or exceeds PPO.

S-Cheetah: A Novel Quadrupedal Robot with a 3-DOF Active Spine Learning Agile Locomotion

cs.RO · 2026-05-27 · unverdicted · novelty 6.0

A quadruped robot with a three-degree-of-freedom active spine reaches 6.9 m/s top speed and 7.2 rad/s turning rate via an RL framework that rewards spine engagement and gallop gaits.

ViserDex: Visual Sim-to-Real for Robust Dexterous In-hand Reorientation

cs.RO · 2026-04-13 · unverdicted · novelty 6.0

A framework using 3D Gaussian Splatting for visual domain randomization enables robust monocular RGB-based dexterous in-hand reorientation on real hardware for multiple objects under varied lighting.

PriPG-RL: Privileged Planner-Guided Reinforcement Learning for Partially Observable Systems with Anytime-Feasible MPC

cs.LG · 2026-04-09 · unverdicted · novelty 6.0

PriPG-RL trains RL policies for POMDPs by distilling knowledge from a privileged anytime-feasible MPC planner into a P2P-SAC policy, improving sample efficiency and performance in partially observable robotic navigation.

FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control

cs.LG · 2026-04-06 · unverdicted · novelty 6.0 · 2 refs

FlashSAC improves training speed and final performance of off-policy RL on high-dimensional robot tasks by reducing update frequency, increasing model scale, and bounding norms to limit critic error accumulation.

Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning

cs.RO · 2025-11-06 · unverdicted · novelty 6.0

Isaac Lab is a unified GPU-native platform combining high-fidelity physics, photorealistic rendering, multi-frequency sensors, domain randomization, and learning pipelines for scalable multi-modal robot policy training.

RANDPOL: Parameter-Efficient End-to-End Quadruped Locomotion via Randomized Policy Learning

cs.LG · 2025-05-25 · unverdicted · novelty 6.0

RANDPOL achieves effective quadruped locomotion by training only the final linear readout of a randomly initialized and fixed neural network policy, matching PPO results with reduced parameters and enabling zero-shot sim-to-real transfer on Unitree Go2.

PPO-EAL: Exact Augmented Lagrangian Proximal Policy Optimization for Safe Robotic Control

cs.RO · 2026-06-26 · unverdicted · novelty 5.0

PPO-EAL integrates exact augmented Lagrangian optimization into PPO for safe robotic control, with claimed theoretical guarantees and better empirical safety-performance tradeoffs on several robot benchmarks including sim-to-real gear assembly.

Terrain Consistent Reference-Guided RL for Humanoid Navigation Autonomy

cs.RO · 2026-05-15 · unverdicted · novelty 5.0

Terrain-consistent reference modulation during RL training yields SE(2)-controllable humanoid locomotion policies that improve tracking in simulation and enable over 70 m closed-loop autonomous navigation on rough terrain and stairs on the Unitree G1 with onboard computation.

Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient

cs.RO · 2026-05-26 · unverdicted · novelty 4.0

SDPG is a new on-policy visual RL algorithm that estimates gradients via stochastic perturbations of rollouts, achieving faster training and lower memory use than baselines on visual MuJoCo tasks while adding new robotics benchmarks and sim-to-real results.

Robotic Strawberry Harvesting with Robust Vision and Deep Reinforcement Learning based Sim-to-Real Control

cs.RO · 2026-05-22 · conditional · novelty 4.0

A modified YOLO segmentation model plus sim-trained PPO control yields 84.3% overall success harvesting 281 strawberries in greenhouse trials on a real UR10e manipulator.

The Unified Autonomy Stack: Toward a Blueprint for Generalizable Robot Autonomy

cs.RO · 2026-05-12 · accept · novelty 4.0

An open-sourced Unified Autonomy Stack fuses LiDAR, radar, vision and inertial data with sampling-based planning and control barrier functions to deliver resilient autonomy on aerial and ground robots in challenging real-world settings.

SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows

cs.RO · 2026-02-10

citing papers explorer

Showing 4 of 4 citing papers after filters.

Bounded Ratio Reinforcement Learning cs.LG · 2026-04-20 · conditional · none · ref 25
BRRL derives an analytic optimal policy for regularized constrained RL that guarantees monotonic improvement and yields the BPO algorithm that matches or exceeds PPO.
PriPG-RL: Privileged Planner-Guided Reinforcement Learning for Partially Observable Systems with Anytime-Feasible MPC cs.LG · 2026-04-09 · unverdicted · none · ref 33
PriPG-RL trains RL policies for POMDPs by distilling knowledge from a privileged anytime-feasible MPC planner into a P2P-SAC policy, improving sample efficiency and performance in partially observable robotic navigation.
FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control cs.LG · 2026-04-06 · unverdicted · none · ref 73 · 2 links
FlashSAC improves training speed and final performance of off-policy RL on high-dimensional robot tasks by reducing update frequency, increasing model scale, and bounding norms to limit critic error accumulation.
RANDPOL: Parameter-Efficient End-to-End Quadruped Locomotion via Randomized Policy Learning cs.LG · 2025-05-25 · unverdicted · none · ref 25
RANDPOL achieves effective quadruped locomotion by training only the final linear readout of a randomly initialized and fixed neural network policy, matching PPO results with reduced parameters and enabling zero-shot sim-to-real transfer on Unitree Go2.

Rsl-rl: A learning library for robotics research,

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer