Title resolution pending

· 2025 · arXiv 2509.10771

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 1 baseline 1

citation-polarity summary

baseline 1 unclear 1

representative citing papers

Betting for Sim-to-Real Performance Evaluation

cs.RO · 2026-04-27 · unverdicted · novelty 7.0

Betting mechanisms can yield provably more accurate and efficient estimates of real-world robot behavior than Monte Carlo sampling under specified conditions, with practical approximations demonstrated on synthetic data and a robotic manipulator task.

HALO: Hybrid Auto-encoded Locomotion with Learned Latent Dynamics, Poincar\'e Maps, and Regions of Attraction

cs.RO · 2026-04-20 · unverdicted · novelty 7.0

HALO learns latent reduced-order models with Poincaré maps for hybrid locomotion dynamics, allowing Lyapunov-based regions of attraction to be lifted from latent space to the full-order system.

Bounded Ratio Reinforcement Learning

cs.LG · 2026-04-20 · conditional · novelty 7.0

BRRL derives an analytic optimal policy for regularized constrained RL that guarantees monotonic improvement and yields the BPO algorithm that matches or exceeds PPO.

ViserDex: Visual Sim-to-Real for Robust Dexterous In-hand Reorientation

cs.RO · 2026-04-13 · unverdicted · novelty 6.0

A framework using 3D Gaussian Splatting for visual domain randomization enables robust monocular RGB-based dexterous in-hand reorientation on real hardware for multiple objects under varied lighting.

PriPG-RL: Privileged Planner-Guided Reinforcement Learning for Partially Observable Systems with Anytime-Feasible MPC

cs.LG · 2026-04-09 · unverdicted · novelty 6.0

PriPG-RL trains RL policies for POMDPs by distilling knowledge from a privileged anytime-feasible MPC planner into a P2P-SAC policy, improving sample efficiency and performance in partially observable robotic navigation.

FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control

cs.LG · 2026-04-06 · unverdicted · novelty 6.0 · 2 refs

FlashSAC improves training speed and final performance of off-policy RL on high-dimensional robot tasks by reducing update frequency, increasing model scale, and bounding norms to limit critic error accumulation.

Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning

cs.RO · 2025-11-06 · unverdicted · novelty 6.0

Isaac Lab is a unified GPU-native platform combining high-fidelity physics, photorealistic rendering, multi-frequency sensors, domain randomization, and learning pipelines for scalable multi-modal robot policy training.

RANDPOL: Parameter-Efficient End-to-End Quadruped Locomotion via Randomized Policy Learning

cs.LG · 2025-05-25 · unverdicted · novelty 6.0

RANDPOL achieves effective quadruped locomotion by training only the final linear readout of a randomly initialized and fixed neural network policy, matching PPO results with reduced parameters and enabling zero-shot sim-to-real transfer on Unitree Go2.

Terrain Consistent Reference-Guided RL for Humanoid Navigation Autonomy

cs.RO · 2026-05-15 · unverdicted · novelty 5.0

Terrain-consistent reference modulation during RL training yields SE(2)-controllable humanoid locomotion policies that improve tracking in simulation and enable over 70 m closed-loop autonomous navigation on rough terrain and stairs on the Unitree G1 with onboard computation.

Robotic Strawberry Harvesting with Robust Vision and Deep Reinforcement Learning based Sim-to-Real Control

cs.RO · 2026-05-22 · conditional · novelty 4.0

A modified YOLO segmentation model plus sim-trained PPO control yields 84.3% overall success harvesting 281 strawberries in greenhouse trials on a real UR10e manipulator.

The Unified Autonomy Stack: Toward a Blueprint for Generalizable Robot Autonomy

cs.RO · 2026-05-12 · accept · novelty 4.0

An open-sourced Unified Autonomy Stack fuses LiDAR, radar, vision and inertial data with sampling-based planning and control barrier functions to deliver resilient autonomy on aerial and ground robots in challenging real-world settings.

SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows

cs.RO · 2026-02-10

citing papers explorer

Showing 12 of 12 citing papers.

Betting for Sim-to-Real Performance Evaluation cs.RO · 2026-04-27 · unverdicted · none · ref 69
Betting mechanisms can yield provably more accurate and efficient estimates of real-world robot behavior than Monte Carlo sampling under specified conditions, with practical approximations demonstrated on synthetic data and a robotic manipulator task.
HALO: Hybrid Auto-encoded Locomotion with Learned Latent Dynamics, Poincar\'e Maps, and Regions of Attraction cs.RO · 2026-04-20 · unverdicted · none · ref 21
HALO learns latent reduced-order models with Poincaré maps for hybrid locomotion dynamics, allowing Lyapunov-based regions of attraction to be lifted from latent space to the full-order system.
Bounded Ratio Reinforcement Learning cs.LG · 2026-04-20 · conditional · none · ref 25
BRRL derives an analytic optimal policy for regularized constrained RL that guarantees monotonic improvement and yields the BPO algorithm that matches or exceeds PPO.
ViserDex: Visual Sim-to-Real for Robust Dexterous In-hand Reorientation cs.RO · 2026-04-13 · unverdicted · none · ref 27
A framework using 3D Gaussian Splatting for visual domain randomization enables robust monocular RGB-based dexterous in-hand reorientation on real hardware for multiple objects under varied lighting.
PriPG-RL: Privileged Planner-Guided Reinforcement Learning for Partially Observable Systems with Anytime-Feasible MPC cs.LG · 2026-04-09 · unverdicted · none · ref 33
PriPG-RL trains RL policies for POMDPs by distilling knowledge from a privileged anytime-feasible MPC planner into a P2P-SAC policy, improving sample efficiency and performance in partially observable robotic navigation.
FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control cs.LG · 2026-04-06 · unverdicted · none · ref 73 · 2 links
FlashSAC improves training speed and final performance of off-policy RL on high-dimensional robot tasks by reducing update frequency, increasing model scale, and bounding norms to limit critic error accumulation.
Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning cs.RO · 2025-11-06 · unverdicted · none · ref 88
Isaac Lab is a unified GPU-native platform combining high-fidelity physics, photorealistic rendering, multi-frequency sensors, domain randomization, and learning pipelines for scalable multi-modal robot policy training.
RANDPOL: Parameter-Efficient End-to-End Quadruped Locomotion via Randomized Policy Learning cs.LG · 2025-05-25 · unverdicted · none · ref 25
RANDPOL achieves effective quadruped locomotion by training only the final linear readout of a randomly initialized and fixed neural network policy, matching PPO results with reduced parameters and enabling zero-shot sim-to-real transfer on Unitree Go2.
Terrain Consistent Reference-Guided RL for Humanoid Navigation Autonomy cs.RO · 2026-05-15 · unverdicted · none · ref 32
Terrain-consistent reference modulation during RL training yields SE(2)-controllable humanoid locomotion policies that improve tracking in simulation and enable over 70 m closed-loop autonomous navigation on rough terrain and stairs on the Unitree G1 with onboard computation.
Robotic Strawberry Harvesting with Robust Vision and Deep Reinforcement Learning based Sim-to-Real Control cs.RO · 2026-05-22 · conditional · none · ref 40
A modified YOLO segmentation model plus sim-trained PPO control yields 84.3% overall success harvesting 281 strawberries in greenhouse trials on a real UR10e manipulator.
The Unified Autonomy Stack: Toward a Blueprint for Generalizable Robot Autonomy cs.RO · 2026-05-12 · accept · none · ref 235
An open-sourced Unified Autonomy Stack fuses LiDAR, radar, vision and inertial data with sampling-based planning and control barrier functions to deliver resilient autonomy on aerial and ground robots in challenging real-world settings.
SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows cs.RO · 2026-02-10 · unreviewed · ref 46

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer