archive

Every paper Pith has read. Search by title, abstract, or pith.

2900 papers in cs.RO · page 10

cs.RO 2026-05-12 reviewed

Unified VLA model beats human drivers on driving benchmark
MindVLA-U1: VLA Beats VA with Unified Streaming Architecture for Autonomous Driving

Yuzhou Huang +8
cs.RO 2026-05-12 reviewed

Streaming intent produces controllable driving plans in end-to-end model
Action Emergence from Streaming Intent

Pengfei Jing +5
cs.RO 2026-05-12 reviewed

Streaming intent steers driving AI to distinct plans
Action Emergence from Streaming Intent

Pengfei Jing +5
cs.RO 2026-05-12 reviewed

Benchmark finds successful robot tasks often unsafe
SafeManip: A Property-Driven Benchmark for Temporal Safety Evaluation in Robotic Manipulation

Chengyue Huang +4

1 Piths
cs.RO 2026-05-12 reviewed

Specialized heads boost VLA robot success in and out of domain
GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization

Xiaosong Jia +19
cs.RO 2026-05-12 reviewed

IMU suit teleoperates humanoid robot in real time with stable motions
Real-Time Whole-Body Teleoperation of a Humanoid Robot Using IMU-Based Motion Capture with Sim2Sim and Sim2Real Validation

Hamza Ahmed Durrani +1
cs.CV 2026-05-12 reviewed

Stereo event cameras track 3D hand poses at 30 mm error
EgoEV-HandPose: Egocentric 3D Hand Pose Estimation and Gesture Recognition with Stereo Event Cameras

Luming Wang +4
cs.RO 2026-05-12 reviewed

One diffusion policy learns both search and insertion
SI-Diff: A Framework for Learning Search and High-Precision Insertion with a Force-Domain Diffusion Policy

Yibo Liu +7
cs.RO 2026-05-12 reviewed

Timestep modulation turns diffusion pretraining into efficient robot exploration
TMRL: Diffusion Timestep-Modulated Pretraining Enables Exploration for Efficient Policy Finetuning

Matthew M. Hong +3
cs.RO 2026-05-12 reviewed

Symmetry prior speeds up bimanual robot learning
Morphologically Equivariant Flow Matching for Bimanual Mobile Manipulation

Max Siebenborn +6
cs.CV 2026-05-12 reviewed

Three height bands give 49-FPS LiDAR pedestrian detection
TriBand-BEV: Real-Time LiDAR-Only 3D Pedestrian Detection via Height-Aware BEV and High-Resolution Feature Fusion

Mohammad Khoshkdahan +1
cs.RO 2026-05-12 reviewed

Virtual objectives stabilize twist retargeting for dexterous hands
DexTwist: Dexterous Hand Retargeting for Twist Motion via Mixed Reality-based Teleoperation

Dongmyoung Lee +2
cs.RO 2026-05-12 reviewed

Mixture of inverse models turns robot video predictions into actions
From Imagined Futures to Executable Actions: Mixture of Latent Actions for Robot Manipulation

Yajie Li +7
cs.RO 2026-05-12 reviewed

Bidirectional pose-action loop boosts robot manipulation
X-Imitator: Spatial-Aware Imitation Learning via Bidirectional Action-Pose Interaction

Kai Xiong +3
cs.RO 2026-05-12 reviewed

Premover cuts robot task time 13.6% by acting on incomplete commands
Premover: Fast Vision-Language-Action Control by Acting Before Instructions Are Complete

Joonha Park +2
cs.RO 2026-05-12 reviewed

OrbiSim turns world models into differentiable physics engines
OrbiSim: World Models as Differentiable Physics Engines for Embodied Intelligence

Jiajian Li +5
cs.RO 2026-05-12 reviewed

World models merge with action generation for embodied AI
World Action Models: The Next Frontier in Embodied AI

Siyin Wang +13
cs.RO 2026-05-12 reviewed

QOED focuses robot exploration on identifiable parameters
Learning What Matters: Adaptive Information-Theoretic Objectives for Robot Exploration

Youwei Yu +4
cs.RO 2026-05-12 reviewed

INDI yields lower position errors than geometric NDI on hexarotors
Control of Fully Actuated Aerial Vehicles: A Comparison of Model-based and Sensor-based Dynamic Inversion

Ali Sidar Yilmaz +3
cs.HC 2026-05-12 reviewed

Robot execution and AI chat boost student code reflection
RoboBlockly Studio: Conversational Block Programming with Embodied Robot Feedback for Computational Thinking

Leyi Li +4
cs.RO 2026-05-12 reviewed

Motion statecharts execute semantic tasks on eight robot platforms
Closing the Motion Execution Gap: From Semantic Motion Task Constraints to Kinematic Control

Simon Stelter +3
cs.RO 2026-05-12 reviewed

Robot blocks unsafe merges at blind intersections
Cooperative Robotics Reinforced by Collective Perception for Traffic Moderation

Mohammad Khoshkdahan +3
cs.RO 2026-05-12 reviewed

Pre-planned graph branches let robots recover from failures instantly
From Reaction to Anticipation: Proactive Failure Recovery through Agentic Task Graph for Robotic Manipulation

Sheng Xu +8
cs.RO 2026-05-12 reviewed

LLM evolution designs superior robot navigation rewards
EvoNav: Evolutionary Reward Function Design for Robot Navigation with Large Language Models

Zhikai Zhao +6
cs.RO 2026-05-12 reviewed

Multi-view latents and manifold actions boost VLA robotic success
Learning Action Manifold with Multi-view Latent Priors for Robotic Manipulation

Junjin Xiao +11
cs.RO 2026-05-12 reviewed

Body regions dictate robot affective touch strategies
Mapping Embodied Affective Touch Strategies on a Humanoid Robot

Qiaoqiao Ren +5
cs.RO 2026-05-12 reviewed

New sampler prunes robot vision tokens to under 10% with no accuracy loss
See What Matters: Differentiable Grid Sample Pruning for Generalizable Vision-Language-Action Model

Yixu Feng +6
cs.RO 2026-05-12 reviewed

Grid sampler trims VLA tokens to under 10% with full success
See What Matters: Differentiable Grid Sample Pruning for Generalizable Vision-Language-Action Model

Yixu Feng +6
cs.RO 2026-05-12 reviewed

Online imitation learning improves navigation via privileged planner labels
NavOL: Navigation Policy with Online Imitation Learning

Xiaofei Wei +2
cs.RO 2026-05-12 reviewed

Robots dream short futures to dodge manipulation failures
DreamAvoid: Critical-Phase Test-Time Dreaming to Avoid Failures in VLA Policies

Xianzhe Fan +6
cs.RO 2026-05-12 reviewed

Surfaces guide soft gripper to grasp paper sheets
Introducing Environmental Constraints to Grasping Strategies for Paper-Like Flexible Materials Using a Soft Gripper

Yi Dong +3
cs.RO 2026-05-12 reviewed

Geometry tuning lets Rainbow DQN master cooperative insertions
Rainbow Deep Q-Learning with Kinematics-Aware Design for Cooperative Delta and 3-RRS Parallel Robot Insertion

Hassen Nigatu +4
cs.RO 2026-05-12 reviewed

IEKF and smoother cut long-term error versus MUSE on quadruped data
A Proprioceptive-Only Benchmark for Quadruped State Estimation: ATE, RPE, and Runtime Trade-offs Between Filters and Smoothers

Ylenia Nistic\`o +3
cs.RO 2026-05-12 reviewed

One prompt generates full robot learning pipelines
Nautilus: From One Prompt to Plug-and-Play Robot Learning

Yufeng Jin +10
cs.CV 2026-05-12 reviewed

SkyPart discovers semantic parts in drone and satellite images using competing learnable…
Weather-Robust Cross-View Geo-Localization via Prototype-Based Semantic Part Discovery

Chi-Nguyen Tran +5
cs.CV 2026-05-12 reviewed

Learnable prototypes separate layout from texture in geo-matching
Weather-Robust Cross-View Geo-Localization via Prototype-Based Semantic Part Discovery

Chi-Nguyen Tran +5
cs.RO 2026-05-12 reviewed

Planner achieves zero tip error for continuum robots on arms
Sampling-Based Follow-the-Leader Motion Planning for Manipulator-Mounted Continuum Robots

Chengnan Shentu +4
cs.RO 2026-05-12 reviewed

Lightweight Python layer lets users swap robot bodies with little code
RIO: Flexible Real-Time Robot I/O for Cross-Embodiment Robot Learning

Pablo Ortega-Kral +15
eess.SP 2026-05-12 reviewed

Diffusion model upgrades low-cost IMU to virtual high-grade data
Overcoming the Intrinsic Performance Limitations of MEMS IMU via Diffusion-Based Generative Learning

Jiarui Lv +2
cs.RO 2026-05-12 reviewed

Benchmark shows intent resolution bottlenecks LLM household agents
PRISM: : Planning and Reasoning with Intent in Simulated Embodied Environments

Yunn Kang Lim +6
cs.RO 2026-05-12 reviewed

Single-agent demos plus cost produce coordinated multi-agent policies
Coordinated Diffusion: Generating Multi-Agent Behavior Without Multi-Agent Demonstrations

Lasse Peters +3
cs.RO 2026-05-12 reviewed

Liveness operator cuts truncation bias in robot policy evaluation
Offline Policy Evaluation for Manipulation Policies via Discounted Liveness Formulation

Hao Wang +3
cs.AI 2026-05-12 reviewed

PPO reformulated to beat SAC in multi-task RL
TOPPO: Rethinking PPO for Multi-Task Reinforcement Learning with Critic Balancing

Yuanpeng Li +3
cs.RO 2026-05-12 reviewed

Quadratic cost correction lifts VLA success 28.8% in dynamic scenes
Overcoming Dynamics-Blindness: Training-Free Pace-and-Path Correction for VLA Models

Yanyan Zhang +8
cs.RO 2026-05-12 reviewed

Training-free fix lifts VLA success rates up to 28.8% in dynamic scenes
Overcoming Dynamics-Blindness: Training-Free Pace-and-Path Correction for VLA Models

Yanyan Zhang +8
cs.LG 2026-05-12 reviewed

Mode discovery prevents collapse in RL-tuned generative policies
Behavioral Mode Discovery for Fine-tuning Multimodal Generative Policies

Alberta Longhini +3
cs.CV 2026-05-12 reviewed

MRF joint aligner reduces collisions in multi-agent paths
JACoP: Joint Alignment for Compliant Multi-Agent Prediction

Qingze Liu +5
cs.RO 2026-05-12 reviewed

Kairos cuts physical AI task latency by 32-66 percent
Kairos: A Scalable Serving System for Physical AI

Yinwei Dai +5
cs.LG 2026-05-11 reviewed

RL policy learns safe sparse timing via Lyapunov shield
Learning When to Act: Communication-Efficient Reinforcement Learning via Run-Time Assurance

Adam Haroon +3
cs.RO 2026-05-11 reviewed

Spinning single-propeller drone reduces visibility via motion blur
Computational Design of a Low-Visibility UAV Using a Human-Aligned Perceptual Metric

Jingxian Wang +5