archive

Every paper Pith has read.

2900 papers in cs.RO

cs.RO 2026-05-14 reviewed

Diffusion policy opens pull doors with base and dual arms
Diffusion Policy for Coordinated Control of a Nonholonomic Mobile Base and Dual Arms in Door Opening and Passing

Shangqun Yu +6
cs.RO 2026-05-14 reviewed

Hybrid video and MoCap data enable zero-shot humanoid motion tracking
HoloMotion-1 Technical Report

Maiyue Chen +9
cs.RO 2026-05-14 reviewed

Hybrid video and capture data enables zero-shot humanoid tracking
HoloMotion-1 Technical Report

Maiyue Chen +9
cs.RO 2026-05-14 reviewed

Human video builds physical smarts for top robot policies
PhysBrain 1.0 Technical Report

Shijie Lian +12
cs.RO 2026-05-14 reviewed

Blended human-robot control cuts dexterous errors by 87%
Hand-in-the-Loop: Improving VLA Policies for Dexterous Manipulation via Seamless Hand-Arm Intervention

Zhuohang Li +7
cs.RO 2026-05-14 reviewed

Seamless blending cuts robot-hand takeover jitter by 99.8%
Hand-in-the-Loop: Improving VLA Policies for Dexterous Manipulation via Seamless Hand-Arm Intervention

Zhuohang Li +7
cs.RO 2026-05-14 reviewed

One model tops VLM benchmarks and robot action tasks
Pelican-Unify 1.0: A Unified Embodied Intelligence Model for Understanding, Reasoning, Imagination and Action

Yi Zhang +28
cs.RO 2026-05-14 reviewed

Single model tops VLM and world benchmarks while ranking near first on robot actions
Pelican-Unify 1.0: A Unified Embodied Intelligence Model for Understanding, Reasoning, Imagination and Action

Yi Zhang +28
cs.RO 2026-05-14 reviewed

CLOVER reaches new NAVSIM high for driving planners
CLOVER: Closed-Loop Value Estimation and Ranking for End-to-End Autonomous Driving Planning

Sining Ang +3
cs.CV 2026-05-14 reviewed

VLMs fail to locate hidden functional objects from task instructions
SceneFunRI: Reasoning the Invisible for Task-Driven Functional Object Localization

Posheng Chen +4
cs.RO 2026-05-14 reviewed

State space models let robot policies use full observation history
DSSP: Diffusion State Space Policy with Full-History Encoding

Zhiyuan Guan +7
cs.CV 2026-05-14 reviewed

Two uncalibrated cameras track joint angles with 6-degree accuracy
Agentic Pipeline for Self-Synchronized Multiview Joint Angle Monitoring in Uncalibrated Environments

Juncheng Yu +3
cs.RO 2026-05-14 reviewed

Alignment lets robots predict tactile sensations from sight
Let Robots Feel Your Touch: Visuo-Tactile Cortical Alignment for Embodied Mirror Resonance

Tianfang Zhu +6
cs.GR 2026-05-14 reviewed

Unified GPU solver gives exact gradients for stiff heterogeneous soft bodies
DiffPhD: A Unified Differentiable Solver for Projective Heterogeneous Materials in Elastodynamics with Contact-Rich GPU-Acceleration

Shih-Yu Lai +11
cs.RO 2026-05-14 reviewed

DAJI anticipates joint intent for language-driven humanoid robots
Before the Body Moves: Learning Anticipatory Joint Intent for Language-Conditioned Humanoid Control

Haozhe Jia +11
cs.RO 2026-05-14 reviewed

Language lets humanoids plan moves before the body starts
Before the Body Moves: Learning Anticipatory Joint Intent for Language-Conditioned Humanoid Control

Haozhe Jia +11
cs.RO 2026-05-14 reviewed

Intermediate foot springs cut quadruped energy use by 17%
Energy-Efficient Quadruped Locomotion with Compliant Feet

Pramod Pal +5
cs.CV 2026-05-14 reviewed

Diffusion models uncover semantic attacks on vehicle maps
Systematic Discovery of Semantic Attacks in Online Map Construction through Conditional Diffusion

Chenyi Wang +7
cs.RO 2026-05-14 reviewed

Distill refines robot task specs to match true user intent
Distill: Uncovering the True Intent behind Human-Robot Communication

Ting Li +1
cs.RO 2026-05-14 reviewed

Robots dodge obstacles with partial maps via local path fixes
Reactive Planning based Control for Mobile Robots in Obstacle-Cluttered Environments

Li Tan +3
cs.RO 2026-05-13 reviewed

MAPLE trains end-to-end driving models through reactive multi-agent rollouts performed…
MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving

Rajeev Yasarla +11
cs.RO 2026-05-13 reviewed

Latent multi-agent rollouts boost closed-loop driving performance
MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving

Rajeev Yasarla +11
cs.RO 2026-05-13 reviewed

GCS optimization matches nonlinear AV trajectories efficiently
Motion Planning for Autonomous Vehicles using Optimization over Graphs of Convex Sets

Matheus Wagner +1
cs.RO 2026-05-13 reviewed

CVaR policies top safety verification rates in robot navigation
Safety-Constrained Reinforcement Learning with Post-Training Reachability Verification for Robot Navigation

Qisong He +6
cs.CV 2026-05-13 reviewed

Sparse relevance heads cut ViT 3D detection time by 3x
SToRe3D: Sparse Token Relevance in ViTs for Efficient Multi-View 3D Object Detection

Sandro Papais +3
cs.RO 2026-05-13 reviewed

Low-res egocentric images suffice for robot active perception
Behavior Cloning for Active Perception with Low-Resolution Egocentric Vision

Anthony Bilic +2
cs.RO 2026-05-13 reviewed

Target distribution from demos lets robots explore while staying grounded
Ergodic Imitation for Adaptive Exploration around Demonstrations

Ziyi Xu +3
cs.LG 2026-05-13 reviewed

Action history prior straightens robot flow paths
WarmPrior: Straightening Flow-Matching Policies with Temporal Priors

Sinjae Kang +4
cs.RO 2026-05-13 reviewed

Guidance sets safe speed for loiter UAV corridor reinsertion
Loiter UAV Reinsertion Guidance for Fixed-wing UAV Corridors

Pradeep J +2
cs.CV 2026-05-13 reviewed

One diffusion model generates LiDAR scans across eight domains
OmniLiDAR: A Unified Diffusion Framework for Multi-Domain 3D LiDAR Generation

Youquan Liu +11
cs.RO 2026-05-13 reviewed

Language models turn satellite images into UAV search priors
LMPath: Language-Mediated Priors and Path Generation for Aerial Exploration

Jonathan A. Diller +3
cs.RO 2026-05-13 reviewed

Draft model cuts diffusion VLA latency to 19 ms
Realtime-VLA FLASH: Speculative Inference Framework for Diffusion-based VLAs

Jiahui Niu +7
cs.RO 2026-05-13 reviewed

RoboEvolve pairs a vision-language planner with a video-generation simulator in a…
RoboEvolve: Co-Evolving Planner-Simulator for Robotic Manipulation with Limited Data

Harold Haodong Chen +4
cs.RO 2026-05-13 reviewed

FrameSkip lifts VLA success from 66.5% to 76.2% with 20% frames
FrameSkip: Learning from Fewer but More Informative Frames in VLA Training

Bin Yu +10
cs.RO 2026-05-13 reviewed

One demo lets robots build walls of any length
Manipulation Planning for Construction Activities with Repetitive Tasks

Wangyi Liu +4
cs.RO 2026-05-13 reviewed

CARS attributes AV collisions to driver faults
Learning Responsibility-Attributed Adversarial Scenarios for Testing Autonomous Vehicles

Yizhuo Xiao +7
cs.RO 2026-05-13 reviewed

TinySDP runs real-time SDP on microcontrollers for robot control
TinySDP: Real Time Semidefinite Optimization for Certifiable and Agile Edge Robotics

Ishaan Mahajan +6
cs.RO 2026-05-13 reviewed

Latent actions unify control across robot bodies
SCAR: Self-Supervised Continuous Action Representation Learning

Hongjia Liu +5
cs.RO 2026-05-13 reviewed

Monocular RGB builds accurate 3D scene graphs room by room
LEXI-SG: Monocular 3D Scene Graph Mapping with Room-Guided Feed-Forward Reconstruction

Christina Kassab +4
eess.SY 2026-05-13 reviewed

Sliding-mode law hits targets on exact schedule inside acceleration limits
Bounded-Input True Proportional Navigation for Impact-Time Control

Lohitvel Gopikannan +2
cs.RO 2026-05-13 reviewed

Survey ties hardware and methods to dexterous hand progress
Towards Robotic Dexterous Hand Intelligence: A Survey

Weiguang Zhao +5
cs.RO 2026-05-13 reviewed

Distilled policy trains quadrupeds to traverse any tunnel
Robot Squid Game: Quadrupedal Locomotion for Traversing Narrow Tunnels

Amir Hossain Raj +2
cs.RO 2026-05-13 reviewed

Causal joint modeling raises driving score to 87.53
Causality-Aware End-to-End Autonomous Driving via Ego-Centric Joint Scene Modeling

Seokha Moon +4
cs.RO 2026-05-13 reviewed

Causal ego-agent links raise driving score to 87.53
Causality-Aware End-to-End Autonomous Driving via Ego-Centric Joint Scene Modeling

Seokha Moon +4
cs.RO 2026-05-13 reviewed

User spatial cues lift robot success to 81.2 percent
Guide, Think, Act: Interactive Embodied Reasoning in Vision-Language-Action Models

Yiran Ling +8
cs.RO 2026-05-13 reviewed

Rotational rings tune magnetic robot tip response in fixed fields
Design of Magnetic Continuum Robots with Tunable Force Response Using Rotational Ring Pairs

Alex Sayres +1
cs.AI 2026-05-13 reviewed

MCP gives AI agents and humans one lab control interface
NIMO Controller: a self-driving laboratory orchestrator based on the Model Context Protocol

Naruki Yoshikawa +1
cs.LG 2026-05-13 reviewed

One vector of atom scores certifies every formula in a temporal fragment
Vision-Based Runtime Monitoring under Varying Specifications using Semantic Latent Representations

Bardh Hoxha +4
cs.RO 2026-05-13 reviewed

Reweighting robot actions by velocity boosts performance
AttenA+: Rectifying Action Inequality in Robotic Foundation Models

Daojie Peng +9
cs.RO 2026-05-13 reviewed

Open standards let one agent model run consistently in three simulators
Integration of an Agent Model into an Open Simulation Architecture for Scenario-Based Testing of Automated Vehicles

Christian Geller +3