archive

Every paper Pith has read. Search by title, abstract, or pith.

2900 papers in cs.RO · page 10

  1. cs.RO 2026-05-12 reviewed
    Unified VLA model beats human drivers on driving benchmark

    MindVLA-U1: VLA Beats VA with Unified Streaming Architecture for Autonomous Driving

    Yuzhou Huang +8

  2. cs.RO 2026-05-12 reviewed
    Streaming intent produces controllable driving plans in end-to-end model

    Action Emergence from Streaming Intent

    Pengfei Jing +5

  3. cs.RO 2026-05-12 reviewed
    Streaming intent steers driving AI to distinct plans

    Action Emergence from Streaming Intent

    Pengfei Jing +5

  4. cs.RO 2026-05-12 reviewed
    Benchmark finds successful robot tasks often unsafe

    SafeManip: A Property-Driven Benchmark for Temporal Safety Evaluation in Robotic Manipulation

    Chengyue Huang +4

  5. cs.RO 2026-05-12 reviewed
    Specialized heads boost VLA robot success in and out of domain

    GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization

    Xiaosong Jia +19

  6. cs.RO 2026-05-12 reviewed
    IMU suit teleoperates humanoid robot in real time with stable motions

    Real-Time Whole-Body Teleoperation of a Humanoid Robot Using IMU-Based Motion Capture with Sim2Sim and Sim2Real Validation

    Hamza Ahmed Durrani +1

  7. cs.CV 2026-05-12 reviewed
    Stereo event cameras track 3D hand poses at 30 mm error

    EgoEV-HandPose: Egocentric 3D Hand Pose Estimation and Gesture Recognition with Stereo Event Cameras

    Luming Wang +4

  8. cs.RO 2026-05-12 reviewed
    One diffusion policy learns both search and insertion

    SI-Diff: A Framework for Learning Search and High-Precision Insertion with a Force-Domain Diffusion Policy

    Yibo Liu +7

  9. cs.RO 2026-05-12 reviewed
    Timestep modulation turns diffusion pretraining into efficient robot exploration

    TMRL: Diffusion Timestep-Modulated Pretraining Enables Exploration for Efficient Policy Finetuning

    Matthew M. Hong +3

  10. cs.RO 2026-05-12 reviewed
    Symmetry prior speeds up bimanual robot learning

    Morphologically Equivariant Flow Matching for Bimanual Mobile Manipulation

    Max Siebenborn +6

  11. cs.CV 2026-05-12 reviewed
    Three height bands give 49-FPS LiDAR pedestrian detection

    TriBand-BEV: Real-Time LiDAR-Only 3D Pedestrian Detection via Height-Aware BEV and High-Resolution Feature Fusion

    Mohammad Khoshkdahan +1

  12. cs.RO 2026-05-12 reviewed
    Virtual objectives stabilize twist retargeting for dexterous hands

    DexTwist: Dexterous Hand Retargeting for Twist Motion via Mixed Reality-based Teleoperation

    Dongmyoung Lee +2

  13. cs.RO 2026-05-12 reviewed
    Mixture of inverse models turns robot video predictions into actions

    From Imagined Futures to Executable Actions: Mixture of Latent Actions for Robot Manipulation

    Yajie Li +7

  14. cs.RO 2026-05-12 reviewed
    Bidirectional pose-action loop boosts robot manipulation

    X-Imitator: Spatial-Aware Imitation Learning via Bidirectional Action-Pose Interaction

    Kai Xiong +3

  15. cs.RO 2026-05-12 reviewed
    Premover cuts robot task time 13.6% by acting on incomplete commands

    Premover: Fast Vision-Language-Action Control by Acting Before Instructions Are Complete

    Joonha Park +2

  16. cs.RO 2026-05-12 reviewed
    OrbiSim turns world models into differentiable physics engines

    OrbiSim: World Models as Differentiable Physics Engines for Embodied Intelligence

    Jiajian Li +5

  17. cs.RO 2026-05-12 reviewed
    World models merge with action generation for embodied AI

    World Action Models: The Next Frontier in Embodied AI

    Siyin Wang +13

  18. cs.RO 2026-05-12 reviewed
    QOED focuses robot exploration on identifiable parameters

    Learning What Matters: Adaptive Information-Theoretic Objectives for Robot Exploration

    Youwei Yu +4

  19. cs.RO 2026-05-12 reviewed
    INDI yields lower position errors than geometric NDI on hexarotors

    Control of Fully Actuated Aerial Vehicles: A Comparison of Model-based and Sensor-based Dynamic Inversion

    Ali Sidar Yilmaz +3

  20. cs.HC 2026-05-12 reviewed
    Robot execution and AI chat boost student code reflection

    RoboBlockly Studio: Conversational Block Programming with Embodied Robot Feedback for Computational Thinking

    Leyi Li +4

  21. cs.RO 2026-05-12 reviewed
    Motion statecharts execute semantic tasks on eight robot platforms

    Closing the Motion Execution Gap: From Semantic Motion Task Constraints to Kinematic Control

    Simon Stelter +3

  22. cs.RO 2026-05-12 reviewed
    Robot blocks unsafe merges at blind intersections

    Cooperative Robotics Reinforced by Collective Perception for Traffic Moderation

    Mohammad Khoshkdahan +3

  23. cs.RO 2026-05-12 reviewed
    Pre-planned graph branches let robots recover from failures instantly

    From Reaction to Anticipation: Proactive Failure Recovery through Agentic Task Graph for Robotic Manipulation

    Sheng Xu +8

  24. cs.RO 2026-05-12 reviewed
    LLM evolution designs superior robot navigation rewards

    EvoNav: Evolutionary Reward Function Design for Robot Navigation with Large Language Models

    Zhikai Zhao +6

  25. cs.RO 2026-05-12 reviewed
    Multi-view latents and manifold actions boost VLA robotic success

    Learning Action Manifold with Multi-view Latent Priors for Robotic Manipulation

    Junjin Xiao +11

  26. cs.RO 2026-05-12 reviewed
    Body regions dictate robot affective touch strategies

    Mapping Embodied Affective Touch Strategies on a Humanoid Robot

    Qiaoqiao Ren +5

  27. cs.RO 2026-05-12 reviewed
    New sampler prunes robot vision tokens to under 10% with no accuracy loss

    See What Matters: Differentiable Grid Sample Pruning for Generalizable Vision-Language-Action Model

    Yixu Feng +6

  28. cs.RO 2026-05-12 reviewed
    Grid sampler trims VLA tokens to under 10% with full success

    See What Matters: Differentiable Grid Sample Pruning for Generalizable Vision-Language-Action Model

    Yixu Feng +6

  29. cs.RO 2026-05-12 reviewed
    Online imitation learning improves navigation via privileged planner labels

    NavOL: Navigation Policy with Online Imitation Learning

    Xiaofei Wei +2

  30. cs.RO 2026-05-12 reviewed
    Robots dream short futures to dodge manipulation failures

    DreamAvoid: Critical-Phase Test-Time Dreaming to Avoid Failures in VLA Policies

    Xianzhe Fan +6

  31. cs.RO 2026-05-12 reviewed
    Surfaces guide soft gripper to grasp paper sheets

    Introducing Environmental Constraints to Grasping Strategies for Paper-Like Flexible Materials Using a Soft Gripper

    Yi Dong +3

  32. cs.RO 2026-05-12 reviewed
    Geometry tuning lets Rainbow DQN master cooperative insertions

    Rainbow Deep Q-Learning with Kinematics-Aware Design for Cooperative Delta and 3-RRS Parallel Robot Insertion

    Hassen Nigatu +4

  33. cs.RO 2026-05-12 reviewed
    IEKF and smoother cut long-term error versus MUSE on quadruped data

    A Proprioceptive-Only Benchmark for Quadruped State Estimation: ATE, RPE, and Runtime Trade-offs Between Filters and Smoothers

    Ylenia Nisticò +3

  34. cs.RO 2026-05-12 reviewed
    One prompt generates full robot learning pipelines

    Nautilus: From One Prompt to Plug-and-Play Robot Learning

    Yufeng Jin +10

  35. cs.CV 2026-05-12 reviewed
    SkyPart discovers semantic parts in drone and satellite images using competing learnable…

    Weather-Robust Cross-View Geo-Localization via Prototype-Based Semantic Part Discovery

    Chi-Nguyen Tran +5

  36. cs.CV 2026-05-12 reviewed
    Learnable prototypes separate layout from texture in geo-matching

    Weather-Robust Cross-View Geo-Localization via Prototype-Based Semantic Part Discovery

    Chi-Nguyen Tran +5

  37. cs.RO 2026-05-12 reviewed
    Planner achieves zero tip error for continuum robots on arms

    Sampling-Based Follow-the-Leader Motion Planning for Manipulator-Mounted Continuum Robots

    Chengnan Shentu +4

  38. cs.RO 2026-05-12 reviewed
    Lightweight Python layer lets users swap robot bodies with little code

    RIO: Flexible Real-Time Robot I/O for Cross-Embodiment Robot Learning

    Pablo Ortega-Kral +15

  39. eess.SP 2026-05-12 reviewed
    Diffusion model upgrades low-cost IMU to virtual high-grade data

    Overcoming the Intrinsic Performance Limitations of MEMS IMU via Diffusion-Based Generative Learning

    Jiarui Lv +2

  40. cs.RO 2026-05-12 reviewed
    Benchmark shows intent resolution bottlenecks LLM household agents

    PRISM: Planning and Reasoning with Intent in Simulated Embodied Environments

    Yunn Kang Lim +6

  41. cs.RO 2026-05-12 reviewed
    Single-agent demos plus cost produce coordinated multi-agent policies

    Coordinated Diffusion: Generating Multi-Agent Behavior Without Multi-Agent Demonstrations

    Lasse Peters +3

  42. cs.RO 2026-05-12 reviewed
    Liveness operator cuts truncation bias in robot policy evaluation

    Offline Policy Evaluation for Manipulation Policies via Discounted Liveness Formulation

    Hao Wang +3

  43. cs.AI 2026-05-12 reviewed
    PPO reformulated to beat SAC in multi-task RL

    TOPPO: Rethinking PPO for Multi-Task Reinforcement Learning with Critic Balancing

    Yuanpeng Li +3

  44. cs.RO 2026-05-12 reviewed
    Quadratic cost correction lifts VLA success 28.8% in dynamic scenes

    Overcoming Dynamics-Blindness: Training-Free Pace-and-Path Correction for VLA Models

    Yanyan Zhang +8

  45. cs.RO 2026-05-12 reviewed
    Training-free fix lifts VLA success rates up to 28.8% in dynamic scenes

    Overcoming Dynamics-Blindness: Training-Free Pace-and-Path Correction for VLA Models

    Yanyan Zhang +8

  46. cs.LG 2026-05-12 reviewed
    Mode discovery prevents collapse in RL-tuned generative policies

    Behavioral Mode Discovery for Fine-tuning Multimodal Generative Policies

    Alberta Longhini +3

  47. cs.CV 2026-05-12 reviewed
    MRF joint aligner reduces collisions in multi-agent paths

    JACoP: Joint Alignment for Compliant Multi-Agent Prediction

    Qingze Liu +5

  48. cs.RO 2026-05-12 reviewed
    Kairos cuts physical AI task latency by 32-66 percent

    Kairos: A Scalable Serving System for Physical AI

    Yinwei Dai +5

  49. cs.LG 2026-05-11 reviewed
    RL policy learns safe sparse timing via Lyapunov shield

    Learning When to Act: Communication-Efficient Reinforcement Learning via Run-Time Assurance

    Adam Haroon +3

  50. cs.RO 2026-05-11 reviewed
    Spinning single-propeller drone reduces visibility via motion blur

    Computational Design of a Low-Visibility UAV Using a Human-Aligned Perceptual Metric

    Jingxian Wang +5