archive
Every paper Pith has read. Search by title, abstract, or pith.
2900 papers in cs.RO · page 8
-
Diffusion policy opens pull doors with base and dual arms
Diffusion Policy for Coordinated Control of a Nonholonomic Mobile Base and Dual Arms in Door Opening and Passing
-
Hybrid video and MoCap data enable zero-shot humanoid motion tracking
HoloMotion-1 Technical Report
-
Hybrid video and capture data enables zero-shot humanoid tracking
HoloMotion-1 Technical Report
-
Human video builds physical smarts for top robot policies
PhysBrain 1.0 Technical Report
-
Blended human-robot control cuts dexterous errors by 87%
Hand-in-the-Loop: Improving VLA Policies for Dexterous Manipulation via Seamless Hand-Arm Intervention
-
Seamless blending cuts robot-hand takeover jitter by 99.8%
Hand-in-the-Loop: Improving VLA Policies for Dexterous Manipulation via Seamless Hand-Arm Intervention
-
One model tops VLM benchmarks and robot action tasks
Pelican-Unify 1.0: A Unified Embodied Intelligence Model for Understanding, Reasoning, Imagination and Action
-
Single model tops VLM and world benchmarks while ranking near first on robot actions
Pelican-Unify 1.0: A Unified Embodied Intelligence Model for Understanding, Reasoning, Imagination and Action
-
CLOVER reaches new NAVSIM high for driving planners
CLOVER: Closed-Loop Value Estimation and Ranking for End-to-End Autonomous Driving Planning
-
VLMs fail to locate hidden functional objects from task instructions
SceneFunRI: Reasoning the Invisible for Task-Driven Functional Object Localization
-
State space models let robot policies use full observation history
DSSP: Diffusion State Space Policy with Full-History Encoding
-
Two uncalibrated cameras track joint angles with 6-degree accuracy
Agentic Pipeline for Self-Synchronized Multiview Joint Angle Monitoring in Uncalibrated Environments
-
Alignment lets robots predict tactile sensations from sight
Let Robots Feel Your Touch: Visuo-Tactile Cortical Alignment for Embodied Mirror Resonance
-
Unified GPU solver gives exact gradients for stiff heterogeneous soft bodies
DiffPhD: A Unified Differentiable Solver for Projective Heterogeneous Materials in Elastodynamics with Contact-Rich GPU-Acceleration
-
DAJI anticipates joint intent for language-driven humanoid robots
Before the Body Moves: Learning Anticipatory Joint Intent for Language-Conditioned Humanoid Control
-
Language lets humanoids plan moves before the body starts
Before the Body Moves: Learning Anticipatory Joint Intent for Language-Conditioned Humanoid Control
-
Intermediate foot springs cut quadruped energy use by 17%
Energy-Efficient Quadruped Locomotion with Compliant Feet
-
Diffusion models uncover semantic attacks on vehicle maps
Systematic Discovery of Semantic Attacks in Online Map Construction through Conditional Diffusion
-
Distill refines robot task specs to match true user intent
Distill: Uncovering the True Intent behind Human-Robot Communication
-
Robots dodge obstacles with partial maps via local path fixes
Reactive Planning based Control for Mobile Robots in Obstacle-Cluttered Environments
-
MAPLE trains end-to-end driving models through reactive multi-agent rollouts performed…
MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving
-
Latent multi-agent rollouts boost closed-loop driving performance
MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving
-
GCS optimization matches nonlinear AV trajectories efficiently
Motion Planning for Autonomous Vehicles using Optimization over Graphs of Convex Sets
-
CVaR policies top safety verification rates in robot navigation
Safety-Constrained Reinforcement Learning with Post-Training Reachability Verification for Robot Navigation
-
Sparse relevance heads cut ViT 3D detection time by 3x
SToRe3D: Sparse Token Relevance in ViTs for Efficient Multi-View 3D Object Detection
-
Low-res egocentric images suffice for robot active perception
Behavior Cloning for Active Perception with Low-Resolution Egocentric Vision
-
Target distribution from demos lets robots explore while staying grounded
Ergodic Imitation for Adaptive Exploration around Demonstrations
-
Action history prior straightens robot flow paths
WarmPrior: Straightening Flow-Matching Policies with Temporal Priors
-
Guidance sets safe speed for loiter UAV corridor reinsertion
Loiter UAV Reinsertion Guidance for Fixed-wing UAV Corridors
-
One diffusion model generates LiDAR scans across eight domains
OmniLiDAR: A Unified Diffusion Framework for Multi-Domain 3D LiDAR Generation
-
Language models turn satellite images into UAV search priors
LMPath: Language-Mediated Priors and Path Generation for Aerial Exploration
-
Draft model cuts diffusion VLA latency to 19 ms
Realtime-VLA FLASH: Speculative Inference Framework for Diffusion-based VLAs
-
RoboEvolve pairs a vision-language planner with a video-generation simulator in a…
RoboEvolve: Co-Evolving Planner-Simulator for Robotic Manipulation with Limited Data
-
FrameSkip lifts VLA success from 66.5% to 76.2% with 20% frames
FrameSkip: Learning from Fewer but More Informative Frames in VLA Training
-
One demo lets robots build walls of any length
Manipulation Planning for Construction Activities with Repetitive Tasks
-
CARS attributes AV collisions to driver faults
Learning Responsibility-Attributed Adversarial Scenarios for Testing Autonomous Vehicles
-
TinySDP runs real-time SDP on microcontrollers for robot control
TinySDP: Real Time Semidefinite Optimization for Certifiable and Agile Edge Robotics
-
Latent actions unify control across robot bodies
SCAR: Self-Supervised Continuous Action Representation Learning
-
Monocular RGB builds accurate 3D scene graphs room by room
LEXI-SG: Monocular 3D Scene Graph Mapping with Room-Guided Feed-Forward Reconstruction
-
Sliding-mode law hits targets on exact schedule inside acceleration limits
Bounded-Input True Proportional Navigation for Impact-Time Control
-
Survey ties hardware and methods to dexterous hand progress
Towards Robotic Dexterous Hand Intelligence: A Survey
-
Distilled policy trains quadrupeds to traverse any tunnel
Robot Squid Game: Quadrupedal Locomotion for Traversing Narrow Tunnels
-
Causal joint modeling raises driving score to 87.53
Causality-Aware End-to-End Autonomous Driving via Ego-Centric Joint Scene Modeling
-
Causal ego-agent links raise driving score to 87.53
Causality-Aware End-to-End Autonomous Driving via Ego-Centric Joint Scene Modeling
-
User spatial cues lift robot success to 81.2%
Guide, Think, Act: Interactive Embodied Reasoning in Vision-Language-Action Models
-
Rotational rings tune magnetic robot tip response in fixed fields
Design of Magnetic Continuum Robots with Tunable Force Response Using Rotational Ring Pairs
-
MCP gives AI agents and humans one lab control interface
NIMO Controller: a self-driving laboratory orchestrator based on the Model Context Protocol
-
One vector of atom scores certifies every formula in a temporal fragment
Vision-Based Runtime Monitoring under Varying Specifications using Semantic Latent Representations
-
Reweighting robot actions by velocity boosts performance
AttenA+: Rectifying Action Inequality in Robotic Foundation Models
-
Open standards let one agent model run consistently in three simulators
Integration of an Agent Model into an Open Simulation Architecture for Scenario-Based Testing of Automated Vehicles