An asynchronous architecture decouples incremental voxel-based mapping from VLM-based semantic enrichment to produce queryable open-vocabulary 3D scene graphs that match or exceed prior methods on segmentation and grounding benchmarks.
hub Canonical reference
Monocular visual-inertial odometry in low-textured environments with smooth gradients: A fully dense direct filtering approach
Canonical reference. 80% of citing Pith papers cite this work as background.
hub tools
citation-role summary
citation-polarity summary
representative citing papers
An event-camera system with active gaze control and contrast-maximization spin estimation achieves real-time performance in table tennis with 8.8% magnitude error, 6.4° axis error, 3 ms latency, and 750 Hz throughput.
SA-LIVO uses eigendecomposition of the joint information matrix with linear-clamp soft gates per eigendirection for efficient degeneracy-aware LiDAR-inertial-visual odometry.
A flow-adaptive ergodic coverage formulation using MMD that preserves guarantees over evolving domains and supports open-loop planning for robots in flows.
Timed reward machines extend reward machines with timing constraints, allowing model-free RL algorithms to learn policies that satisfy precise temporal requirements on standard benchmarks.
BEVCALIB performs LiDAR-camera calibration from raw data by fusing camera and LiDAR bird's-eye view features with a novel feature selector and reports state-of-the-art accuracy on KITTI and NuScenes.
Residual feature integration with a trainable target-side encoder provably prevents negative transfer, achieving convergence rates no worse than training from scratch under informative target distributions.
INSANE releases multiple MAV datasets with cross-environment trajectories, rich multi-IMU and camera suites, high-rate vibration data, and sub-centimeter RTK GNSS ground truth for localization research.
FALCON algorithm solves non-convex partially-decoupled GNEPs via SCP and potential games, claiming global convergence to open-loop Nash equilibria under mild assumptions.
Introduces LSM that outputs calibrated multimodal spatial distributions from language plus scene graph, fused via VL-Map to improve 3D target localization on VLA-3D benchmark and real robot.
A LiDAR-inertial odometry pipeline using on-manifold ellipsoidal set-membership filtering to output feasible sets as deterministic protection levels under unknown-but-bounded point-cloud noise.
RigidFormer learns mesh-free rigid dynamics from point clouds using object-centric anchors, Anchor-Vertex Pooling, Anchor-based RoPE, and differentiable Kabsch alignment to enforce rigidity.
Dr-BA delivers a separable optimization approach for direct radar bundle adjustment and cross-session localization using full spinning-radar intensity images, achieving state-of-the-art performance on over 200 km of on-road data.
New knot classification benchmark and topology-aware supervision methods yield small specificity gains but confirm that appearance bias remains the dominant failure mode.
Integrates iterative learning control with a torque library to enable high-precision adaptive locomotion on bipedal and quadrupedal robots, reducing tracking errors by up to 85% and achieving over 30x faster control rates.
Bee hive mind from weighted voter imitation equals a single RL agent using a new multi-armed bandit rule called Maynard-Cross Learning.
UECP replaces detection-correlated confidence maps with a LiDAR point-density uncertainty map and introduces Uncertainty-Aware Pyramid Fusion to improve collaborative perception.
KITE decouples task reasoning from embodiment-specific control via learned latent interaction intents to enable zero-shot transfer across structurally different robots.
A fully discrete strain-based model for continuum robot dynamics via Lie group variational integrators, combined with an EKF-based observer for states and disturbances, validated on hardware.
A simulation-trained deep deformation model combined with online adaptive control enables zero-shot autonomous tissue retraction for ROI exposure in robotic surgery.
Systematic grasping strategies for paper-like materials are developed and tested with a soft gripper by exploiting environmental constraints to improve force control and success rates.
PF-CD3Q uses online particle filtering to estimate fatigue parameters and constrains a deep Q-learning agent to solve fatigue-aware human-robot task planning as a CMDP.
PRISM downsamples point clouds by stratifying on RGB color bins with a maximum capacity k per bin to preserve high chromatic diversity regions over homogeneous surfaces.
RSR-RSMARL is a robust safe MARL framework with V2V communication and CBF safety shields that supports zero-shot sim-to-real transfer and improves coordination on 1/10-scale vehicle hardware.