Bidirectional attention network for monocular depth estimation

Michelle A · 2021 · arXiv 8506.2021

Mixed citation behavior. The most common role is background (67% of classified citations).

Cited by 37 Pith papers.

citation-role summary

background: 4 · method: 2

citation-polarity summary

background: 4 · use method: 2

representative citing papers

Learned Memory Attenuation in Sage-Husa Kalman Filters for Robust UAV State Estimation

eess.SP · 2026-05-18 · unverdicted · novelty 7.0

NDR-SHKF replaces the static forgetting factor in Sage-Husa Kalman Filters with a learned vector-valued memory attenuation policy from a bifurcated recurrent network trained end-to-end on whitened innovations to minimize estimation error.

Randomized Advantage Transformation (RAT): Computing Natural Policy Gradients via Direct Backpropagation

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

RAT reformulates regularized natural policy gradients as vanilla gradients with a transformed advantage, computed efficiently via randomized block Kaczmarz iterations on on-policy data.

Discrete Diffusion for Complex and Congested Multi-Agent Path Finding with Sparse Social Attention

cs.AI · 2026-05-13 · unverdicted · novelty 7.0

DiffLNS uses a discrete diffusion initializer to produce warm-start plans that lift LNS2 success rates to 95.8% across 20 congested MAPF settings, generalizing from 96 to 312 agents.

Distributed Pose Graph Optimization via Continuous Riemannian Dynamics

cs.RO · 2026-05-11 · unverdicted · novelty 7.0

Pose graph optimization is recast as damped Riemannian dynamics on Lie groups, enabling a fully distributed algorithm with a semi-implicit integrator that converges under both synchronous and asynchronous communication.

LLM-Foraging: Large Language Models for Decentralized Swarm Robot Foraging

cs.RO · 2026-05-02 · unverdicted · novelty 7.0

LLM-Foraging uses off-the-shelf LLMs for decentralized tactical decisions in CPFA-based swarm foraging, collecting more resources than GA-tuned baselines across 36 varied configurations while showing greater consistency.

Simulation-Ready Cluttered Scene Estimation via Physics-aware Joint Shape and Pose Optimization

cs.RO · 2026-02-23 · unverdicted · novelty 7.0

SPARCS uses a differentiable contact model and sparse Hessian solver to jointly optimize shapes and poses of up to five interacting objects, producing physically valid simulation-ready reconstructions.

Adaptive Control in Autonomous Driving via Real-Time Recurrent RL

cs.RO · 2026-02-02 · unverdicted · novelty 7.0

Combines offline behavioral cloning with online Real-Time Recurrent RL fine-tuning on LrcSSM models to adapt autonomous driving policies to distribution shifts, validated in simulation and on a real 1:10-scale robot with an event camera.

AID: Agent Intent from Diffusion for Multi-Agent Informative Path Planning

cs.RO · 2025-12-02 · conditional · novelty 7.0

AID trains diffusion policies via behavior cloning on existing MAIPP planners followed by RL fine-tuning to achieve faster execution and higher information gain in multi-agent coordination.

Guided Reinforcement Learning for Omnidirectional 3D Jumping in Quadruped Robots

cs.RO · 2025-07-22 · unverdicted · novelty 7.0

Guided RL using Bezier curves and UARM model enables efficient, explainable omnidirectional jumping in quadruped robots.

INSANE: Cross-Domain UAV Data Sets with Increased Number of Sensors for developing Advanced and Novel Estimators

cs.RO · 2022-10-17 · accept · novelty 7.0

INSANE releases multiple MAV datasets with cross-environment trajectories, rich multi-IMU and camera suites, high-rate vibration data, and sub-centimeter RTK GNSS ground truth for localization research.

VBT-MPC: Vision-Based Tactile MPC for Contour Following

cs.RO · 2026-05-19 · unverdicted · novelty 6.0

VBT-MPC performs robotic contour following by running MPC directly in vision-based tactile contour feature space and is tested on varied geometries in simulation and real experiments.

ARC-RL: A Reinforcement Learning Playground Inspired by ARC Raiders

cs.RO · 2026-05-19 · accept · novelty 6.0 · 2 refs

ARC-RL is a new suite of four MuJoCo continuous-control environments featuring game-inspired hexapod and quadruped morphologies, a single closed-form multi-component reward function, CPG demonstrators, and empirical comparisons of online and offline-to-online RL algorithms.

BoolXLLM: LLM-Assisted Explainability for Boolean Models

cs.AI · 2026-05-12 · unverdicted · novelty 6.0

BoolXLLM augments an existing Boolean rule learner with LLMs for feature selection, discretization thresholds, and natural-language rule translation to improve interpretability while preserving accuracy.

TouchAnything: Diffusion-Guided 3D Reconstruction from Sparse Robot Touches

cs.CV · 2026-04-10 · unverdicted · novelty 6.0

TouchAnything reconstructs accurate 3D object geometries from only a few tactile contacts by optimizing for consistency with a pretrained visual diffusion prior.

From Local Matches to Global Masks: Template-Guided Instance Detection and Segmentation in Open-World Scenes

cs.CV · 2026-03-03 · unverdicted · novelty 6.0

L2G-Det detects and segments novel object instances in open scenes by using local template patch matches to generate points that prompt an augmented SAM for global masks.

House of Dextra: Cross-embodied Co-design for Dexterous Hands

cs.RO · 2025-12-03 · unverdicted · novelty 6.0

A co-design framework learns task-specific hand shapes and complementary control policies, supporting design, training, fabrication, and deployment of new dexterous hands in under 24 hours.

QuickLAP: Quick Language-Action Preference Learning for Semi-Autonomous Agents

cs.AI · 2025-11-22 · unverdicted · novelty 6.0 · 2 refs

QuickLAP fuses LLM-extracted language observations with physical feedback in a closed-form Bayesian update to cut reward learning error by over 70% in a driving simulator and improve user preference in a 15-person study.

Learning Multi-Modal Whole-Body Control for Real-World Humanoid Robots

cs.RO · 2024-07-30 · unverdicted · novelty 6.0

A single learned controller called MHC enables real humanoid robots to execute diverse whole-body behaviors from multi-modal inputs via masked target trajectories.

FusionSense: Tri-Stage Near-Sensor Learning for Runtime-Adaptive Multimodal Edge Intelligence

cs.LG · 2026-05-19 · unverdicted · novelty 5.0

FusionSense uses server-side fusion learning, filter-out-safe labels, and edge compaction to enable runtime-adaptive multimodal sensing that cuts energy up to 33x while preserving task quality on RGB+Depth data.

Visibility-Aware Mobile Grasping in Dynamic Environments

cs.RO · 2026-05-04 · unverdicted · novelty 5.0

A visibility-aware mobile grasping system with iterative whole-body planning and behavior-tree subgoal generation achieves 68.8% success in unknown static and 58% in dynamic environments, outperforming a baseline by 22.8% and 18%.

From Spherical to Gaussian: A Comparative Analysis of Point Cloud Cropping Strategies in Large-Scale 3D Environments

cs.CV · 2026-05-03 · unverdicted · novelty 5.0

Gaussian and related cropping strategies for point cloud subclouds improve 3D neural network performance over spherical cropping on large outdoor scenes.

BIEVR-LIO: Robust LiDAR-Inertial Odometry through Bump-Image-Enhanced Voxel Maps

cs.RO · 2026-04-15 · unverdicted · novelty 5.0

BIEVR-LIO improves robustness of LiDAR-inertial odometry by representing maps as voxel-wise oriented height images and sampling points only from geometrically informative regions.

Hierarchical Awareness Adapters with Hybrid Pyramid Feature Fusion for Dense Depth Prediction

cs.CV · 2026-04-03 · unverdicted · novelty 5.0

A multilevel perceptual CRF model using Swin Transformer, HPF fusion, HA adapters, and dynamic scaling attention achieves state-of-the-art monocular depth estimation on NYU Depth v2, KITTI, and MatterPort3D with reduced error and fast inference.

ARROW: Augmented Replay for RObust World models

cs.LG · 2026-03-12 · unverdicted · novelty 5.0

ARROW adds a distribution-matching long-term replay buffer to DreamerV3 and shows reduced forgetting versus same-size baselines on Atari and Procgen continual RL benchmarks.

citing papers explorer

Showing 37 of 37 citing papers.

Learned Memory Attenuation in Sage-Husa Kalman Filters for Robust UAV State Estimation eess.SP · 2026-05-18 · unverdicted · none · ref 24
NDR-SHKF replaces the static forgetting factor in Sage-Husa Kalman Filters with a learned vector-valued memory attenuation policy from a bifurcated recurrent network trained end-to-end on whitened innovations to minimize estimation error.
Randomized Advantage Transformation (RAT): Computing Natural Policy Gradients via Direct Backpropagation cs.LG · 2026-05-18 · unverdicted · none · ref 84
RAT reformulates regularized natural policy gradients as vanilla gradients with a transformed advantage, computed efficiently via randomized block Kaczmarz iterations on on-policy data.
Discrete Diffusion for Complex and Congested Multi-Agent Path Finding with Sparse Social Attention cs.AI · 2026-05-13 · unverdicted · none · ref 9
DiffLNS uses a discrete diffusion initializer to produce warm-start plans that lift LNS2 success rates to 95.8% across 20 congested MAPF settings, generalizing from 96 to 312 agents.
Distributed Pose Graph Optimization via Continuous Riemannian Dynamics cs.RO · 2026-05-11 · unverdicted · none · ref 35
Pose graph optimization is recast as damped Riemannian dynamics on Lie groups, enabling a fully distributed algorithm with a semi-implicit integrator that converges under both synchronous and asynchronous communication.
LLM-Foraging: Large Language Models for Decentralized Swarm Robot Foraging cs.RO · 2026-05-02 · unverdicted · none · ref 22
LLM-Foraging uses off-the-shelf LLMs for decentralized tactical decisions in CPFA-based swarm foraging, collecting more resources than GA-tuned baselines across 36 varied configurations while showing greater consistency.
Simulation-Ready Cluttered Scene Estimation via Physics-aware Joint Shape and Pose Optimization cs.RO · 2026-02-23 · unverdicted · none · ref 44
SPARCS uses a differentiable contact model and sparse Hessian solver to jointly optimize shapes and poses of up to five interacting objects, producing physically valid simulation-ready reconstructions.
Adaptive Control in Autonomous Driving via Real-Time Recurrent RL cs.RO · 2026-02-02 · unverdicted · none · ref 10
Combines offline behavioral cloning with online Real-Time Recurrent RL fine-tuning on LrcSSM models to adapt autonomous driving policies to distribution shifts, validated in simulation and on a real 1:10-scale robot with an event camera.
AID: Agent Intent from Diffusion for Multi-Agent Informative Path Planning cs.RO · 2025-12-02 · conditional · none · ref 34
AID trains diffusion policies via behavior cloning on existing MAIPP planners followed by RL fine-tuning to achieve faster execution and higher information gain in multi-agent coordination.
Guided Reinforcement Learning for Omnidirectional 3D Jumping in Quadruped Robots cs.RO · 2025-07-22 · unverdicted · none · ref 10
Guided RL using Bezier curves and UARM model enables efficient, explainable omnidirectional jumping in quadruped robots.
INSANE: Cross-Domain UAV Data Sets with Increased Number of Sensors for developing Advanced and Novel Estimators cs.RO · 2022-10-17 · accept · none · ref 4
INSANE releases multiple MAV datasets with cross-environment trajectories, rich multi-IMU and camera suites, high-rate vibration data, and sub-centimeter RTK GNSS ground truth for localization research.
VBT-MPC: Vision-Based Tactile MPC for Contour Following cs.RO · 2026-05-19 · unverdicted · none · ref 19
VBT-MPC performs robotic contour following by running MPC directly in vision-based tactile contour feature space and is tested on varied geometries in simulation and real experiments.
ARC-RL: A Reinforcement Learning Playground Inspired by ARC Raiders cs.RO · 2026-05-19 · accept · none · ref 19 · 2 links
ARC-RL is a new suite of four MuJoCo continuous-control environments featuring game-inspired hexapod and quadruped morphologies, a single closed-form multi-component reward function, CPG demonstrators, and empirical comparisons of online and offline-to-online RL algorithms.
BoolXLLM: LLM-Assisted Explainability for Boolean Models cs.AI · 2026-05-12 · unverdicted · none · ref 107
BoolXLLM augments an existing Boolean rule learner with LLMs for feature selection, discretization thresholds, and natural-language rule translation to improve interpretability while preserving accuracy.
TouchAnything: Diffusion-Guided 3D Reconstruction from Sparse Robot Touches cs.CV · 2026-04-10 · unverdicted · none · ref 46
TouchAnything reconstructs accurate 3D object geometries from only a few tactile contacts by optimizing for consistency with a pretrained visual diffusion prior.
From Local Matches to Global Masks: Template-Guided Instance Detection and Segmentation in Open-World Scenes cs.CV · 2026-03-03 · unverdicted · none · ref 5
L2G-Det detects and segments novel object instances in open scenes by using local template patch matches to generate points that prompt an augmented SAM for global masks.
House of Dextra: Cross-embodied Co-design for Dexterous Hands cs.RO · 2025-12-03 · unverdicted · none · ref 10
A co-design framework learns task-specific hand shapes and complementary control policies, supporting design, training, fabrication, and deployment of new dexterous hands in under 24 hours.
QuickLAP: Quick Language-Action Preference Learning for Semi-Autonomous Agents cs.AI · 2025-11-22 · unverdicted · none · ref 67 · 2 links
QuickLAP fuses LLM-extracted language observations with physical feedback in a closed-form Bayesian update to cut reward learning error by over 70% in a driving simulator and improve user preference in a 15-person study.
Learning Multi-Modal Whole-Body Control for Real-World Humanoid Robots cs.RO · 2024-07-30 · unverdicted · none · ref 19
A single learned controller called MHC enables real humanoid robots to execute diverse whole-body behaviors from multi-modal inputs via masked target trajectories.
FusionSense: Tri-Stage Near-Sensor Learning for Runtime-Adaptive Multimodal Edge Intelligence cs.LG · 2026-05-19 · unverdicted · none · ref 20
FusionSense uses server-side fusion learning, filter-out-safe labels, and edge compaction to enable runtime-adaptive multimodal sensing that cuts energy up to 33x while preserving task quality on RGB+Depth data.
Visibility-Aware Mobile Grasping in Dynamic Environments cs.RO · 2026-05-04 · unverdicted · none · ref 58
A visibility-aware mobile grasping system with iterative whole-body planning and behavior-tree subgoal generation achieves 68.8% success in unknown static and 58% in dynamic environments, outperforming a baseline by 22.8% and 18%.
From Spherical to Gaussian: A Comparative Analysis of Point Cloud Cropping Strategies in Large-Scale 3D Environments cs.CV · 2026-05-03 · unverdicted · none · ref 15
Gaussian and related cropping strategies for point cloud subclouds improve 3D neural network performance over spherical cropping on large outdoor scenes.
BIEVR-LIO: Robust LiDAR-Inertial Odometry through Bump-Image-Enhanced Voxel Maps cs.RO · 2026-04-15 · unverdicted · none · ref 24
BIEVR-LIO improves robustness of LiDAR-inertial odometry by representing maps as voxel-wise oriented height images and sampling points only from geometrically informative regions.
Hierarchical Awareness Adapters with Hybrid Pyramid Feature Fusion for Dense Depth Prediction cs.CV · 2026-04-03 · unverdicted · none · ref 3
A multilevel perceptual CRF model using Swin Transformer, HPF fusion, HA adapters, and dynamic scaling attention achieves state-of-the-art monocular depth estimation on NYU Depth v2, KITTI, and MatterPort3D with reduced error and fast inference.
ARROW: Augmented Replay for RObust World models cs.LG · 2026-03-12 · unverdicted · none · ref 11
ARROW adds a distribution-matching long-term replay buffer to DreamerV3 and shows reduced forgetting versus same-size baselines on Atari and Procgen continual RL benchmarks.
Toward Seamless Physical Human-Humanoid Interaction: Insights from Control, Intent, and Modeling with a Vision for What Comes Next cs.RO · 2025-12-08 · unverdicted · none · ref 105 · 2 links
A literature review of pHHI that proposes a taxonomy of interaction types by modality and engagement level while outlining pathways to integrate control, intent, and modeling for more seamless humanoid-human collaboration.
Online Adaptive Probabilistic Safety Certificate with Language Guidance eess.SY · 2025-11-16 · unverdicted · none · ref 11
A framework integrates user language and probabilistic environment estimates into adaptive safety certificates that guarantee long-term safety for stochastic systems via probabilistic invariance.
STL-Based Motion Planning and Uncertainty-Aware Risk Analysis for Human-Robot Collaboration with a Multi-Rotor Aerial Vehicle cs.RO · 2025-09-12 · unverdicted · none · ref 17
The paper proposes an STL-based optimization planner with uncertainty-aware risk analysis and event-triggered replanning for safe human-drone collaboration, demonstrated in simulations of an object handover task.
NOOUGAT: Towards Unified Online and Offline Multi-Object Tracking cs.CV · 2025-09-02 · unverdicted · none · ref 67
NOOUGAT unifies online and offline multi-object tracking with a GNN that processes non-overlapping subclips fused by an Autoregressive Long-term Tracking layer, reporting SOTA gains on DanceTrack, SportsMOT, and MOT20.
Linking Exteroception and Proprioception through Improved Contact Modeling for Soft Growing Robots cs.RO · 2025-07-14 · unverdicted · none · ref 22
Soft growing robots map unknown 2D environments by characterizing collision deformations, building a geometry-based simulator, and using Monte Carlo sampling to select optimal deployments that approach ideal actions.
4D Radar Semantic Segmentation of People in Field Conditions Using Temporal Multi-View Networks cs.CV · 2024-04-08 · unverdicted · none · ref 15
TMVA4D uses CNN and ConvLSTM encoders on multi-view 2D projections of 4D radar point clouds for semantic segmentation of people, reporting Dice 75.9% and IoU 61.2% in field tests.
A Systematic Survey on Deep Learning Architectures for Point Cloud Classification and Segmentation cs.CV · 2026-05-16 · unverdicted · none · ref 91
A systematic literature survey that categorizes deep learning architectures for point cloud classification, part segmentation, and semantic segmentation, evaluates them on benchmarks, and discusses innovations, limitations, and future directions.
The Unified Autonomy Stack: Toward a Blueprint for Generalizable Robot Autonomy cs.RO · 2026-05-12 · accept · none · ref 36 · 2 links
An open-sourced Unified Autonomy Stack fuses LiDAR, radar, vision and inertial data with sampling-based planning and control barrier functions to deliver resilient autonomy on aerial and ground robots in challenging real-world settings.
Smoothing Out the Edges: Continuous-Time Estimation with Gaussian Process Motion Priors on Factor Graphs cs.RO · 2026-05-09 · accept · none · ref 293 · 2 links
The paper recasts Gaussian-process continuous-time estimation in factor-graph language and supplies three GTSAM implementations to lower the barrier to adoption.
Explainable Planning for Hybrid Systems cs.AI · 2026-02-24 · unverdicted · none · ref 2
A comprehensive study on generating explanations for automated planning in hybrid systems.
Improving Action Smoothness for a Cascaded Online Learning Flight Control System eess.SY · 2025-07-06 · unverdicted · none · ref 29
Adds temporal smoothness and low-pass filtering to cut oscillations in cascaded online learning flight controllers, shown via FFT and simulations.
Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning cs.RO · 2024-06-11 · unverdicted · none · ref 33
Develops and tests a model-based RL controller with post-training for gait in a tendon-driven soft quadruped, reporting improved efficiency and robustness over benchmarks.
Visual Hand Gesture Recognition with Deep Learning: A Comprehensive Review of Methods, Datasets, Challenges and Future Research Directions cs.CV · 2025-07-06 · unverdicted · none · ref 211
A literature review that categorizes deep learning approaches for visual hand gesture recognition, summarizes state-of-the-art methods across tasks, reviews datasets and metrics, and identifies challenges and future directions.
