COMPASS: Confined-space Manipulation Planning with Active Sensing Strategy

Chen Le; Dongyue Huang; Jincheng Yu; Qixuan Li; Xinlei Chen

arxiv: 2509.14787 · v2 · pith:GDL7RQWJnew · submitted 2025-09-18 · 💻 cs.RO

COMPASS: Confined-space Manipulation Planning with Active Sensing Strategy

Qixuan Li , Chen Le , Dongyue Huang , Jincheng Yu , Xinlei Chen This is my paper

Pith reviewed 2026-05-21 22:02 UTC · model grok-4.3

classification 💻 cs.RO

keywords confined spacemanipulation planningactive sensingexplorationsampling-based planningmulti-objective optimizationrobot benchmarkcluttered environments

0 comments

The pith

COMPASS improves robot manipulation success in confined spaces by combining active sensing with planning.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents COMPASS as a framework for robots to explore and manipulate objects in tight, cluttered areas where visibility is limited. It starts with a scan close to the robot to map nearby obstacles and reduce collision chances. Then it uses a utility function that scores potential viewpoints not just for new information but also for how well they set up future manipulation actions. A constrained optimization step creates safe poses for grasping while staying clear of obstacles. The method is evaluated on a custom benchmark with four difficulty levels, showing a 24.25% increase in successful manipulations over standard exploration approaches in simulations, with real-world tests confirming its practical use.

Core claim

By incorporating manipulation awareness into the exploration planner through multi-objective viewpoint selection and constrained pose optimization, the COMPASS framework enables more effective and safer manipulation in partially observable confined environments compared to information-gain-only methods.

What carries the argument

The manipulation-aware sampling-based planner that employs a multi-objective utility function for viewpoint selection and a constrained manipulation optimization strategy to respect obstacle constraints.

If this is right

Manipulation success rate increases by 24.25% in simulations compared to other exploration methods.
The framework reduces collision risks using near-field awareness scans to build local collision maps.
It generates manipulation poses that respect obstacle constraints in confined spaces.
Real-world experiments validate the capability for active sensing and manipulation.
The four-level benchmark allows systematic evaluation of performance under increasing difficulties.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Integrating sensing and action planning in this way may generalize to other robotics tasks with high uncertainty, such as search and rescue in rubble.
Future work could adapt the multi-objective weights dynamically based on task priorities.
Applying similar active sensing strategies to non-manipulation tasks like navigation in clutter could yield comparable gains.
Validation on a wider range of robot hardware and sensor types would test the framework's robustness.

Load-bearing premise

The four-level benchmark scenarios and the specific multi-objective utility weights adequately capture the real difficulties of confined-space manipulation and that observed gains are attributable to the manipulation-aware components rather than implementation details or baseline weaknesses.

What would settle it

An experiment showing that a standard information-gain exploration method achieves comparable or higher manipulation success rates in the same confined scenarios, or that the near-field scan fails to prevent collisions in real tests.

Figures

Figures reproduced from arXiv: 2509.14787 by Chen Le, Dongyue Huang, Jincheng Yu, Qixuan Li, Xinlei Chen.

**Figure 1.** Figure 1: Overview of COMPASS, a framework for active perception and manipulation in confined spaces. (Top) The three-stage pipeline solves a real-world task by sequentially performing: (I) a NearField Awareness Scan for safety, (II) MUE-RRT to find the target, and (III) Constrained Grasp optimization. (Bottom) Our principled simulation benchmark systematically evaluates performance across four levels of increasing… view at source ↗

**Figure 2.** Figure 2: Firstly, a map node incrementally builds an octomap [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗

**Figure 2.** Figure 2: Overview of system framework of COMPASS. Depth images from the wrist-camera are used to incrementally build an OctoMap. [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: The runtime logic of a successful exploration and manipu [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: The four challenging scenarios of our proposed benchmark, [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

**Figure 4.** Figure 4: As shown, our approach consistently achieves the [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗

**Figure 5.** Figure 5: Target volume exploration performance over time across the [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗

**Figure 6.** Figure 6: Snapshots of real-world experiments. build a map of the confined space and search for the target ( [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗

read the original abstract

Manipulation in confined and cluttered environments remains a significant challenge due to partial observability and complex configuration spaces. Effective manipulation in such environments requires an intelligent exploration strategy to safely understand the scene and search the target. In this paper, we propose COMPASS, a multi-stage exploration and manipulation framework featuring a manipulation-aware sampling-based planner. First, we reduce collision risks with a near-field awareness scan to build a local collision map. Additionally, we employ a multi-objective utility function to find viewpoints that are both informative and conducive to subsequent manipulation. Moreover, we perform a constrained manipulation optimization strategy to generate manipulation poses that respect obstacle constraints. To systematically evaluate method's performance under these difficulties, we propose a benchmark of confined-space exploration and manipulation containing four level challenging scenarios. Compared to exploration methods designed for other robots and only considering information gain, our framework increases manipulation success rate by 24.25% in simulations. Real-world experiments demonstrate our method's capability for active sensing and manipulation in confined environments.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

COMPASS ties active sensing to manipulation constraints with a new benchmark, but the 24% gain looks vulnerable to how the baselines were adapted.

read the letter

The paper's core move is a multi-stage planner that first does a near-field scan to build a local collision map, then picks viewpoints with a multi-objective utility that balances information gain against future manipulation feasibility, and finally runs constrained optimization for the actual grasp or action poses. They also release a four-level benchmark of confined scenarios. That integration is the main novelty and it addresses a real pain point in cluttered maintenance or inspection tasks where pure exploration often leaves the robot in bad poses for the next step. The real-world demos add some credibility that the sim results aren't purely theoretical. On the positive side, the constrained optimization and the explicit link from sensing to manipulation constraints feel like practical engineering that could transfer. The benchmark itself is a useful addition for the subfield. The soft spot is the central claim. The 24.25% success-rate lift is measured against exploration methods built for other robots that only optimize information gain. If those baselines were not re-implemented with the same near-field collision handling or constrained pose search, much of the delta could come from the comparison setup rather than the new utility function or the benchmark levels. The abstract gives no numbers on trial count, variance, or statistical tests, so it's difficult to know how stable the improvement is. The utility weights are also free parameters that might have been chosen with the test scenarios in mind. This work is aimed at robotics researchers who do motion planning and active perception in tight spaces. A reader who needs concrete ideas for coupling sensing and manipulation would find usable pieces here. It is worth sending to peer review because the problem matters and the framework is clearly described, even though the experiments will need more rigorous baseline controls and reporting to hold up.

Referee Report

1 major / 1 minor

Summary. The manuscript presents COMPASS, a multi-stage framework for manipulation planning in confined and cluttered spaces. It combines a near-field awareness scan to construct a local collision map, a multi-objective utility function that balances information gain with manipulation feasibility for viewpoint selection, and a constrained optimization step to generate valid manipulation poses. The authors introduce a four-level benchmark of increasing difficulty for confined-space tasks and report that their approach achieves a 24.25% higher manipulation success rate than prior exploration methods (designed for other robots and focused solely on information gain) in simulation, with supporting real-world experiments.

Significance. If the performance gains can be shown to be statistically robust and attributable to the manipulation-aware components rather than baseline implementation differences, the work would provide a useful integration of active sensing and constrained manipulation planning. The structured four-level benchmark is a constructive contribution that could facilitate more comparable evaluations in this domain.

major comments (1)

[Evaluation section] Evaluation section (results reporting the 24.25% success-rate gain): The central empirical claim lacks essential supporting details, including the number of independent trials, measures of statistical significance or variance, exact baseline implementations (e.g., whether constrained pose optimization or near-field collision maps were added to the comparison methods), and any post-hoc scenario selection criteria. Without these, it is not possible to determine whether the reported delta arises from the proposed multi-objective utility and constrained optimization or from unadapted baselines.

minor comments (1)

[Abstract] Abstract: 'four level challenging scenarios' should be written as 'four-level challenging scenarios' for standard hyphenation and readability.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the single major comment on the evaluation section below and commit to revisions that strengthen the empirical reporting without altering the core claims or experimental design.

read point-by-point responses

Referee: [Evaluation section] Evaluation section (results reporting the 24.25% success-rate gain): The central empirical claim lacks essential supporting details, including the number of independent trials, measures of statistical significance or variance, exact baseline implementations (e.g., whether constrained pose optimization or near-field collision maps were added to the comparison methods), and any post-hoc scenario selection criteria. Without these, it is not possible to determine whether the reported delta arises from the proposed multi-objective utility and constrained optimization or from unadapted baselines.

Authors: We agree that the manuscript would benefit from greater transparency in the evaluation section. In the revised version we will explicitly state the number of independent trials run per benchmark level, report variance (standard deviation) across trials, and include statistical significance testing (e.g., paired t-tests) for the observed 24.25% improvement. We will also clarify that the baselines were the original information-gain-only implementations from the cited works, without the addition of our near-field collision mapping or constrained pose optimization; this choice was deliberate to isolate the benefit of the full manipulation-aware pipeline. Finally, we will confirm that every scenario in the four-level benchmark was evaluated in full, with no post-hoc filtering or selection of results. These additions will make the attribution of performance gains clearer while preserving the original experimental protocol. revision: yes

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper describes an empirical robotics framework (COMPASS) with components for near-field scanning, multi-objective viewpoint selection, and constrained pose optimization, evaluated on a four-level benchmark via simulation success rates and real-world trials. No equations, derivations, or first-principles results are presented that reduce the reported 24.25% success-rate gain to quantities fitted from the same data or defined in terms of the output itself. Comparisons are framed against external methods, and no load-bearing self-citations or uniqueness theorems are invoked to force the central claims. The evaluation remains independent of the method's internal definitions.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central performance claim rests on the effectiveness of an unspecified multi-objective utility function and on the representativeness of the new benchmark scenarios; both involve design choices whose tuning details are not visible in the abstract.

free parameters (1)

weights in multi-objective utility function
Balances information gain against manipulation feasibility; values not reported in abstract and likely chosen or fitted to achieve reported gains.

axioms (1)

domain assumption Sampling-based planners can be extended to respect manipulation constraints while remaining computationally tractable in confined spaces
Invoked by the description of the manipulation-aware planner and constrained optimization step.

pith-pipeline@v0.9.0 · 5707 in / 1004 out tokens · 38306 ms · 2026-05-21T22:02:28.636621+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

26 extracted references · 26 canonical work pages · 3 internal anchors

[1]

Diffusion policy: Visuomotor policy learning via ac- tion diffusion,

C. Chi, Z. Xu, S. Feng, E. Cousineau, Y . Du, B. Burchfiel, R. Tedrake, and S. Song, “Diffusion policy: Visuomotor policy learning via ac- tion diffusion,”The International Journal of Robotics Research, p. 02783649241273668, 2023

work page 2023
[2]

Omnimanip: Towards general robotic manipulation via object-centric interaction primitives as spatial constraints,

M. Pan, J. Zhang, T. Wu, Y . Zhao, W. Gao, and H. Dong, “Omnimanip: Towards general robotic manipulation via object-centric interaction primitives as spatial constraints,” inProceedings of the Computer Vision and Pattern Recognition Conference, 2025, pp. 17 359–17 369

work page 2025
[3]

OpenVLA: An Open-Source Vision-Language-Action Model

M. J. Kim, K. Pertsch, S. Karamcheti, T. Xiao, A. Balakrishna, S. Nair, R. Rafailov, E. Foster, G. Lam, P. Sanketiet al., “Open- vla: An open-source vision-language-action model,”arXiv preprint arXiv:2406.09246, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[4]

3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

Y . Ze, G. Zhang, K. Zhang, C. Hu, M. Wang, and H. Xu, “3d diffusion policy: Generalizable visuomotor policy learning via simple 3d representations,”arXiv preprint arXiv:2403.03954, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[5]

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

W. Huang, C. Wang, Y . Li, R. Zhang, and L. Fei-Fei, “Rekep: Spatio-temporal reasoning of relational keypoint constraints for robotic manipulation,”arXiv preprint arXiv:2409.01652, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[6]

Motion primitives-based path planning for fast and agile exploration using aerial robots,

M. Dharmadhikari, T. Dang, L. Solanka, J. Loje, H. Nguyen, N. Khedekar, and K. Alexis, “Motion primitives-based path planning for fast and agile exploration using aerial robots,” in2020 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2020, pp. 179–185

work page 2020
[7]

Rapid exploration for open-world navigation with latent goal models,

D. Shah, B. Eysenbach, G. Kahn, N. Rhinehart, and S. Levine, “Rapid exploration for open-world navigation with latent goal models,”arXiv preprint arXiv:2104.05859, 2021

work page arXiv 2021
[8]

Dsvp: Dual-stage viewpoint planner for rapid exploration by dynamic expansion,

H. Zhu, C. Cao, Y . Xia, S. Scherer, J. Zhang, and W. Wang, “Dsvp: Dual-stage viewpoint planner for rapid exploration by dynamic expansion,” in2021 IEEE/RSJ international conference on intelligent robots and systems (IROS). IEEE, 2021, pp. 7623–7630

work page 2021
[9]

Tare: A hierarchical framework for efficiently exploring complex 3d environments

C. Cao, H. Zhu, H. Choset, and J. Zhang, “Tare: A hierarchical framework for efficiently exploring complex 3d environments.” in Robotics: Science and Systems, vol. 5, 2021, p. 2

work page 2021
[10]

A survey on active simultaneous localization and mapping: State of the art and new frontiers,

J. A. Placed, J. Strader, H. Carrillo, N. Atanasov, V . Indelman, L. Carlone, and J. A. Castellanos, “A survey on active simultaneous localization and mapping: State of the art and new frontiers,”IEEE Transactions on Robotics, vol. 39, no. 3, pp. 1686–1705, 2023

work page 2023
[11]

Online next-best-view planner for 3d-exploration and inspection with a mobile manipulator robot,

M. Naazare, F. G. Rosas, and D. Schulz, “Online next-best-view planner for 3d-exploration and inspection with a mobile manipulator robot,”IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 3779– 3786, 2022

work page 2022
[12]

Autonomous 3-d reconstruction, mapping, and exploration of indoor environments with a robotic arm,

Y . Wang, S. James, E. K. Stathopoulou, C. Beltr ´an-Gonz´alez, Y . Kon- ishi, and A. Del Bue, “Autonomous 3-d reconstruction, mapping, and exploration of indoor environments with a robotic arm,”IEEE Robotics and Automation Letters, vol. 4, no. 4, pp. 3340–3347, 2019

work page 2019
[13]

Robot active neural sensing and planning in unknown cluttered environments,

H. Ren and A. H. Qureshi, “Robot active neural sensing and planning in unknown cluttered environments,”IEEE Transactions on Robotics, vol. 39, no. 4, pp. 2738–2750, 2023

work page 2023
[14]

Graspnet-1billion: A large- scale benchmark for general object grasping,

H.-S. Fang, C. Wang, M. Gou, and C. Lu, “Graspnet-1billion: A large- scale benchmark for general object grasping,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 11 444–11 453

work page 2020
[15]

Anygrasp: Robust and efficient grasp perception in spatial and temporal domains,

H.-S. Fang, C. Wang, H. Fang, M. Gou, J. Liu, H. Yan, W. Liu, Y . Xie, and C. Lu, “Anygrasp: Robust and efficient grasp perception in spatial and temporal domains,”IEEE Transactions on Robotics, vol. 39, no. 5, pp. 3929–3945, 2023

work page 2023
[16]

Grasp planning for occluded objects in a confined space with lateral view using monte carlo tree search,

M. Kang, H. Kee, J. Kim, and S. Oh, “Grasp planning for occluded objects in a confined space with lateral view using monte carlo tree search,” in2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022, pp. 10 921–10 926

work page 2022
[17]

Making objects graspable in confined environments through push and pull manipulation with a tool,

S. Elliott, M. Valente, and M. Cakmak, “Making objects graspable in confined environments through push and pull manipulation with a tool,” in2016 IEEE international conference on robotics and automation (ICRA). IEEE, 2016, pp. 4851–4858

work page 2016
[18]

Octomap: An efficient probabilistic 3d mapping framework based on octrees,

A. Hornung, K. M. Wurm, M. Bennewitz, C. Stachniss, and W. Bur- gard, “Octomap: An efficient probabilistic 3d mapping framework based on octrees,”Autonomous robots, vol. 34, no. 3, pp. 189–206, 2013

work page 2013
[19]

Sampling-based motion planning: A comparative review,

A. Orthey, C. Chamzas, and L. E. Kavraki, “Sampling-based motion planning: A comparative review,”Annual Review of Control, Robotics, and Autonomous Systems, vol. 7, 2023

work page 2023
[20]

Graph-based subterranean exploration path planning using aerial and legged robots,

T. Dang, M. Tranzatto, S. Khattak, F. Mascarich, K. Alexis, and M. Hutter, “Graph-based subterranean exploration path planning using aerial and legged robots,”Journal of Field Robotics, vol. 37, no. 8, pp. 1363–1388, 2020

work page 2020
[21]

Manipulability of robotic mechanisms,

T. Yoshikawa, “Manipulability of robotic mechanisms,”The interna- tional journal of Robotics Research, vol. 4, no. 2, pp. 3–9, 1985

work page 1985
[22]

Yolo- world: Real-time open-vocabulary object detection,

T. Cheng, L. Song, Y . Ge, W. Liu, X. Wang, and Y . Shan, “Yolo- world: Real-time open-vocabulary object detection,”arXiv preprint arXiv:2401.17270, 2024

work page arXiv 2024
[23]

NVIDIA Isaac Sim,

NVIDIA Corporation, “NVIDIA Isaac Sim,” https://developer.nvidia. com/isaac/sim

work page
[24]

Franka emika,

Franka Emika, “Franka emika,” https://franka.de/

work page
[25]

Moveit![ros topics],

S. Chitta, I. Sucan, and S. Cousins, “Moveit![ros topics],”IEEE robotics & automation magazine, vol. 19, no. 1, pp. 18–19, 2012

work page 2012
[26]

An informa- tion gain formulation for active volumetric 3d reconstruction,

S. Isler, R. Sabzevari, J. Delmerico, and D. Scaramuzza, “An informa- tion gain formulation for active volumetric 3d reconstruction,” in2016 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2016, pp. 3477–3484

work page 2016

[1] [1]

Diffusion policy: Visuomotor policy learning via ac- tion diffusion,

C. Chi, Z. Xu, S. Feng, E. Cousineau, Y . Du, B. Burchfiel, R. Tedrake, and S. Song, “Diffusion policy: Visuomotor policy learning via ac- tion diffusion,”The International Journal of Robotics Research, p. 02783649241273668, 2023

work page 2023

[2] [2]

Omnimanip: Towards general robotic manipulation via object-centric interaction primitives as spatial constraints,

M. Pan, J. Zhang, T. Wu, Y . Zhao, W. Gao, and H. Dong, “Omnimanip: Towards general robotic manipulation via object-centric interaction primitives as spatial constraints,” inProceedings of the Computer Vision and Pattern Recognition Conference, 2025, pp. 17 359–17 369

work page 2025

[3] [3]

OpenVLA: An Open-Source Vision-Language-Action Model

M. J. Kim, K. Pertsch, S. Karamcheti, T. Xiao, A. Balakrishna, S. Nair, R. Rafailov, E. Foster, G. Lam, P. Sanketiet al., “Open- vla: An open-source vision-language-action model,”arXiv preprint arXiv:2406.09246, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[4] [4]

3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

Y . Ze, G. Zhang, K. Zhang, C. Hu, M. Wang, and H. Xu, “3d diffusion policy: Generalizable visuomotor policy learning via simple 3d representations,”arXiv preprint arXiv:2403.03954, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[5] [5]

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

W. Huang, C. Wang, Y . Li, R. Zhang, and L. Fei-Fei, “Rekep: Spatio-temporal reasoning of relational keypoint constraints for robotic manipulation,”arXiv preprint arXiv:2409.01652, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[6] [6]

Motion primitives-based path planning for fast and agile exploration using aerial robots,

M. Dharmadhikari, T. Dang, L. Solanka, J. Loje, H. Nguyen, N. Khedekar, and K. Alexis, “Motion primitives-based path planning for fast and agile exploration using aerial robots,” in2020 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2020, pp. 179–185

work page 2020

[7] [7]

Rapid exploration for open-world navigation with latent goal models,

D. Shah, B. Eysenbach, G. Kahn, N. Rhinehart, and S. Levine, “Rapid exploration for open-world navigation with latent goal models,”arXiv preprint arXiv:2104.05859, 2021

work page arXiv 2021

[8] [8]

Dsvp: Dual-stage viewpoint planner for rapid exploration by dynamic expansion,

H. Zhu, C. Cao, Y . Xia, S. Scherer, J. Zhang, and W. Wang, “Dsvp: Dual-stage viewpoint planner for rapid exploration by dynamic expansion,” in2021 IEEE/RSJ international conference on intelligent robots and systems (IROS). IEEE, 2021, pp. 7623–7630

work page 2021

[9] [9]

Tare: A hierarchical framework for efficiently exploring complex 3d environments

C. Cao, H. Zhu, H. Choset, and J. Zhang, “Tare: A hierarchical framework for efficiently exploring complex 3d environments.” in Robotics: Science and Systems, vol. 5, 2021, p. 2

work page 2021

[10] [10]

A survey on active simultaneous localization and mapping: State of the art and new frontiers,

J. A. Placed, J. Strader, H. Carrillo, N. Atanasov, V . Indelman, L. Carlone, and J. A. Castellanos, “A survey on active simultaneous localization and mapping: State of the art and new frontiers,”IEEE Transactions on Robotics, vol. 39, no. 3, pp. 1686–1705, 2023

work page 2023

[11] [11]

Online next-best-view planner for 3d-exploration and inspection with a mobile manipulator robot,

M. Naazare, F. G. Rosas, and D. Schulz, “Online next-best-view planner for 3d-exploration and inspection with a mobile manipulator robot,”IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 3779– 3786, 2022

work page 2022

[12] [12]

Autonomous 3-d reconstruction, mapping, and exploration of indoor environments with a robotic arm,

Y . Wang, S. James, E. K. Stathopoulou, C. Beltr ´an-Gonz´alez, Y . Kon- ishi, and A. Del Bue, “Autonomous 3-d reconstruction, mapping, and exploration of indoor environments with a robotic arm,”IEEE Robotics and Automation Letters, vol. 4, no. 4, pp. 3340–3347, 2019

work page 2019

[13] [13]

Robot active neural sensing and planning in unknown cluttered environments,

H. Ren and A. H. Qureshi, “Robot active neural sensing and planning in unknown cluttered environments,”IEEE Transactions on Robotics, vol. 39, no. 4, pp. 2738–2750, 2023

work page 2023

[14] [14]

Graspnet-1billion: A large- scale benchmark for general object grasping,

H.-S. Fang, C. Wang, M. Gou, and C. Lu, “Graspnet-1billion: A large- scale benchmark for general object grasping,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 11 444–11 453

work page 2020

[15] [15]

Anygrasp: Robust and efficient grasp perception in spatial and temporal domains,

H.-S. Fang, C. Wang, H. Fang, M. Gou, J. Liu, H. Yan, W. Liu, Y . Xie, and C. Lu, “Anygrasp: Robust and efficient grasp perception in spatial and temporal domains,”IEEE Transactions on Robotics, vol. 39, no. 5, pp. 3929–3945, 2023

work page 2023

[16] [16]

Grasp planning for occluded objects in a confined space with lateral view using monte carlo tree search,

M. Kang, H. Kee, J. Kim, and S. Oh, “Grasp planning for occluded objects in a confined space with lateral view using monte carlo tree search,” in2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022, pp. 10 921–10 926

work page 2022

[17] [17]

Making objects graspable in confined environments through push and pull manipulation with a tool,

S. Elliott, M. Valente, and M. Cakmak, “Making objects graspable in confined environments through push and pull manipulation with a tool,” in2016 IEEE international conference on robotics and automation (ICRA). IEEE, 2016, pp. 4851–4858

work page 2016

[18] [18]

Octomap: An efficient probabilistic 3d mapping framework based on octrees,

A. Hornung, K. M. Wurm, M. Bennewitz, C. Stachniss, and W. Bur- gard, “Octomap: An efficient probabilistic 3d mapping framework based on octrees,”Autonomous robots, vol. 34, no. 3, pp. 189–206, 2013

work page 2013

[19] [19]

Sampling-based motion planning: A comparative review,

A. Orthey, C. Chamzas, and L. E. Kavraki, “Sampling-based motion planning: A comparative review,”Annual Review of Control, Robotics, and Autonomous Systems, vol. 7, 2023

work page 2023

[20] [20]

Graph-based subterranean exploration path planning using aerial and legged robots,

T. Dang, M. Tranzatto, S. Khattak, F. Mascarich, K. Alexis, and M. Hutter, “Graph-based subterranean exploration path planning using aerial and legged robots,”Journal of Field Robotics, vol. 37, no. 8, pp. 1363–1388, 2020

work page 2020

[21] [21]

Manipulability of robotic mechanisms,

T. Yoshikawa, “Manipulability of robotic mechanisms,”The interna- tional journal of Robotics Research, vol. 4, no. 2, pp. 3–9, 1985

work page 1985

[22] [22]

Yolo- world: Real-time open-vocabulary object detection,

T. Cheng, L. Song, Y . Ge, W. Liu, X. Wang, and Y . Shan, “Yolo- world: Real-time open-vocabulary object detection,”arXiv preprint arXiv:2401.17270, 2024

work page arXiv 2024

[23] [23]

NVIDIA Isaac Sim,

NVIDIA Corporation, “NVIDIA Isaac Sim,” https://developer.nvidia. com/isaac/sim

work page

[24] [24]

Franka emika,

Franka Emika, “Franka emika,” https://franka.de/

work page

[25] [25]

Moveit![ros topics],

S. Chitta, I. Sucan, and S. Cousins, “Moveit![ros topics],”IEEE robotics & automation magazine, vol. 19, no. 1, pp. 18–19, 2012

work page 2012

[26] [26]

An informa- tion gain formulation for active volumetric 3d reconstruction,

S. Isler, R. Sabzevari, J. Delmerico, and D. Scaramuzza, “An informa- tion gain formulation for active volumetric 3d reconstruction,” in2016 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2016, pp. 3477–3484

work page 2016