STEM: Semantic Target Search and Exploration using MAVs in Cluttered Environments

Javier Alonso-Mora; Laura Ferranti; Max Lodel; Nikhil Sethi; Robert Babu\v{s}ka

arxiv: 2606.00762 · v1 · pith:6RDMZIIInew · submitted 2026-05-30 · 💻 cs.RO

STEM: Semantic Target Search and Exploration using MAVs in Cluttered Environments

Nikhil Sethi , Max Lodel , Laura Ferranti , Robert Babu\v{s}ka , Javier Alonso-Mora This is my paper

Pith reviewed 2026-06-28 18:27 UTC · model grok-4.3

classification 💻 cs.RO

keywords semantic target searchMAV explorationcluttered environmentsactive perceptioncombinatorial plannerviewpoint planningLLM similarity

0 comments

The pith

A combinatorial planner using propagated semantic priorities lets MAVs find targets faster in cluttered 3D spaces.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a framework for micro aerial vehicles to search for targets in complex unstructured 3D environments. It pairs a combinatorial planner that selects efficient viewpoints with an active perception pipeline that spreads semantic priorities from observed objects into nearby frontier voxels. This propagation supplies semantic information gains that steer the planner toward the target. Language-model similarity scores supply the initial object priorities. Evaluations in two simulation environments and real MAV flights show faster target detection than baselines while keeping exploration times reasonable.

Core claim

The paper establishes a semantically-guided viewpoint planner for MAVs that minimizes target search and exploration time in unstructured 3D environments by using a combinatorial planner prioritizing viewpoints based on semantic information gains, where an active perception pipeline propagates semantic priorities of observed objects into neighboring frontier voxels and LLM-based similarity scores provide the priority input.

What carries the argument

The combinatorial planner that generates efficient semantic exploration plans by prioritizing viewpoints according to semantic information gains computed after the active perception pipeline propagates object priorities to frontier voxels.

If this is right

MAVs can locate targets in rescue scenarios with shorter search times while respecting battery limits.
The planner maintains performance under small sensor ranges and semantic uncertainty.
LLM similarity scores allow flexible priority assignment without fixed object taxonomies.
The same structure works across distinct simulation settings and transfers to physical flights.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Adding explicit uncertainty modeling to the priority propagation step could make gains more robust when detections are unreliable.
The frontier-voxel propagation mechanism could be reused for other 3D tasks such as inspection or mapping.
Coordinating multiple MAVs under the same planner might cover larger areas without proportional increase in search time.

Load-bearing premise

The active perception pipeline can reliably propagate semantic priorities of observed objects into neighboring frontier voxels to compute useful information gains for the combinatorial planner.

What would settle it

Compare target search times in a cluttered test environment where semantic labels are deliberately inverted or noisy against a non-semantic baseline; if the semantic method takes longer, the claim fails.

read the original abstract

Autonomous target search is crucial for deploying Micro Aerial Vehicles (MAVs) in emergency response and rescue missions. Existing approaches either focus on 2D semantic navigation in structured environments -- which is less effective in complex 3D settings, or on robotic exploration in cluttered spaces -- which often lacks the semantic reasoning needed for efficient target search. This paper overcomes these limitations by proposing a novel framework that utilizes a semantically-guided viewpoint planner to minimize target search and exploration time in unstructured 3D environments using an MAV. Specifically, we develop a combinatorial planner that generates efficient semantic exploration plans by prioritizing viewpoints that likely lead to the target. To guide the planner towards the target, an active perception pipeline is developed that propagates semantic priorities of observed objects into neighboring frontier voxels for computing semantic information gains of frontier viewpoints. In addition, we demonstrate how LLM-based similarity scores can be leveraged as semantic priority input to our pipeline. Evaluations in two distinct simulation environments show that the proposed method consistently outperforms baselines by quickly finding the target while maintaining reasonable exploration times. Real-world experiments with an MAV further demonstrate the method's ability to handle practical constraints like limited battery life, small sensor range, and semantic uncertainty.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

STEM integrates LLM semantic scores with voxel priority propagation and a combinatorial planner for MAV target search, backed by sim and real MAV tests, but the propagation step stays high-level and the abstract gives no numbers.

read the letter

The main thing to know is that this paper puts together semantic priorities (including LLM similarity scores), propagates them into neighboring frontier voxels, and uses the resulting gains in a combinatorial viewpoint planner to find targets faster with an MAV in 3D clutter.

What is new is the specific active perception pipeline that turns observed object semantics into information gains for the planner, rather than keeping semantics and exploration separate. The real-world MAV flights that account for battery limits, small sensor range, and semantic uncertainty are a clear plus and show the method can run under practical constraints.

The soft spots are proportionate. The propagation step into voxels is the central assumption, and the paper describes it at a high level without showing the exact update rules or how it handles semantic errors. The abstract states consistent outperformance in two simulation environments but supplies no quantitative results, baselines, or error bars, so the size of the gains is hard to judge from the given text. If the full paper has clear tables and ablations, that would address it.

This is aimed at robotics researchers working on semantic navigation or exploration for aerial vehicles. A reader focused on practical MAV deployment for search tasks would get value from the integration and the hardware validation.

I would send it for peer review because it has both simulation and real MAV evidence on a relevant problem and the approach is concrete enough to benefit from referee feedback.

Referee Report

1 major / 1 minor

Summary. The manuscript presents STEM, a framework for semantic target search and exploration with MAVs in cluttered 3D environments. It introduces a combinatorial planner that prioritizes viewpoints likely to lead to the target, guided by an active perception pipeline that propagates semantic priorities of observed objects into neighboring frontier voxels to compute semantic information gains; LLM-based similarity scores serve as semantic priority inputs. Evaluations in two simulation environments are claimed to show consistent outperformance over baselines in target-finding speed while maintaining reasonable exploration times, with additional real-world MAV experiments demonstrating handling of constraints including limited battery life, small sensor range, and semantic uncertainty.

Significance. If the empirical claims hold with robust quantitative support, the work would advance integration of semantic reasoning into 3D exploration planning for MAVs, addressing gaps between 2D semantic navigation and non-semantic exploration methods. The real-world validation under practical constraints and the use of LLMs for semantic input are strengths. No machine-checked proofs, parameter-free derivations, or reproducible code artifacts are claimed; support rests on the described empirical evaluations.

major comments (1)

Abstract: the central claim that the method 'consistently outperforms baselines' and demonstrates real-world feasibility is asserted without any quantitative results, error bars, baseline details, or data exclusion rules. This is load-bearing for the empirical validation of the active perception pipeline and combinatorial planner.

minor comments (1)

Active perception pipeline (abstract paragraph): the propagation of semantic priorities into neighboring frontier voxels and the exact computation of semantic information gains are described only at a high level without algorithms, equations, or pseudocode, limiting assessment of how the pipeline produces usable gains for the planner.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for highlighting the need for quantitative support in the abstract. We address this point directly below.

read point-by-point responses

Referee: [—] Abstract: the central claim that the method 'consistently outperforms baselines' and demonstrates real-world feasibility is asserted without any quantitative results, error bars, baseline details, or data exclusion rules. This is load-bearing for the empirical validation of the active perception pipeline and combinatorial planner.

Authors: We agree that the abstract's empirical claims would be more robust with quantitative anchors. In the revised manuscript we will update the abstract to include concise performance highlights drawn from the results (e.g., mean target-search-time reductions with standard deviations relative to each baseline, number of trials, and the two simulation environments). These numbers will be cross-referenced to the corresponding figures/tables that already report error bars, baseline implementations, and data-exclusion criteria. The revision will keep the abstract within length limits while directly supporting the stated contributions of the combinatorial planner and active-perception pipeline. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The provided abstract and manuscript description contain no equations, derivations, or parameter-fitting steps that could reduce to self-definition or fitted inputs called predictions. The framework is described at a high level (combinatorial planner guided by active perception pipeline using semantic priorities and LLM similarity scores), with support coming from empirical evaluations in two simulation environments and real-world MAV experiments. No load-bearing self-citations, uniqueness theorems, or ansatzes smuggled via citation are referenced in the text. The central claims rest on direct experimental outperformance rather than any internal reduction to inputs by construction, making the argument self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no free parameters, axioms, or invented entities are identifiable or extractable from the provided text.

pith-pipeline@v0.9.1-grok · 5752 in / 1092 out tokens · 18111 ms · 2026-06-28T18:27:19.159054+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

38 extracted references · 1 canonical work pages

[1]

In:Proceedings of (ECCV) Euro- pean Conference on Computer Vision, pp

Chaplot, D.S., Jiang, H., Gupta, S., Gupta, A.: Semantic Curiosity for Active Visual Learning. In:Proceedings of (ECCV) Euro- pean Conference on Computer Vision, pp. 309–326 (2020)

2020
[2]

In:Proceedings of The 6th Conference on Robot Learning, vol

Kim, N., Kwon, O., Yoo, H., Choi, Y., Park, J., Oh, S.: Topological semantic graph mem- ory for image-goal navigation. In:Proceedings of The 6th Conference on Robot Learning, vol. 205, pp. 393–402 (2023)

2023
[3]

In:2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)(2022)

Ramakrishnan, S.K., Chaplot, D.S., Al- Halah, Z., Malik, J., Grauman, K.: PONI: Potential Functions for ObjectGoal Naviga- tion with Interaction-free Learning. In:2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)(2022)

2022
[4]

In:Advances in Neural Informa- tion Processing Systems, vol

Hahn, M., Chaplot, D.S., Tulsiani, S., Mukadam, M., Rehg, J.M., Gupta, A.: No rl, no simulation: Learning to navigate without navigating. In:Advances in Neural Informa- tion Processing Systems, vol. 34, pp. 26661– 26673 (2021)

2021
[5]

In:2024 IEEE International Conference on Robotics and Automation (ICRA)(2024)

Yokoyama, N., Ha, S., Batra, D., Wang, J., Bucher, B.: VLFM: Vision-Language Fron- tier Maps for Zero-Shot Semantic Navigation. In:2024 IEEE International Conference on Robotics and Automation (ICRA)(2024)

2024
[6]

In:Proceedings of Robotics: Science and Systems(2023)

Chen, J., Li, G., Kumar, S., Ghanem, B., Yu, F.: How To Not Train Your Dragon: Training-free Embodied Object Goal Naviga- tion with Semantic Frontiers. In:Proceedings of Robotics: Science and Systems(2023)

2023
[7]

In:2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)(2023)

Gadre, S.Y., Wortsman, M., Ilharco, G., Schmidt, L., Song, S.: Cows on pasture: Baselines and benchmarks for language- driven zero-shot object navigation. In:2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)(2023)

2023
[8]

In:2023 IEEE International Conference on Robotics and Automation (ICRA)(2023)

Huang, C., Mees, O., Zeng, A., Burgard, W.: Visual language maps for robot navigation. In:2023 IEEE International Conference on Robotics and Automation (ICRA)(2023)

2023
[9]

In:Proceedings of Robotics: Science and Systems(2024)

Ginting, M.F., Kim, S.-K., Fan, D.D., Palieri, M., Kochenderfer, M.J., Agha-Mohammadi, A.-A.: SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection 18 Tasks. In:Proceedings of Robotics: Science and Systems(2024)

2024
[10]

Zhou, B., Zhang, Y., Chen, X., Shen, S.: Fuel: Fast UAV exploration using incremen- tal frontier structure and hierarchical plan- ning.IEEE Robotics and Automation Letters 6(2), 779–786 (2021)

2021
[11]

Science Robotics8(80) (2023)

Cao, C., Zhu, H., Ren, Z., Choset, H., Zhang, J.: Representation granularity enables time- efficient autonomous exploration in large, complex worlds. Science Robotics8(80) (2023)

2023
[12]

Meng, Z., Qin, H., Chen, Z., Chen, X., Sun, H., Lin, F., Ang, M.H.: A Two-Stage Opti- mized Next-View Planning Framework for 3-D Unknown Environment Exploration, and Structural Reconstruction.IEEE Robotics and Automation Letters2(3), 1680–1687 (2017)

2017
[13]

In:2016 IEEE International Conference on Robotics and Automation (ICRA), pp

Bircher, A., Kamel, M., Alexis, K., Oleynikova, H., Siegwart, R.: Receding Horizon ”Next-Best-View” Planner for 3D Exploration. In:2016 IEEE International Conference on Robotics and Automation (ICRA), pp. 1462–1468 (2016)

2016
[14]

In:2020 IEEE International Con- ference on Robotics and Automation (ICRA) (2020)

Dharmadhikari, M., Dang, T., Solanka, L., Loje, J., Nguyen, H., Khedekar, N., Alexis, K.: Motion Primitives-based Path Planning for Fast and Agile Exploration using Aerial Robots. In:2020 IEEE International Con- ference on Robotics and Automation (ICRA) (2020)

2020
[15]

Schmid, L., Pantic, M., Khanna, R., Ott, L., Siegwart, R., Nieto, J.: An Efficient Sampling-Based Method for Online Infor- mative Path Planning in Unknown Envi- ronments.IEEE Robotics and Automation Letters5, 1–1 (2020)

2020
[16]

In:2018 IEEE Interna- tional Conference on Robotics and Automa- tion (ICRA), pp

Dang, T., Papachristos, C., Alexis, K.: Visual Saliency-Aware Receding Horizon Autonomous Exploration with Application to Aerial Robotics. In:2018 IEEE Interna- tional Conference on Robotics and Automa- tion (ICRA), pp. 2526–2533 (2018)

2018
[17]

In:Proceed- ings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA’97, pp

Yamauchi, B.: A frontier-based approach for autonomous exploration. In:Proceed- ings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA’97, pp. 146–151 (1997)

1997
[18]

In:2017 IEEE/RSJ International Conference on Intelligent Robots and Sys- tems (IROS), pp

Cieslewski, T., Kaufmann, E., Scaramuzza, D.: Rapid exploration with multi-rotors: A frontier selection method for high speed flight. In:2017 IEEE/RSJ International Conference on Intelligent Robots and Sys- tems (IROS), pp. 2135–2142 (2017)

2017
[19]

Huang, J., Zhou, B., Fan, Z., Zhu, Y., Jie, Y., Li, L., Cheng, H.: FAEL: Fast Autonomous Exploration for Large-scale Environments With a Mobile Robot.IEEE Robotics and Automation Letters8(3), 1667–1674 (2023)

2023
[20]

IEEE Transactions on Robotics (2024)

Zhang, Y., Chen, X., Feng, C., Zhou, B., Shen, S.: FALCON: Fast Autonomous Aerial Exploration Using Coverage Path Guidance. IEEE Transactions on Robotics (2024)

2024
[21]

In:Proceedings of the 38th Inter- national Conference on Machine Learning, vol

Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sas- try, G., Askell, A., Mishkin, P., Clark, J., Krueger, G., Sutskever, I.: Learning trans- ferable visual models from natural language supervision. In:Proceedings of the 38th Inter- national Conference on Machine Learning, vol. 139, pp. 8748–8763 (2021)

2021
[22]

In:Proceedings of the 40th International Conference on Machine Learn- ing, vol

Li, J., Li, D., Savarese, S., Hoi, S.: BLIP-2: Bootstrapping language-image pre-training with frozen image encoders and large lan- guage models. In:Proceedings of the 40th International Conference on Machine Learn- ing, vol. 202, pp. 19730–19742 (2023)

2023
[23]

In:Proceedings of the 2019 Conference of the North American Chap- ter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pp

Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In:Proceedings of the 2019 Conference of the North American Chap- ter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pp. 4171–4186 (2019)

2019
[24]

In:2023 IEEE International Conference on Robotics and Automation (ICRA), pp

Papatheodorou, S., Funk, N., Tzoumanikas, D., Choi, C., Xu, B., Leutenegger, S.: Finding 19 Things in the Unknown: Semantic Object- Centric Exploration with an MAV. In:2023 IEEE International Conference on Robotics and Automation (ICRA), pp. 3339–3345 (2023)

2023
[25]

IEEE Robotics and Automation Letters (2024)

Luo, Y., Zhuang, Z., Pan, N., Feng, C., Shen, S., Gao, F., Cheng, H., Zhou, B.: Star-searcher: A complete and efficient aerial system for autonomous target search in com- plex unknown environments. IEEE Robotics and Automation Letters (2024)

2024
[26]

In:2011 IEEE International Conference on Robotics and Automation, pp

Mellinger, D., Kumar, V.: Minimum snap trajectory generation and control for quadro- tors. In:2011 IEEE International Conference on Robotics and Automation, pp. 2520–2525 (2011)

2011
[27]

In: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp

Han, L., Gao, F., Zhou, B., Shen, S.: Fiesta: Fast incremental euclidean distance fields for online motion planning of aerial robots. In: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 4423–4430 (2019)

2019
[28]

In:Proceedings of the IEEE International Conference on Computer Vision, pp

He, K., Gkioxari, G., Doll´ ar, P., Girshick, R.: Mask r-cnn. In:Proceedings of the IEEE International Conference on Computer Vision, pp. 2961–2969 (2017)

2017
[29]

arXiv (2023)

Zhang, C., Han, D., Qiao, Y., Kim, J.U., Bae, S.-H., Lee, S., Hong, C.S.: Faster segment anything: Towards lightweight sam for mobile applications. arXiv (2023)

2023
[30]

In:Proceedings of the European Conference on Computer Vision (ECCV), pp

Liu, G., Reda, F.A., Shih, K.J., Wang, T.- C., Tao, A., Catanzaro, B.: Image inpainting for irregular holes using partial convolutions. In:Proceedings of the European Conference on Computer Vision (ECCV), pp. 85–100 (2018)

2018
[31]

In:Proceed- ings of the Twenty-Sixth Annual ACM Sym- posium on Theory of Computing - STOC ’94 (1994)

Blum, A., Chalasani, P., Coppersmith, D., Pulleyblank, B., Raghavan, P., Sudan, M.: The minimum latency problem. In:Proceed- ings of the Twenty-Sixth Annual ACM Sym- posium on Theory of Computing - STOC ’94 (1994)

1994
[32]

Accepted to IEEE International Conference on Robotics and Automation (ICRA) (2026)

Lodel, M., Wilde, N., Babuska, R., Alonso- Mora, J.: Learning semantic priorities for autonomous target search. Accepted to IEEE International Conference on Robotics and Automation (ICRA) (2026)

2026
[33]

In:Handbook of Metaheuristics, pp

Pisinger, D., Ropke, S.: Large Neighborhood Search. In:Handbook of Metaheuristics, pp. 99–127 (2019)

2019
[34]

II: An Analysis of Several Heuristics for the Traveling Salesman Problem

Rosenkrantz, D.J., Stearns, R.E., Lewis, P.M. II: An Analysis of Several Heuristics for the Traveling Salesman Problem. SIAM Journal on Computing6(3) (1977)

1977
[35]

Operations Research 6(6) (1958)

Croes, G.A.: A Method for Solving Traveling- Salesman Problems. Operations Research 6(6) (1958)

1958
[36]

Chen, G., Wu, S., Shi, M., Dong, W., Zhu, H., Alonso-Mora, J.: Rast: Risk-aware spatio- temporal safety corridors for mav navigation in dynamic uncertain environments.IEEE Robotics and Automation Letters8(2), 808– 815 (2023)

2023
[37]

ObjectNav revisited: On evaluation of embodied agents navigating to objects,

Batra, D., Gokaslan, A., Kembhavi, A., Maksymets, O., Mottaghi, R., Savva, M., Toshev, A., Wijmans, E.: ObjectNav Revis- ited: On Evaluation of Embodied Agents Navigating to Objects. arXiv:2006.13171 (2020)

work page arXiv 2006
[38]

In:2021 IEEE/RSJ International Conference on Intelligent Robots and Sys- tems (IROS), pp

Yokoyama, N., Ha, S., Batra, D.: Success weighted by completion time: A dynamics- aware evaluation criteria for embodied nav- igation. In:2021 IEEE/RSJ International Conference on Intelligent Robots and Sys- tems (IROS), pp. 1562–1569 (2021) 20

2021

[1] [1]

In:Proceedings of (ECCV) Euro- pean Conference on Computer Vision, pp

Chaplot, D.S., Jiang, H., Gupta, S., Gupta, A.: Semantic Curiosity for Active Visual Learning. In:Proceedings of (ECCV) Euro- pean Conference on Computer Vision, pp. 309–326 (2020)

2020

[2] [2]

In:Proceedings of The 6th Conference on Robot Learning, vol

Kim, N., Kwon, O., Yoo, H., Choi, Y., Park, J., Oh, S.: Topological semantic graph mem- ory for image-goal navigation. In:Proceedings of The 6th Conference on Robot Learning, vol. 205, pp. 393–402 (2023)

2023

[3] [3]

In:2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)(2022)

Ramakrishnan, S.K., Chaplot, D.S., Al- Halah, Z., Malik, J., Grauman, K.: PONI: Potential Functions for ObjectGoal Naviga- tion with Interaction-free Learning. In:2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)(2022)

2022

[4] [4]

In:Advances in Neural Informa- tion Processing Systems, vol

Hahn, M., Chaplot, D.S., Tulsiani, S., Mukadam, M., Rehg, J.M., Gupta, A.: No rl, no simulation: Learning to navigate without navigating. In:Advances in Neural Informa- tion Processing Systems, vol. 34, pp. 26661– 26673 (2021)

2021

[5] [5]

In:2024 IEEE International Conference on Robotics and Automation (ICRA)(2024)

Yokoyama, N., Ha, S., Batra, D., Wang, J., Bucher, B.: VLFM: Vision-Language Fron- tier Maps for Zero-Shot Semantic Navigation. In:2024 IEEE International Conference on Robotics and Automation (ICRA)(2024)

2024

[6] [6]

In:Proceedings of Robotics: Science and Systems(2023)

Chen, J., Li, G., Kumar, S., Ghanem, B., Yu, F.: How To Not Train Your Dragon: Training-free Embodied Object Goal Naviga- tion with Semantic Frontiers. In:Proceedings of Robotics: Science and Systems(2023)

2023

[7] [7]

In:2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)(2023)

Gadre, S.Y., Wortsman, M., Ilharco, G., Schmidt, L., Song, S.: Cows on pasture: Baselines and benchmarks for language- driven zero-shot object navigation. In:2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)(2023)

2023

[8] [8]

In:2023 IEEE International Conference on Robotics and Automation (ICRA)(2023)

Huang, C., Mees, O., Zeng, A., Burgard, W.: Visual language maps for robot navigation. In:2023 IEEE International Conference on Robotics and Automation (ICRA)(2023)

2023

[9] [9]

In:Proceedings of Robotics: Science and Systems(2024)

Ginting, M.F., Kim, S.-K., Fan, D.D., Palieri, M., Kochenderfer, M.J., Agha-Mohammadi, A.-A.: SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection 18 Tasks. In:Proceedings of Robotics: Science and Systems(2024)

2024

[10] [10]

Zhou, B., Zhang, Y., Chen, X., Shen, S.: Fuel: Fast UAV exploration using incremen- tal frontier structure and hierarchical plan- ning.IEEE Robotics and Automation Letters 6(2), 779–786 (2021)

2021

[11] [11]

Science Robotics8(80) (2023)

Cao, C., Zhu, H., Ren, Z., Choset, H., Zhang, J.: Representation granularity enables time- efficient autonomous exploration in large, complex worlds. Science Robotics8(80) (2023)

2023

[12] [12]

Meng, Z., Qin, H., Chen, Z., Chen, X., Sun, H., Lin, F., Ang, M.H.: A Two-Stage Opti- mized Next-View Planning Framework for 3-D Unknown Environment Exploration, and Structural Reconstruction.IEEE Robotics and Automation Letters2(3), 1680–1687 (2017)

2017

[13] [13]

In:2016 IEEE International Conference on Robotics and Automation (ICRA), pp

Bircher, A., Kamel, M., Alexis, K., Oleynikova, H., Siegwart, R.: Receding Horizon ”Next-Best-View” Planner for 3D Exploration. In:2016 IEEE International Conference on Robotics and Automation (ICRA), pp. 1462–1468 (2016)

2016

[14] [14]

In:2020 IEEE International Con- ference on Robotics and Automation (ICRA) (2020)

Dharmadhikari, M., Dang, T., Solanka, L., Loje, J., Nguyen, H., Khedekar, N., Alexis, K.: Motion Primitives-based Path Planning for Fast and Agile Exploration using Aerial Robots. In:2020 IEEE International Con- ference on Robotics and Automation (ICRA) (2020)

2020

[15] [15]

Schmid, L., Pantic, M., Khanna, R., Ott, L., Siegwart, R., Nieto, J.: An Efficient Sampling-Based Method for Online Infor- mative Path Planning in Unknown Envi- ronments.IEEE Robotics and Automation Letters5, 1–1 (2020)

2020

[16] [16]

In:2018 IEEE Interna- tional Conference on Robotics and Automa- tion (ICRA), pp

Dang, T., Papachristos, C., Alexis, K.: Visual Saliency-Aware Receding Horizon Autonomous Exploration with Application to Aerial Robotics. In:2018 IEEE Interna- tional Conference on Robotics and Automa- tion (ICRA), pp. 2526–2533 (2018)

2018

[17] [17]

In:Proceed- ings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA’97, pp

Yamauchi, B.: A frontier-based approach for autonomous exploration. In:Proceed- ings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA’97, pp. 146–151 (1997)

1997

[18] [18]

In:2017 IEEE/RSJ International Conference on Intelligent Robots and Sys- tems (IROS), pp

Cieslewski, T., Kaufmann, E., Scaramuzza, D.: Rapid exploration with multi-rotors: A frontier selection method for high speed flight. In:2017 IEEE/RSJ International Conference on Intelligent Robots and Sys- tems (IROS), pp. 2135–2142 (2017)

2017

[19] [19]

Huang, J., Zhou, B., Fan, Z., Zhu, Y., Jie, Y., Li, L., Cheng, H.: FAEL: Fast Autonomous Exploration for Large-scale Environments With a Mobile Robot.IEEE Robotics and Automation Letters8(3), 1667–1674 (2023)

2023

[20] [20]

IEEE Transactions on Robotics (2024)

Zhang, Y., Chen, X., Feng, C., Zhou, B., Shen, S.: FALCON: Fast Autonomous Aerial Exploration Using Coverage Path Guidance. IEEE Transactions on Robotics (2024)

2024

[21] [21]

In:Proceedings of the 38th Inter- national Conference on Machine Learning, vol

Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sas- try, G., Askell, A., Mishkin, P., Clark, J., Krueger, G., Sutskever, I.: Learning trans- ferable visual models from natural language supervision. In:Proceedings of the 38th Inter- national Conference on Machine Learning, vol. 139, pp. 8748–8763 (2021)

2021

[22] [22]

In:Proceedings of the 40th International Conference on Machine Learn- ing, vol

Li, J., Li, D., Savarese, S., Hoi, S.: BLIP-2: Bootstrapping language-image pre-training with frozen image encoders and large lan- guage models. In:Proceedings of the 40th International Conference on Machine Learn- ing, vol. 202, pp. 19730–19742 (2023)

2023

[23] [23]

In:Proceedings of the 2019 Conference of the North American Chap- ter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pp

Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In:Proceedings of the 2019 Conference of the North American Chap- ter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pp. 4171–4186 (2019)

2019

[24] [24]

In:2023 IEEE International Conference on Robotics and Automation (ICRA), pp

Papatheodorou, S., Funk, N., Tzoumanikas, D., Choi, C., Xu, B., Leutenegger, S.: Finding 19 Things in the Unknown: Semantic Object- Centric Exploration with an MAV. In:2023 IEEE International Conference on Robotics and Automation (ICRA), pp. 3339–3345 (2023)

2023

[25] [25]

IEEE Robotics and Automation Letters (2024)

Luo, Y., Zhuang, Z., Pan, N., Feng, C., Shen, S., Gao, F., Cheng, H., Zhou, B.: Star-searcher: A complete and efficient aerial system for autonomous target search in com- plex unknown environments. IEEE Robotics and Automation Letters (2024)

2024

[26] [26]

In:2011 IEEE International Conference on Robotics and Automation, pp

Mellinger, D., Kumar, V.: Minimum snap trajectory generation and control for quadro- tors. In:2011 IEEE International Conference on Robotics and Automation, pp. 2520–2525 (2011)

2011

[27] [27]

In: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp

Han, L., Gao, F., Zhou, B., Shen, S.: Fiesta: Fast incremental euclidean distance fields for online motion planning of aerial robots. In: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 4423–4430 (2019)

2019

[28] [28]

In:Proceedings of the IEEE International Conference on Computer Vision, pp

He, K., Gkioxari, G., Doll´ ar, P., Girshick, R.: Mask r-cnn. In:Proceedings of the IEEE International Conference on Computer Vision, pp. 2961–2969 (2017)

2017

[29] [29]

arXiv (2023)

Zhang, C., Han, D., Qiao, Y., Kim, J.U., Bae, S.-H., Lee, S., Hong, C.S.: Faster segment anything: Towards lightweight sam for mobile applications. arXiv (2023)

2023

[30] [30]

In:Proceedings of the European Conference on Computer Vision (ECCV), pp

Liu, G., Reda, F.A., Shih, K.J., Wang, T.- C., Tao, A., Catanzaro, B.: Image inpainting for irregular holes using partial convolutions. In:Proceedings of the European Conference on Computer Vision (ECCV), pp. 85–100 (2018)

2018

[31] [31]

In:Proceed- ings of the Twenty-Sixth Annual ACM Sym- posium on Theory of Computing - STOC ’94 (1994)

Blum, A., Chalasani, P., Coppersmith, D., Pulleyblank, B., Raghavan, P., Sudan, M.: The minimum latency problem. In:Proceed- ings of the Twenty-Sixth Annual ACM Sym- posium on Theory of Computing - STOC ’94 (1994)

1994

[32] [32]

Accepted to IEEE International Conference on Robotics and Automation (ICRA) (2026)

Lodel, M., Wilde, N., Babuska, R., Alonso- Mora, J.: Learning semantic priorities for autonomous target search. Accepted to IEEE International Conference on Robotics and Automation (ICRA) (2026)

2026

[33] [33]

In:Handbook of Metaheuristics, pp

Pisinger, D., Ropke, S.: Large Neighborhood Search. In:Handbook of Metaheuristics, pp. 99–127 (2019)

2019

[34] [34]

II: An Analysis of Several Heuristics for the Traveling Salesman Problem

Rosenkrantz, D.J., Stearns, R.E., Lewis, P.M. II: An Analysis of Several Heuristics for the Traveling Salesman Problem. SIAM Journal on Computing6(3) (1977)

1977

[35] [35]

Operations Research 6(6) (1958)

Croes, G.A.: A Method for Solving Traveling- Salesman Problems. Operations Research 6(6) (1958)

1958

[36] [36]

Chen, G., Wu, S., Shi, M., Dong, W., Zhu, H., Alonso-Mora, J.: Rast: Risk-aware spatio- temporal safety corridors for mav navigation in dynamic uncertain environments.IEEE Robotics and Automation Letters8(2), 808– 815 (2023)

2023

[37] [37]

ObjectNav revisited: On evaluation of embodied agents navigating to objects,

Batra, D., Gokaslan, A., Kembhavi, A., Maksymets, O., Mottaghi, R., Savva, M., Toshev, A., Wijmans, E.: ObjectNav Revis- ited: On Evaluation of Embodied Agents Navigating to Objects. arXiv:2006.13171 (2020)

work page arXiv 2006

[38] [38]

In:2021 IEEE/RSJ International Conference on Intelligent Robots and Sys- tems (IROS), pp

Yokoyama, N., Ha, S., Batra, D.: Success weighted by completion time: A dynamics- aware evaluation criteria for embodied nav- igation. In:2021 IEEE/RSJ International Conference on Intelligent Robots and Sys- tems (IROS), pp. 1562–1569 (2021) 20

2021