An Active Perception Game for Robust Exploration

Igor Spasojevic; Pratik Chaudhari; Siming He; Vijay Kumar; Yuezhan Tao

arxiv: 2404.00769 · v5 · submitted 2024-03-31 · 💻 cs.RO

An Active Perception Game for Robust Exploration

Siming He , Yuezhan Tao , Igor Spasojevic , Vijay Kumar , Pratik Chaudhari This is my paper

Pith reviewed 2026-05-24 02:02 UTC · model grok-4.3

classification 💻 cs.RO

keywords informationgainperceptionactiveapproachestimationdifferentestimate

0 comments

The pith

A game-theoretic analysis enables online estimation of true information gain with sub-linear regret for active perception.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that the discrepancy between the estimated information gain (an expectation over future observations) and the true information gain (known only after observation) in active perception admits a game-theoretic formulation. This formulation yields an online estimator achieving sub-linear regret over time steps. A sympathetic reader would care because inaccurate estimates can cause suboptimal viewpoint selection in safety-critical tasks such as locating a person in distress. Experiments across simulations, real-world data, and physical robots confirm the estimator reduces estimation errors while improving downstream perception metrics.

Core claim

By analyzing the mathematical relationship between active perception and the estimation error of the information gain in a game-theoretic setting, the authors develop an online estimation approach that achieves sub-linear regret (in the number of time-steps) for the estimation of the true information gain and reduces the sub-optimality of active perception systems.

What carries the argument

The game-theoretic formulation of the discrepancy between estimated and true information gain, which enables derivation of the online estimator with sub-linear regret.

If this is right

Robotic systems achieve 7% higher information gain on average during exploration.
Information gain estimation errors drop by 42% on average across tested environments and map representations.
PSNR improves by 5% and semantic accuracy (correctly localized objects) increases by 6%.
Real-world ground robots produce complex trajectories that better explore occluded regions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The estimator could extend to other sequential decision settings where rewards are revealed only after actions are taken.
Integration with learned perception models might further tighten the regret bound in practice.
Controlled tests in rapidly changing scenes would check whether the sub-linear bound holds when information gain statistics drift.
keywords

Load-bearing premise

The mathematical relationship between active perception and the estimation error of the information gain admits a game-theoretic formulation that yields a practical online estimator with sub-linear regret.

What would settle it

An experiment in which the cumulative estimation error of the information gain grows linearly or faster with the number of time steps would falsify the sub-linear regret claim.

Figures

Figures reproduced from arXiv: 2404.00769 by Igor Spasojevic, Pratik Chaudhari, Siming He, Vijay Kumar, Yuezhan Tao.

**Figure 2.** Figure 2: Viewpoint selection comparison. Given a set of candidate trajectories, our method selects trajectory (b), which leads to an underexplored room, while the baseline selects trajectory (a), which remains within the same room. Theorem 4.4: Given an informative path planning algorithm which selects X t+∆t t with PT t=1 r ∗ t (Xt | y t 1 ) ≥ γ PT t=1 r ∗ t (X∗ t | y t 1 ), the active perception regret is E(ϱ) ≤… view at source ↗

**Figure 3.** Figure 3: Reconstructed Occupancy Map and Exploration of Occluded [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

read the original abstract

Active perception approaches select future viewpoints by using some estimate of the information gain. An inaccurate estimate can be detrimental in critical situations, e.g., locating a person in distress. However the true information gained can only be calculated post hoc, i.e., after the observation is realized. We present an approach to estimate the discrepancy between the estimated information gain (which is the expectation over putative future observations while neglecting correlations among them) and the true information gain. The key idea is to analyze the mathematical relationship between active perception and the estimation error of the information gain in a game-theoretic setting. Using this, we develop an online estimation approach that achieves sub-linear regret (in the number of time-steps) for the estimation of the true information gain and reduces the sub-optimality of active perception systems. We demonstrate our approach for active perception using a comprehensive set of experiments on: (a) different types of environments, including a quadrotor in a photorealistic simulation, real-world robotic data, and real-world experiments with ground robots exploring indoor and outdoor scenes; (b) different types of robotic perception data; and (c) different map representations. On average, our approach reduces information gain estimation errors by 42%, increases the information gain by 7%, PSNR by 5%, and semantic accuracy (measured as the number of objects that are localized correctly) by 6%. In real-world experiments with a Jackal ground robot, our approach demonstrated complex trajectories to explore occluded regions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Game-theoretic online estimator for info-gain discrepancy with sub-linear regret, backed by broad robot experiments but light on derivation details.

read the letter

The main new piece is the game-theoretic framing of the gap between predicted and actual information gain, turned into an online estimator that carries a sub-linear regret bound in the number of steps. That setup is what lets them claim they can reduce sub-optimality without waiting for post-hoc ground truth during operation. The abstract does not present this as a direct rewrite of standard expected-information-gain methods, so the angle looks distinct on the surface. Experiments are the stronger part of the work. They run the method on a quadrotor in photorealistic simulation, real-world datasets, and hardware with ground robots across indoor and outdoor scenes, using multiple map representations and perception types. Average reported gains are 42% lower estimation error, 7% higher information gain, 5% PSNR, and 6% semantic accuracy, with the Jackal runs producing trajectories that reach occluded areas. That breadth of platforms and environments is useful evidence that the approach is not tied to one narrow setting. The soft spots sit in the theory and reporting. The regret claim rests on the game formulation, yet the abstract supplies no derivation steps or explicit assumptions about the loss or boundedness, so the full paper needs checking for whether the bound is load-bearing or circular. Results are given as averages only, with no error bars or trial counts visible here, which leaves open how sensitive the numbers are to particular runs or environments. These issues are not fatal but they do mean the central claims require verification rather than being taken at face value. The paper is for people working on active perception and exploration planning in robotics, especially those who need to handle cases where information-gain estimates can mislead in safety-critical tasks. A reader already familiar with regret bounds in online learning would get the most out of the theory section. I would send it to peer review because the experimental scope is wide enough and the claim is concrete enough to be worth referee time, even if revisions on the proof and stats are likely.

Referee Report

0 major / 3 minor

Summary. The manuscript develops a game-theoretic formulation to analyze the discrepancy between the estimated information gain (expectation over future observations, neglecting correlations) and the true post-hoc information gain in active perception. This yields an online estimator achieving sub-linear regret in the number of time-steps, which is then used to reduce sub-optimality of viewpoint selection. The approach is evaluated on quadrotor simulation, real-world robotic datasets, and physical ground-robot experiments across indoor/outdoor scenes and multiple map representations, reporting average gains of 42% error reduction, 7% information gain increase, 5% PSNR, and 6% semantic accuracy.

Significance. If the sub-linear regret bound is rigorously established under the stated assumptions, the work provides a principled way to make active perception more robust to estimation errors in information gain, which is load-bearing for safety-critical tasks such as search-and-rescue. The breadth of experimental validation (simulation, real data, physical robots) and the explicit focus on the estimation-true gap distinguish it from heuristic active-perception methods. No machine-checked proofs or open reproducible code are mentioned.

minor comments (3)

Abstract: the reported averages (42% error reduction, etc.) are given without the number of trials, standard deviations, or data-exclusion criteria; adding these would strengthen the empirical claims.
The game-theoretic relationship is introduced without an explicit statement of the boundedness or Lipschitz assumptions required for the regret analysis; a short paragraph clarifying these would improve accessibility.
Figure captions and table headers could more clearly indicate which baseline each metric is compared against (e.g., “vs. standard IG” or “vs. random”).

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of our manuscript and the recommendation for minor revision. No specific major comments were provided in the report.

Circularity Check

0 steps flagged

No significant circularity; derivation presented as independent game-theoretic analysis

full rationale

The paper's central claim is that a game-theoretic analysis of the discrepancy between estimated and true information gain produces an online estimator achieving sub-linear regret. No equations, self-citations, or fitted parameters are shown in the provided material that reduce the regret bound or estimator to a tautology or to the input data by construction. The abstract and skeptic analysis locate no self-definitional steps, no renaming of known results, and no load-bearing self-citations. The derivation is therefore treated as self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The approach rests on the domain assumption that a game-theoretic model of the estimation error yields a tractable online algorithm; no free parameters or invented entities are identifiable from the abstract alone.

axioms (1)

domain assumption The discrepancy between estimated and realized information gain admits a game-theoretic analysis that produces an online estimator with sub-linear regret.
Stated as the key idea enabling the online estimation approach.

pith-pipeline@v0.9.0 · 5805 in / 1176 out tokens · 26682 ms · 2026-05-24T02:02:00.757039+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

42 extracted references · 42 canonical work pages · 1 internal anchor

[1]

A survey on active simultaneous localization and mapping: State of the art and new frontiers,

J. A. Placed, J. Strader, H. Carrillo, N. Atanasov, V . Indelman, L. Carlone, and J. A. Castellanos, “A survey on active simultaneous localization and mapping: State of the art and new frontiers,” IEEE Transactions on Robotics , vol. 39, no. 3, pp. 1686–1705, 2023

work page 2023
[2]

View planning in robot active vision: A survey of systems, algorithms, and applications,

R. Zeng, Y . Wen, W. Zhao, and Y .-J. Liu, “View planning in robot active vision: A survey of systems, algorithms, and applications,” Comp. Visual Media, vol. 6, p. 225–245, 2020

work page 2020
[3]

Active perception,

R. Bajcsy, “Active perception,”Proceedings of the IEEE, vol. 76, no. 8, pp. 966–1005, 1988

work page 1988
[4]

Revisiting active perception,

R. Bajcsy, Y . Aloimonos, and J. K. Tsotsos, “Revisiting active perception,” Autonomous Robots, vol. 42, pp. 177–196, 2018

work page 2018
[5]

Near-optimal sensor placements in gaussian processes: Theory, efficient algorithms and empirical studies,

A. Krause, A. Singh, and C. Guestrin, “Near-optimal sensor placements in gaussian processes: Theory, efficient algorithms and empirical studies,” J. Mach. Learn. Res. , vol. 9, p. 235–284, jun 2008

work page 2008
[6]

Nonmyopic active learning of gaussian processes: an exploration-exploitation approach,

A. Krause and C. Guestrin, “Nonmyopic active learning of gaussian processes: an exploration-exploitation approach,” in Proceedings of the 24th International Conference on Machine Learning , ser. ICML ’07. New York, NY , USA: Association for Computing Machinery, 2007, p. 449–456. [Online]. Available: https://doi.org/10.1145/1273496.1273553

work page doi:10.1145/1273496.1273553 2007
[7]

Efficient informative sensing using multiple robots,

A. Singh, A. Krause, C. Guestrin, and W. J. Kaiser, “Efficient informative sensing using multiple robots,” Journal of Artificial Intelligence Research , vol. 34, p. 707–755, Apr. 2009. [Online]. Available: http://dx.doi.org/10.1613/jair.2674

work page doi:10.1613/jair.2674 2009
[8]

Branch and bound for informative path planning,

J. Binney and G. S. Sukhatme, “Branch and bound for informative path planning,” in 2012 IEEE International Conference on Robotics and Automation, 2012, pp. 2147–2154

work page 2012
[9]

Robotic active information gathering for spatial field reconstruction with rapidly-exploring random trees and online learning of gaussian processes,

A. Viseras, D. Shutin, and L. Merino, “Robotic active information gathering for spatial field reconstruction with rapidly-exploring random trees and online learning of gaussian processes,” Sensors, vol. 19, no. 5,

work page
[10]

Available: https://www.mdpi.com/1424-8220/19/5/1016

[Online]. Available: https://www.mdpi.com/1424-8220/19/5/1016

work page
[11]

The determination of next best views,

C. Connolly, “The determination of next best views,” in Proceedings. 1985 IEEE International Conference on Robotics and Automation , vol. 2, 1985, pp. 432–435

work page 1985
[12]

Receding horizon

A. Bircher, M. Kamel, K. Alexis, H. Oleynikova, and R. Siegwart, “Receding horizon ”next-best-view” planner for 3d exploration,” in2016 IEEE International Conference on Robotics and Automation (ICRA) , 2016, pp. 1462–1468

work page 2016
[13]

Yoder and S

L. Yoder and S. Scherer, Autonomous Exploration for Infrastructure Modeling with a Micro Aerial Vehicle . Cham: Springer International Publishing, 2016, pp. 427–440. [Online]. Available: https://doi.org/10. 1007/978-3-319-27702-8 28

work page 2016
[14]

Surface edge explorer (see): Planning next best views directly from 3d observations,

R. Border, J. D. Gammell, and P. Newman, “Surface edge explorer (see): Planning next best views directly from 3d observations,” 2018 IEEE International Conference on Robotics and Automation (ICRA) , pp. 1–8, 2018

work page 2018
[15]

An efficient sampling-based method for online informative path planning in unknown environments,

L. Schmid, M. Pantic, R. Khanna, L. Ott, R. Siegwart, and J. Nieto, “An efficient sampling-based method for online informative path planning in unknown environments,” IEEE Robotics and Automation Letters , vol. 5, no. 2, pp. 1500–1507, 2020

work page 2020
[16]

Fsmi: Fast computa- tion of shannon mutual information for information-theoretic mapping,

Z. Zhang, T. Henderson, V . Sze, and S. Karaman, “Fsmi: Fast computa- tion of shannon mutual information for information-theoretic mapping,” in 2019 International Conference on Robotics and Automation (ICRA) , 2019, pp. 6912–6918

work page 2019
[17]

Information-theoretic mapping using cauchy-schwarz quadratic mutual information,

B. Charrow, S. Liu, V . Kumar, and N. Michael, “Information-theoretic mapping using cauchy-schwarz quadratic mutual information,” in 2015 IEEE International Conference on Robotics and Automation (ICRA) , 2015, pp. 4791–4798

work page 2015
[18]

An efficient and continuous approach to information-theoretic exploration,

T. Henderson, V . Sze, and S. Karaman, “An efficient and continuous approach to information-theoretic exploration,” in 2020 IEEE Inter- national Conference on Robotics and Automation (ICRA) , 2020, pp. 8566–8572

work page 2020
[19]

Learned map prediction for enhanced mobile robot exploration,

R. Shrestha, F.-P. Tian, W. Feng, P. Tan, and R. Vaughan, “Learned map prediction for enhanced mobile robot exploration,” in2019 International Conference on Robotics and Automation (ICRA) , 2019, pp. 1197–1204

work page 2019
[20]

Learning to explore indoor environments using autonomous micro aerial vehicles,

Y . Tao, E. Iceland, B. Li, E. Zwecher, U. Heinemann, A. Cohen, A. Avni, O. Gal, A. Barel, and V . Kumar, “Learning to explore indoor environments using autonomous micro aerial vehicles,” in 2024 IEEE International Conference on Robotics and Automation (ICRA) , 2024, pp. 15 758–15 764

work page 2024
[21]

Seer: Safe efficient exploration for aerial robots using learning to predict information gain,

Y . Tao, Y . Wu, B. Li, F. Cladera, A. Zhou, D. Thakur, and V . Kumar, “Seer: Safe efficient exploration for aerial robots using learning to predict information gain,” in 2023 IEEE International Conference on Robotics and Automation (ICRA) , 2023, pp. 1235–1241

work page 2023
[22]

Learning to map for active semantic goal navigation,

G. Georgakis, B. Bucher, K. Schmeckpeper, S. Singh, and K. Daniilidis, “Learning to map for active semantic goal navigation,” arXiv preprint arXiv:2106.15648, 2021

work page arXiv 2021
[23]

Active perception using neural radiance fields,

H. Siming, C. D. Hsu, D. Ong, Y . S. Shao, and P. Chaudhari, “Active perception using neural radiance fields,” in 2024 American Control Conference (ACC), 2024, pp. 4353–4358

work page 2024
[24]

3d active metric-semantic slam,

Y . Tao, X. Liu, I. Spasojevic, S. Agarwal, and V . Kumar, “3d active metric-semantic slam,” IEEE Robotics and Automation Letters , vol. 9, no. 3, pp. 2989–2996, 2024

work page 2024
[25]

Active scout: Multi-target tracking using neural radiance fields in dense urban environments,

C. D. Hsu and P. Chaudhari, “Active scout: Multi-target tracking using neural radiance fields in dense urban environments,” arXiv preprint arXiv:2406.07431, 2024

work page arXiv 2024
[26]

Semantic octree mapping and shannon mutual information computation for robot exploration,

A. Asgharivaskasi and N. Atanasov, “Semantic octree mapping and shannon mutual information computation for robot exploration,” IEEE Transactions on Robotics , vol. 39, no. 3, pp. 1910–1928, 2023

work page 1910
[27]

RT-GuIDE: Real- Time Gaussian splatting for Information-Driven Exploration

Y . Tao, D. Ong, V . Murali, I. Spasojevic, P. Chaudhari, and V . Ku- mar, “Rt-guide: Real-time gaussian splatting for information-driven exploration,” arXiv preprint arXiv:2409.18122 , 2024

work page arXiv 2024
[28]

Learning to Act by Predicting the Future

A. Dosovitskiy and V . Koltun, “Learning to act by predicting the future,” arXiv preprint arXiv:1611.01779 , 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[29]

Multi-agent exploration of an unknown sparse landmark complex via deep reinforcement learning,

X. Sun, Y . Wu, S. Bhattacharya, and V . Kumar, “Multi-agent exploration of an unknown sparse landmark complex via deep reinforcement learning,” arXiv preprint arXiv:2209.11794 , 2022

work page arXiv 2022
[30]

Learning to explore using active neural slam,

D. S. Chaplot, D. Gandhi, S. Gupta, A. Gupta, and R. Salakhutdi- nov, “Learning to explore using active neural slam,” arXiv preprint arXiv:2004.05155, 2020

work page arXiv 2004
[31]

Lecture notes: Bandits, experts and games (lecture 8),

A. Slivkins, “Lecture notes: Bandits, experts and games (lecture 8),” 2016. [Online]. Available: https://www.cs.umd.edu/ ∼slivkins/ CMSC858G-fall16/lecture8-both.pdf

work page 2016
[32]

Nerfacc: Efficient sampling accelerates nerfs

R. Li, H. Gao, M. Tancik, and A. Kanazawa, “Nerfacc: Efficient sampling accelerates nerfs.” arXiv preprint arXiv:2305.04966 , 2023

work page arXiv 2023
[33]

Habitat: A platform for embodied ai research,

M. Savva, A. Kadian, O. Maksymets, Y . Zhao, E. Wijmans, B. Jain, J. Straub, J. Liu, V . Koltun, J. Malik, et al., “Habitat: A platform for embodied ai research,” in Proceedings of the IEEE/CVF international conference on computer vision , 2019, pp. 9339–9347

work page 2019
[34]

Habitat 2.0: Training home assistants to rearrange their habitat,

A. Szot, A. Clegg, E. Undersander, E. Wijmans, Y . Zhao, J. Turner, N. Maestre, M. Mukadam, D. S. Chaplot, O. Maksymets, et al., “Habitat 2.0: Training home assistants to rearrange their habitat,” Advances in neural information processing systems , vol. 34, pp. 251–266, 2021

work page 2021
[35]

Habitat 3.0: A co-habitat for humans, avatars and robots.arXiv preprint arXiv:2310.13724, 2023

X. Puig, E. Undersander, A. Szot, M. D. Cote, T.-Y . Yang, R. Partsey, R. Desai, A. W. Clegg, M. Hlavac, S. Y . Min, et al. , “Habitat 3.0: A co-habitat for humans, avatars and robots,” arXiv preprint arXiv:2310.13724, 2023

work page arXiv 2023
[36]

RotorPy: A python-based multirotor simulator with aerodynamics for education and research,

S. Folk, J. Paulos, and V . Kumar, “Rotorpy: A python-based multirotor simulator with aerodynamics for education and research,”arXiv preprint arXiv:2306.04485, 2023

work page arXiv 2023
[37]

A frontier-based approach for autonomous exploration,

B. Yamauchi, “A frontier-based approach for autonomous exploration,” in Proceedings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA’97. ’Towards New Computational Principles for Robotics and Automation’ . IEEE, 1997, pp. 146–151

work page 1997
[38]

M3ed: Multi-robot, multi-sensor, multi-environment event dataset,

K. Chaney, F. Cladera, Z. Wang, A. Bisulco, M. A. Hsieh, C. Korpela, V . Kumar, C. J. Taylor, and K. Daniilidis, “M3ed: Multi-robot, multi-sensor, multi-environment event dataset,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, June 2023, pp. 4016–4023

work page 2023
[39]

Stronger together: Air-ground robotic collaboration using semantics,

I. D. Miller, F. Cladera, T. Smith, C. J. Taylor, and V . Kumar, “Stronger together: Air-ground robotic collaboration using semantics,” IEEE Robotics and Automation Letters , vol. 7, no. 4, 2022

work page 2022
[40]

Faster-lio: Lightweight tightly coupled lidar-inertial odometry using parallel sparse incremental voxels,

C. Bai, T. Xiao, Y . Chen, H. Wang, F. Zhang, and X. Gao, “Faster-lio: Lightweight tightly coupled lidar-inertial odometry using parallel sparse incremental voxels,” IEEE Robotics and Automation Letters , vol. 7, no. 2, pp. 4861–4868, 2022

work page 2022
[41]

Marder-Eppstein

E. Marder-Eppstein. (2024) ROS move base package

work page 2024
[42]

Learning in games (and games in learning),

A. Roth, “Learning in games (and games in learning),” 2023. [Online]. Available: https://www.cis.upenn.edu/∼aaroth/GamesInLearning.pdf

work page 2023

[1] [1]

A survey on active simultaneous localization and mapping: State of the art and new frontiers,

J. A. Placed, J. Strader, H. Carrillo, N. Atanasov, V . Indelman, L. Carlone, and J. A. Castellanos, “A survey on active simultaneous localization and mapping: State of the art and new frontiers,” IEEE Transactions on Robotics , vol. 39, no. 3, pp. 1686–1705, 2023

work page 2023

[2] [2]

View planning in robot active vision: A survey of systems, algorithms, and applications,

R. Zeng, Y . Wen, W. Zhao, and Y .-J. Liu, “View planning in robot active vision: A survey of systems, algorithms, and applications,” Comp. Visual Media, vol. 6, p. 225–245, 2020

work page 2020

[3] [3]

Active perception,

R. Bajcsy, “Active perception,”Proceedings of the IEEE, vol. 76, no. 8, pp. 966–1005, 1988

work page 1988

[4] [4]

Revisiting active perception,

R. Bajcsy, Y . Aloimonos, and J. K. Tsotsos, “Revisiting active perception,” Autonomous Robots, vol. 42, pp. 177–196, 2018

work page 2018

[5] [5]

Near-optimal sensor placements in gaussian processes: Theory, efficient algorithms and empirical studies,

A. Krause, A. Singh, and C. Guestrin, “Near-optimal sensor placements in gaussian processes: Theory, efficient algorithms and empirical studies,” J. Mach. Learn. Res. , vol. 9, p. 235–284, jun 2008

work page 2008

[6] [6]

Nonmyopic active learning of gaussian processes: an exploration-exploitation approach,

A. Krause and C. Guestrin, “Nonmyopic active learning of gaussian processes: an exploration-exploitation approach,” in Proceedings of the 24th International Conference on Machine Learning , ser. ICML ’07. New York, NY , USA: Association for Computing Machinery, 2007, p. 449–456. [Online]. Available: https://doi.org/10.1145/1273496.1273553

work page doi:10.1145/1273496.1273553 2007

[7] [7]

Efficient informative sensing using multiple robots,

A. Singh, A. Krause, C. Guestrin, and W. J. Kaiser, “Efficient informative sensing using multiple robots,” Journal of Artificial Intelligence Research , vol. 34, p. 707–755, Apr. 2009. [Online]. Available: http://dx.doi.org/10.1613/jair.2674

work page doi:10.1613/jair.2674 2009

[8] [8]

Branch and bound for informative path planning,

J. Binney and G. S. Sukhatme, “Branch and bound for informative path planning,” in 2012 IEEE International Conference on Robotics and Automation, 2012, pp. 2147–2154

work page 2012

[9] [9]

Robotic active information gathering for spatial field reconstruction with rapidly-exploring random trees and online learning of gaussian processes,

A. Viseras, D. Shutin, and L. Merino, “Robotic active information gathering for spatial field reconstruction with rapidly-exploring random trees and online learning of gaussian processes,” Sensors, vol. 19, no. 5,

work page

[10] [10]

Available: https://www.mdpi.com/1424-8220/19/5/1016

[Online]. Available: https://www.mdpi.com/1424-8220/19/5/1016

work page

[11] [11]

The determination of next best views,

C. Connolly, “The determination of next best views,” in Proceedings. 1985 IEEE International Conference on Robotics and Automation , vol. 2, 1985, pp. 432–435

work page 1985

[12] [12]

Receding horizon

A. Bircher, M. Kamel, K. Alexis, H. Oleynikova, and R. Siegwart, “Receding horizon ”next-best-view” planner for 3d exploration,” in2016 IEEE International Conference on Robotics and Automation (ICRA) , 2016, pp. 1462–1468

work page 2016

[13] [13]

Yoder and S

L. Yoder and S. Scherer, Autonomous Exploration for Infrastructure Modeling with a Micro Aerial Vehicle . Cham: Springer International Publishing, 2016, pp. 427–440. [Online]. Available: https://doi.org/10. 1007/978-3-319-27702-8 28

work page 2016

[14] [14]

Surface edge explorer (see): Planning next best views directly from 3d observations,

R. Border, J. D. Gammell, and P. Newman, “Surface edge explorer (see): Planning next best views directly from 3d observations,” 2018 IEEE International Conference on Robotics and Automation (ICRA) , pp. 1–8, 2018

work page 2018

[15] [15]

An efficient sampling-based method for online informative path planning in unknown environments,

L. Schmid, M. Pantic, R. Khanna, L. Ott, R. Siegwart, and J. Nieto, “An efficient sampling-based method for online informative path planning in unknown environments,” IEEE Robotics and Automation Letters , vol. 5, no. 2, pp. 1500–1507, 2020

work page 2020

[16] [16]

Fsmi: Fast computa- tion of shannon mutual information for information-theoretic mapping,

Z. Zhang, T. Henderson, V . Sze, and S. Karaman, “Fsmi: Fast computa- tion of shannon mutual information for information-theoretic mapping,” in 2019 International Conference on Robotics and Automation (ICRA) , 2019, pp. 6912–6918

work page 2019

[17] [17]

Information-theoretic mapping using cauchy-schwarz quadratic mutual information,

B. Charrow, S. Liu, V . Kumar, and N. Michael, “Information-theoretic mapping using cauchy-schwarz quadratic mutual information,” in 2015 IEEE International Conference on Robotics and Automation (ICRA) , 2015, pp. 4791–4798

work page 2015

[18] [18]

An efficient and continuous approach to information-theoretic exploration,

T. Henderson, V . Sze, and S. Karaman, “An efficient and continuous approach to information-theoretic exploration,” in 2020 IEEE Inter- national Conference on Robotics and Automation (ICRA) , 2020, pp. 8566–8572

work page 2020

[19] [19]

Learned map prediction for enhanced mobile robot exploration,

R. Shrestha, F.-P. Tian, W. Feng, P. Tan, and R. Vaughan, “Learned map prediction for enhanced mobile robot exploration,” in2019 International Conference on Robotics and Automation (ICRA) , 2019, pp. 1197–1204

work page 2019

[20] [20]

Learning to explore indoor environments using autonomous micro aerial vehicles,

Y . Tao, E. Iceland, B. Li, E. Zwecher, U. Heinemann, A. Cohen, A. Avni, O. Gal, A. Barel, and V . Kumar, “Learning to explore indoor environments using autonomous micro aerial vehicles,” in 2024 IEEE International Conference on Robotics and Automation (ICRA) , 2024, pp. 15 758–15 764

work page 2024

[21] [21]

Seer: Safe efficient exploration for aerial robots using learning to predict information gain,

Y . Tao, Y . Wu, B. Li, F. Cladera, A. Zhou, D. Thakur, and V . Kumar, “Seer: Safe efficient exploration for aerial robots using learning to predict information gain,” in 2023 IEEE International Conference on Robotics and Automation (ICRA) , 2023, pp. 1235–1241

work page 2023

[22] [22]

Learning to map for active semantic goal navigation,

G. Georgakis, B. Bucher, K. Schmeckpeper, S. Singh, and K. Daniilidis, “Learning to map for active semantic goal navigation,” arXiv preprint arXiv:2106.15648, 2021

work page arXiv 2021

[23] [23]

Active perception using neural radiance fields,

H. Siming, C. D. Hsu, D. Ong, Y . S. Shao, and P. Chaudhari, “Active perception using neural radiance fields,” in 2024 American Control Conference (ACC), 2024, pp. 4353–4358

work page 2024

[24] [24]

3d active metric-semantic slam,

Y . Tao, X. Liu, I. Spasojevic, S. Agarwal, and V . Kumar, “3d active metric-semantic slam,” IEEE Robotics and Automation Letters , vol. 9, no. 3, pp. 2989–2996, 2024

work page 2024

[25] [25]

Active scout: Multi-target tracking using neural radiance fields in dense urban environments,

C. D. Hsu and P. Chaudhari, “Active scout: Multi-target tracking using neural radiance fields in dense urban environments,” arXiv preprint arXiv:2406.07431, 2024

work page arXiv 2024

[26] [26]

Semantic octree mapping and shannon mutual information computation for robot exploration,

A. Asgharivaskasi and N. Atanasov, “Semantic octree mapping and shannon mutual information computation for robot exploration,” IEEE Transactions on Robotics , vol. 39, no. 3, pp. 1910–1928, 2023

work page 1910

[27] [27]

RT-GuIDE: Real- Time Gaussian splatting for Information-Driven Exploration

Y . Tao, D. Ong, V . Murali, I. Spasojevic, P. Chaudhari, and V . Ku- mar, “Rt-guide: Real-time gaussian splatting for information-driven exploration,” arXiv preprint arXiv:2409.18122 , 2024

work page arXiv 2024

[28] [28]

Learning to Act by Predicting the Future

A. Dosovitskiy and V . Koltun, “Learning to act by predicting the future,” arXiv preprint arXiv:1611.01779 , 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[29] [29]

Multi-agent exploration of an unknown sparse landmark complex via deep reinforcement learning,

X. Sun, Y . Wu, S. Bhattacharya, and V . Kumar, “Multi-agent exploration of an unknown sparse landmark complex via deep reinforcement learning,” arXiv preprint arXiv:2209.11794 , 2022

work page arXiv 2022

[30] [30]

Learning to explore using active neural slam,

D. S. Chaplot, D. Gandhi, S. Gupta, A. Gupta, and R. Salakhutdi- nov, “Learning to explore using active neural slam,” arXiv preprint arXiv:2004.05155, 2020

work page arXiv 2004

[31] [31]

Lecture notes: Bandits, experts and games (lecture 8),

A. Slivkins, “Lecture notes: Bandits, experts and games (lecture 8),” 2016. [Online]. Available: https://www.cs.umd.edu/ ∼slivkins/ CMSC858G-fall16/lecture8-both.pdf

work page 2016

[32] [32]

Nerfacc: Efficient sampling accelerates nerfs

R. Li, H. Gao, M. Tancik, and A. Kanazawa, “Nerfacc: Efficient sampling accelerates nerfs.” arXiv preprint arXiv:2305.04966 , 2023

work page arXiv 2023

[33] [33]

Habitat: A platform for embodied ai research,

M. Savva, A. Kadian, O. Maksymets, Y . Zhao, E. Wijmans, B. Jain, J. Straub, J. Liu, V . Koltun, J. Malik, et al., “Habitat: A platform for embodied ai research,” in Proceedings of the IEEE/CVF international conference on computer vision , 2019, pp. 9339–9347

work page 2019

[34] [34]

Habitat 2.0: Training home assistants to rearrange their habitat,

A. Szot, A. Clegg, E. Undersander, E. Wijmans, Y . Zhao, J. Turner, N. Maestre, M. Mukadam, D. S. Chaplot, O. Maksymets, et al., “Habitat 2.0: Training home assistants to rearrange their habitat,” Advances in neural information processing systems , vol. 34, pp. 251–266, 2021

work page 2021

[35] [35]

Habitat 3.0: A co-habitat for humans, avatars and robots.arXiv preprint arXiv:2310.13724, 2023

X. Puig, E. Undersander, A. Szot, M. D. Cote, T.-Y . Yang, R. Partsey, R. Desai, A. W. Clegg, M. Hlavac, S. Y . Min, et al. , “Habitat 3.0: A co-habitat for humans, avatars and robots,” arXiv preprint arXiv:2310.13724, 2023

work page arXiv 2023

[36] [36]

RotorPy: A python-based multirotor simulator with aerodynamics for education and research,

S. Folk, J. Paulos, and V . Kumar, “Rotorpy: A python-based multirotor simulator with aerodynamics for education and research,”arXiv preprint arXiv:2306.04485, 2023

work page arXiv 2023

[37] [37]

A frontier-based approach for autonomous exploration,

B. Yamauchi, “A frontier-based approach for autonomous exploration,” in Proceedings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA’97. ’Towards New Computational Principles for Robotics and Automation’ . IEEE, 1997, pp. 146–151

work page 1997

[38] [38]

M3ed: Multi-robot, multi-sensor, multi-environment event dataset,

K. Chaney, F. Cladera, Z. Wang, A. Bisulco, M. A. Hsieh, C. Korpela, V . Kumar, C. J. Taylor, and K. Daniilidis, “M3ed: Multi-robot, multi-sensor, multi-environment event dataset,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, June 2023, pp. 4016–4023

work page 2023

[39] [39]

Stronger together: Air-ground robotic collaboration using semantics,

I. D. Miller, F. Cladera, T. Smith, C. J. Taylor, and V . Kumar, “Stronger together: Air-ground robotic collaboration using semantics,” IEEE Robotics and Automation Letters , vol. 7, no. 4, 2022

work page 2022

[40] [40]

Faster-lio: Lightweight tightly coupled lidar-inertial odometry using parallel sparse incremental voxels,

C. Bai, T. Xiao, Y . Chen, H. Wang, F. Zhang, and X. Gao, “Faster-lio: Lightweight tightly coupled lidar-inertial odometry using parallel sparse incremental voxels,” IEEE Robotics and Automation Letters , vol. 7, no. 2, pp. 4861–4868, 2022

work page 2022

[41] [41]

Marder-Eppstein

E. Marder-Eppstein. (2024) ROS move base package

work page 2024

[42] [42]

Learning in games (and games in learning),

A. Roth, “Learning in games (and games in learning),” 2023. [Online]. Available: https://www.cis.upenn.edu/∼aaroth/GamesInLearning.pdf

work page 2023