Simulation Based Reward Function Validation for Multi-Agent On Orbit Inspection

Bala Prenith Reddy Gopu; George M. Nehma; Madhur Tiwari; Patrick Quinn

arxiv: 2607.01367 · v1 · pith:VTQEW2UVnew · submitted 2026-07-01 · 💻 cs.MA · cs.RO

Simulation Based Reward Function Validation for Multi-Agent On Orbit Inspection

Patrick Quinn , Bala Prenith Reddy Gopu , George M. Nehma , Madhur Tiwari This is my paper

Pith reviewed 2026-07-03 01:29 UTC · model grok-4.3

classification 💻 cs.MA cs.RO

keywords multi-agent reinforcement learningon-orbit inspectionreward function3D reconstructionspacecraft controlMARLsimulation validation

0 comments

The pith

Generalized reward functions from 3D reconstruction analysis let MARL agents decide when to collect inspection images in orbit.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper develops a generalized reward function for multi-agent reinforcement learning in spacecraft inspection tasks. The function draws from analysis of 3D reconstructions built from simulated images rather than relying on a fixed set of predetermined inspection points. This change allows the reward to evaluate any number of images taken at arbitrary locations and gives the trained agents full authority over the timing of image collection. The resulting insights apply to best practices within the MARL setting and extend to inspection missions more broadly.

Core claim

The paper establishes that a reward function informed by the analysis of 3D reconstructions of inspected objects in orbit enables multi-agent reinforcement learning agents to achieve complete control over when images are collected, because the function can assess inspection quality for arbitrary numbers of images at arbitrary locations instead of being limited to predetermined points.

What carries the argument

The generalized reward function informed by 3D reconstruction analysis of simulated images, which supplies an evaluation signal for any collection of images rather than a fixed set of points.

If this is right

Agents gain complete control over when images are collected.
Any number of images at arbitrary locations can be evaluated by the reward.
The approach yields insights into best practices for the MARL inspection task.
Key takeaways apply to the broader inspection task outside a MARL context.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The method could reduce total images required for adequate coverage by letting agents select collection moments based on reconstruction quality.
It may extend to other multi-agent robotic systems where agents must autonomously decide on visual data gathering.
Simulation results would need direct comparison against real orbital image data to confirm transfer of the reward signal.

Load-bearing premise

Analysis of 3D reconstructions from simulated images supplies a reliable signal for designing reward functions that improve inspection performance.

What would settle it

If agents trained with this reward function produce incomplete or lower-quality 3D reconstructions of target objects during actual orbital tests compared with agents limited to fixed inspection points, the central claim would be falsified.

Figures

Figures reproduced from arXiv: 2607.01367 by Bala Prenith Reddy Gopu, George M. Nehma, Madhur Tiwari, Patrick Quinn.

**Figure 5.** Figure 5: Using the same dataset, Instant-NGP produced a fuzzier but more complete reconstruction of the target. This [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗

read the original abstract

A proposed method for the control of groups of inspection spacecraft is Multi-Agent Reinforcement Learning (MARL). While MARL has already been employed for this purpose in previous work, the reward functions used focus on reaching a finite set of predetermined inspection points around the target. In this work, we study and develop a generalized reward function for the MARL inspection task informed by the analysis of 3D reconstructions of inspected objects in orbit. Because the reward function is generalized such that any number of images at arbitrary locations may evaluated, we also allow trained agents to have complete control over when images are collected. With this approach, we gather insights into best practices for not only the specific MARL inspection task, but also gain key takeaways informative to the broader inspection task outside of a MARL context.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper extends MARL reward design for orbital inspection by tying it to 3D reconstruction quality instead of fixed points, but the abstract shows no actual simulation outcomes.

read the letter

The main new piece is a reward function that scores any set of images based on how well they support 3D reconstruction of the target, rather than requiring agents to reach a preset list of locations. This change lets the agents decide on their own when to take pictures, which removes a constraint from earlier work on the same task.

The approach is presented clearly as an extension that stays grounded in external reconstruction analysis instead of fitting to its own outputs. That keeps the logic straightforward and avoids obvious circularity.

The soft spot is the complete absence of results. The text describes the method and states that insights were gathered from simulations, but it gives no numbers, no comparisons, and no description of what the agents actually did or how the reconstructions performed. Without that, the claim that the method yields useful best practices stays untested.

This is for researchers already working on multi-agent RL for spacecraft or similar inspection problems. Someone looking for reward-shaping ideas in that niche could pull a useful concept from it, but only if the full paper supplies the missing simulation details and setup.

I would send it for peer review. The core idea is a reasonable response to a documented limitation in the literature, and the authors treat it as exploratory rather than definitive. A referee could check the reward implementation and the simulation design once those are shown.

Referee Report

1 major / 0 minor

Summary. The manuscript proposes a generalized reward function for multi-agent reinforcement learning (MARL) applied to on-orbit inspection by groups of spacecraft. Unlike prior work limited to finite predetermined inspection points, the reward is informed by analysis of 3D reconstructions from simulated images of inspected objects; this generalization permits evaluation of arbitrary image sets at arbitrary locations and therefore allows trained agents full control over image collection timing. The authors present the approach as a means to gather insights into best practices for the specific MARL task and for orbital inspection more broadly.

Significance. If the simulation validation demonstrates that the reconstruction-derived reward produces effective policies, the work would supply a more flexible reward-design methodology for MARL in space applications and could yield transferable heuristics for inspection planning outside the MARL setting.

major comments (1)

The central claim that the generalized reward function improves inspection performance rests on an unshown simulation study; no quantitative results, performance metrics, baseline comparisons, error bars, or validation details are supplied, so the data-to-claim link cannot be assessed.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their detailed review and for identifying this important gap in the presentation of our work. We address the major comment below.

read point-by-point responses

Referee: The central claim that the generalized reward function improves inspection performance rests on an unshown simulation study; no quantitative results, performance metrics, baseline comparisons, error bars, or validation details are supplied, so the data-to-claim link cannot be assessed.

Authors: We agree that the submitted manuscript does not include the quantitative simulation results needed to substantiate the performance claims. The current version emphasizes the reward-function design and the 3D-reconstruction validation methodology, but omits the actual simulation outcomes, metrics, baselines, and statistical details. In the revised manuscript we will add a dedicated results section that reports the simulation study, including quantitative performance metrics, comparisons against the finite predetermined-point baseline, error bars from repeated trials, and full validation details. revision: yes

Circularity Check

0 steps flagged

No significant circularity; reward function informed by external reconstruction analysis

full rationale

The abstract presents the generalized reward function as informed by analysis of 3D reconstructions from simulated images, which is treated as an external signal rather than a self-referential fit or definition. No equations, fitted parameters renamed as predictions, or self-citation chains are described that would reduce the central claim to its inputs by construction. The extension allowing agent control over image timing is a logical consequence of the generalization, not a circular loop. Prior MARL work is referenced only as context, not as a load-bearing uniqueness result. The derivation chain remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No free parameters, axioms, or invented entities are identifiable from the abstract alone; the work appears to rest on standard MARL and simulation assumptions not detailed here.

pith-pipeline@v0.9.1-grok · 5668 in / 1093 out tokens · 33826 ms · 2026-07-03T01:29:22.475045+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

26 extracted references · 3 canonical work pages · 2 internal anchors

[1]

Tipping points of space debris in low Earth orbit,

Nomura, K., Rella, S., Merritt, H., Baltussen, M., Bird, D., Tjuka, A., and Falk, D., “Tipping points of space debris in low Earth orbit,”Int. J. Commons, Vol. 18, No. 1, 2024

2024
[2]

ADVANCING ACTIVE DEBRIS REMOVAL: ACHIEVEMENTS AND PROSPECTS IN THE COMMERCIAL REMOVAL OF DEBRIS DEMONSTRATION,

Yamamoto, T., Okamoto, H., Tomitaka, M., Nakamura, R., and Takeuchi, Y., “ADVANCING ACTIVE DEBRIS REMOVAL: ACHIEVEMENTS AND PROSPECTS IN THE COMMERCIAL REMOVAL OF DEBRIS DEMONSTRATION,” ESA Space Debris Office, 2025

2025
[3]

Space-Based 3D Reconstruction: Advancing Object Characterization in Orbit,

Navidzadeh, T., “Space-Based 3D Reconstruction: Advancing Object Characterization in Orbit,”https://www.scout. space/news/3d-reconstruction, Dec. 2024. Accessed: 2025-8-8

2024
[4]

3D reconstruction of non-cooperative resident space objects using instant NGP-accelerated NeRF and D-NeRF,

Caruso, B., Mahendrakar, T., Nguyen, V. M., White, R. T., and Steffen, T., “3D reconstruction of non-cooperative resident space objects using instant NGP-accelerated NeRF and D-NeRF,”arXiv [cs.CV], 2023

2023
[5]

Operational reality of collision avoidance manoeuvres,

Symonds, K. G., Flohrer, T., Mardle, N., Fornarelli, D., Marc, X., and Ormston, T., “Operational reality of collision avoidance manoeuvres,”SpaceOps 2014 Conference, American Institute of Aeronautics and Astronautics, Reston, Virginia, 2014

2014
[6]

Coordinated motion planning for on-orbit satellite inspection using a swarm of small-spacecraft,

Bernhard, B., Choi, C., Rahmani, A., Chung, S.-J., and Hadaegh, F., “Coordinated motion planning for on-orbit satellite inspection using a swarm of small-spacecraft,”2020 IEEE Aerospace Conference, IEEE, 2020

2020
[7]

Deep reinforcement learning for multi-agent autonomous satellite inspection,

Lei, H. H., Shubert, M., Damron, N., Lang, K., and Phillips, S., “Deep reinforcement learning for multi-agent autonomous satellite inspection,”Proceedings of the 44th Annual American Astronautical Society Guidance, Navigation, and Control Conference, 2022, edited by M. Sandnas and D. B. Spencer, Springer International Publishing, Cham, 2024, pp. 1391–1412

2022
[8]

Deep Reinforcement Learning for scalable multiagent spacecraft inspection,

Dunlap, K., Hamilton, N., and Hobbs, K. L., “Deep Reinforcement Learning for scalable multiagent spacecraft inspection,” arXiv [eess.SY], 2024

2024
[9]

Autonomous small body science operations using reinforcement learning,

Herrmann, A., and Schaub, H., “Autonomous small body science operations using reinforcement learning,”J. Aerosp. Comput. Inf. Commun., 2024, pp. 1–20

2024
[10]

Neuralangelo: High-Fidelity Neural Surface Reconstruction,

Li, Z., Müller, T., Evans, A., Taylor, R. H., Unberath, M., Liu, M.-Y., and Lin, C.-H., “Neuralangelo: High-Fidelity Neural Surface Reconstruction,”arXiv [cs.CV], 2023

2023
[11]

Terminal guidance system for satellite rendezvous,

Clohessy, W. H., and Wiltshire, R. S., “Terminal guidance system for satellite rendezvous,”J. Aerosp. Sci., Vol. 27, No. 9, 1960, pp. 653–658

1960
[12]

ProximalPolicyOptimizationAlgorithms,

Schulman,J.,Wolski,F.,Dhariwal,P.,Radford,A.,andKlimov,O.,“ProximalPolicyOptimizationAlgorithms,”arXiv [cs.LG], 2017

2017
[13]

An introduction to centralized training for decentralized execution in cooperative multi-agent reinforcement learning,

Amato, C., “An introduction to centralized training for decentralized execution in cooperative multi-agent reinforcement learning,”arXiv [cs.LG], 2024

2024
[14]

Pixelwise View Selection for Unstructured Multi-View Stereo,

Schönberger, J. L., Zheng, E., Pollefeys, M., and Frahm, J.-M., “Pixelwise View Selection for Unstructured Multi-View Stereo,” European Conference on Computer Vision (ECCV), 2016

2016
[15]

Structure-from-Motion Revisited,

Schönberger, J. L., and Frahm, J.-M., “Structure-from-Motion Revisited,”Conference on Computer Vision and Pattern Recognition (CVPR), 2016. 12

2016
[16]

Instant Neural Graphics Primitives with a Multiresolution Hash Encoding,

Müller, T., Evans, A., Schied, C., and Keller, A., “Instant Neural Graphics Primitives with a Multiresolution Hash Encoding,” ACM Trans. Graph., Vol. 41, No. 4, 2022, pp. 102:1–102:15. https://doi.org/10.1145/3528223.3530127, URL https: //doi.org/10.1145/3528223.3530127

work page doi:10.1145/3528223.3530127 2022
[17]

Methodologies for assessing the quality of 3D models obtained using close-range photogrammetry,

di Filippo, A., Antinozzi, S., Cappetti, N., and Villecco, F., “Methodologies for assessing the quality of 3D models obtained using close-range photogrammetry,”Int. J. Interact. Des. Manuf. (IJIDeM), Vol. 18, No. 8, 2024, pp. 5917–5924

2024
[18]

Continuous-discrete reinforcement learning for hybrid control in robotics,

Neunert, M., Abdolmaleki, A., Wulfmeier, M., Lampe, T., Springenberg, J. T., Hafner, R., Romano, F., Buchli, J., Heess, N., and Riedmiller, M., “Continuous-discrete reinforcement learning for hybrid control in robotics,”arXiv [cs.LG], 2020

2020
[19]

HyAR: Addressing discrete-continuous action Reinforcement Learning via Hybrid Action Representation,

Li, B., Tang, H., Zheng, Y., Hao, J., Li, P., Wang, Z., Meng, Z., and Wang, L., “HyAR: Addressing discrete-continuous action Reinforcement Learning via Hybrid Action Representation,”arXiv [cs.LG], 2021

2021
[20]

Curriculum learning,

Bengio, Y., Louradour, J., Collobert, R., and Weston, J., “Curriculum learning,”Proceedings of the 26th Annual International Conference on Machine Learning, ACM, New York, NY, USA, 2009

2009
[21]

Gymnasium: A standard interface for reinforcement learning environments,

Towers, M., Kwiatkowski, A., Terry, J., Balis, J. U., De Cola, G., Deleu, T., Goulão, M., Kallinteris, A., Krimmel, M., Kg, A., Perez-Vicente, R., Pierré, A., Schulhoff, S., Tai, J. J., Tan, H., and Younis, O. G., “Gymnasium: A standard interface for reinforcement learning environments,”arXiv [cs.LG], 2025

2025
[22]

RLlib: Abstractions for Distributed Reinforcement Learning

Liang, E., Liaw, R., Nishihara, R., Moritz, P., Fox, R., Goldberg, K., Gonzalez, J. E., Jordan, M. I., and Stoica, I., “RLlib: Abstractions for Distributed Reinforcement Learning,”International Conference on Machine Learning (ICML), 2018. URL https://arxiv.org/pdf/1712.09381

work page internal anchor Pith review Pith/arXiv arXiv 2018
[23]

RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem,

Wu, Z., Liang, E., Luo, M., Mika, S., Gonzalez, J. E., and Stoica, I., “RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem,”Conference on Neural Information Processing Systems (NeurIPS), 2021. URL https://proceedings.neurips. cc/paper/2021/file/2bce32ed409f5ebcee2a7b417ad9beed-Paper.pdf

2021
[24]

Isaac Sim,

NVIDIA, “Isaac Sim,” , October 2025. URL https://github.com/isaac-sim/IsaacSim

2025
[25]

Ice, Clouds, and Land Elevation Satellite-2 (ICESat-2) (A),

Sanders, M. G., “Ice, Clouds, and Land Elevation Satellite-2 (ICESat-2) (A),” https://science.nasa.gov/3d-resources/ice-clouds- and-land-elevation-satellite-2-icesat-2-a/, Apr. 2025. Accessed: 2025-12-15

2025
[26]

Segment Anything

Kirillov, A., Mintun, E., Ravi, N., Mao, H., Rolland, C., Gustafson, L., Xiao, T., Whitehead, S., Berg, A. C., Lo, W.-Y., Dollár, P., and Girshick, R., “Segment Anything,”arXiv:2304.02643, 2023. 13

work page internal anchor Pith review Pith/arXiv arXiv 2023

[1] [1]

Tipping points of space debris in low Earth orbit,

Nomura, K., Rella, S., Merritt, H., Baltussen, M., Bird, D., Tjuka, A., and Falk, D., “Tipping points of space debris in low Earth orbit,”Int. J. Commons, Vol. 18, No. 1, 2024

2024

[2] [2]

ADVANCING ACTIVE DEBRIS REMOVAL: ACHIEVEMENTS AND PROSPECTS IN THE COMMERCIAL REMOVAL OF DEBRIS DEMONSTRATION,

Yamamoto, T., Okamoto, H., Tomitaka, M., Nakamura, R., and Takeuchi, Y., “ADVANCING ACTIVE DEBRIS REMOVAL: ACHIEVEMENTS AND PROSPECTS IN THE COMMERCIAL REMOVAL OF DEBRIS DEMONSTRATION,” ESA Space Debris Office, 2025

2025

[3] [3]

Space-Based 3D Reconstruction: Advancing Object Characterization in Orbit,

Navidzadeh, T., “Space-Based 3D Reconstruction: Advancing Object Characterization in Orbit,”https://www.scout. space/news/3d-reconstruction, Dec. 2024. Accessed: 2025-8-8

2024

[4] [4]

3D reconstruction of non-cooperative resident space objects using instant NGP-accelerated NeRF and D-NeRF,

Caruso, B., Mahendrakar, T., Nguyen, V. M., White, R. T., and Steffen, T., “3D reconstruction of non-cooperative resident space objects using instant NGP-accelerated NeRF and D-NeRF,”arXiv [cs.CV], 2023

2023

[5] [5]

Operational reality of collision avoidance manoeuvres,

Symonds, K. G., Flohrer, T., Mardle, N., Fornarelli, D., Marc, X., and Ormston, T., “Operational reality of collision avoidance manoeuvres,”SpaceOps 2014 Conference, American Institute of Aeronautics and Astronautics, Reston, Virginia, 2014

2014

[6] [6]

Coordinated motion planning for on-orbit satellite inspection using a swarm of small-spacecraft,

Bernhard, B., Choi, C., Rahmani, A., Chung, S.-J., and Hadaegh, F., “Coordinated motion planning for on-orbit satellite inspection using a swarm of small-spacecraft,”2020 IEEE Aerospace Conference, IEEE, 2020

2020

[7] [7]

Deep reinforcement learning for multi-agent autonomous satellite inspection,

Lei, H. H., Shubert, M., Damron, N., Lang, K., and Phillips, S., “Deep reinforcement learning for multi-agent autonomous satellite inspection,”Proceedings of the 44th Annual American Astronautical Society Guidance, Navigation, and Control Conference, 2022, edited by M. Sandnas and D. B. Spencer, Springer International Publishing, Cham, 2024, pp. 1391–1412

2022

[8] [8]

Deep Reinforcement Learning for scalable multiagent spacecraft inspection,

Dunlap, K., Hamilton, N., and Hobbs, K. L., “Deep Reinforcement Learning for scalable multiagent spacecraft inspection,” arXiv [eess.SY], 2024

2024

[9] [9]

Autonomous small body science operations using reinforcement learning,

Herrmann, A., and Schaub, H., “Autonomous small body science operations using reinforcement learning,”J. Aerosp. Comput. Inf. Commun., 2024, pp. 1–20

2024

[10] [10]

Neuralangelo: High-Fidelity Neural Surface Reconstruction,

Li, Z., Müller, T., Evans, A., Taylor, R. H., Unberath, M., Liu, M.-Y., and Lin, C.-H., “Neuralangelo: High-Fidelity Neural Surface Reconstruction,”arXiv [cs.CV], 2023

2023

[11] [11]

Terminal guidance system for satellite rendezvous,

Clohessy, W. H., and Wiltshire, R. S., “Terminal guidance system for satellite rendezvous,”J. Aerosp. Sci., Vol. 27, No. 9, 1960, pp. 653–658

1960

[12] [12]

ProximalPolicyOptimizationAlgorithms,

Schulman,J.,Wolski,F.,Dhariwal,P.,Radford,A.,andKlimov,O.,“ProximalPolicyOptimizationAlgorithms,”arXiv [cs.LG], 2017

2017

[13] [13]

An introduction to centralized training for decentralized execution in cooperative multi-agent reinforcement learning,

Amato, C., “An introduction to centralized training for decentralized execution in cooperative multi-agent reinforcement learning,”arXiv [cs.LG], 2024

2024

[14] [14]

Pixelwise View Selection for Unstructured Multi-View Stereo,

Schönberger, J. L., Zheng, E., Pollefeys, M., and Frahm, J.-M., “Pixelwise View Selection for Unstructured Multi-View Stereo,” European Conference on Computer Vision (ECCV), 2016

2016

[15] [15]

Structure-from-Motion Revisited,

Schönberger, J. L., and Frahm, J.-M., “Structure-from-Motion Revisited,”Conference on Computer Vision and Pattern Recognition (CVPR), 2016. 12

2016

[16] [16]

Instant Neural Graphics Primitives with a Multiresolution Hash Encoding,

Müller, T., Evans, A., Schied, C., and Keller, A., “Instant Neural Graphics Primitives with a Multiresolution Hash Encoding,” ACM Trans. Graph., Vol. 41, No. 4, 2022, pp. 102:1–102:15. https://doi.org/10.1145/3528223.3530127, URL https: //doi.org/10.1145/3528223.3530127

work page doi:10.1145/3528223.3530127 2022

[17] [17]

Methodologies for assessing the quality of 3D models obtained using close-range photogrammetry,

di Filippo, A., Antinozzi, S., Cappetti, N., and Villecco, F., “Methodologies for assessing the quality of 3D models obtained using close-range photogrammetry,”Int. J. Interact. Des. Manuf. (IJIDeM), Vol. 18, No. 8, 2024, pp. 5917–5924

2024

[18] [18]

Continuous-discrete reinforcement learning for hybrid control in robotics,

Neunert, M., Abdolmaleki, A., Wulfmeier, M., Lampe, T., Springenberg, J. T., Hafner, R., Romano, F., Buchli, J., Heess, N., and Riedmiller, M., “Continuous-discrete reinforcement learning for hybrid control in robotics,”arXiv [cs.LG], 2020

2020

[19] [19]

HyAR: Addressing discrete-continuous action Reinforcement Learning via Hybrid Action Representation,

Li, B., Tang, H., Zheng, Y., Hao, J., Li, P., Wang, Z., Meng, Z., and Wang, L., “HyAR: Addressing discrete-continuous action Reinforcement Learning via Hybrid Action Representation,”arXiv [cs.LG], 2021

2021

[20] [20]

Curriculum learning,

Bengio, Y., Louradour, J., Collobert, R., and Weston, J., “Curriculum learning,”Proceedings of the 26th Annual International Conference on Machine Learning, ACM, New York, NY, USA, 2009

2009

[21] [21]

Gymnasium: A standard interface for reinforcement learning environments,

Towers, M., Kwiatkowski, A., Terry, J., Balis, J. U., De Cola, G., Deleu, T., Goulão, M., Kallinteris, A., Krimmel, M., Kg, A., Perez-Vicente, R., Pierré, A., Schulhoff, S., Tai, J. J., Tan, H., and Younis, O. G., “Gymnasium: A standard interface for reinforcement learning environments,”arXiv [cs.LG], 2025

2025

[22] [22]

RLlib: Abstractions for Distributed Reinforcement Learning

Liang, E., Liaw, R., Nishihara, R., Moritz, P., Fox, R., Goldberg, K., Gonzalez, J. E., Jordan, M. I., and Stoica, I., “RLlib: Abstractions for Distributed Reinforcement Learning,”International Conference on Machine Learning (ICML), 2018. URL https://arxiv.org/pdf/1712.09381

work page internal anchor Pith review Pith/arXiv arXiv 2018

[23] [23]

RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem,

Wu, Z., Liang, E., Luo, M., Mika, S., Gonzalez, J. E., and Stoica, I., “RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem,”Conference on Neural Information Processing Systems (NeurIPS), 2021. URL https://proceedings.neurips. cc/paper/2021/file/2bce32ed409f5ebcee2a7b417ad9beed-Paper.pdf

2021

[24] [24]

Isaac Sim,

NVIDIA, “Isaac Sim,” , October 2025. URL https://github.com/isaac-sim/IsaacSim

2025

[25] [25]

Ice, Clouds, and Land Elevation Satellite-2 (ICESat-2) (A),

Sanders, M. G., “Ice, Clouds, and Land Elevation Satellite-2 (ICESat-2) (A),” https://science.nasa.gov/3d-resources/ice-clouds- and-land-elevation-satellite-2-icesat-2-a/, Apr. 2025. Accessed: 2025-12-15

2025

[26] [26]

Segment Anything

Kirillov, A., Mintun, E., Ravi, N., Mao, H., Rolland, C., Gustafson, L., Xiao, T., Whitehead, S., Berg, A. C., Lo, W.-Y., Dollár, P., and Girshick, R., “Segment Anything,”arXiv:2304.02643, 2023. 13

work page internal anchor Pith review Pith/arXiv arXiv 2023