QuadPiPS: A Perception-informed Footstep Planner for Quadrupeds With Semantic Affordance Prediction

Max Asselmeier; Patricio A. Vela; Ye Zhao

arxiv: 2501.00112 · v3 · submitted 2024-12-30 · 💻 cs.RO

QuadPiPS: A Perception-informed Footstep Planner for Quadrupeds With Semantic Affordance Prediction

Max Asselmeier , Ye Zhao , Patricio A. Vela This is my paper

Pith reviewed 2026-05-23 05:54 UTC · model grok-4.3

classification 💻 cs.RO

keywords quadruped locomotionfootstep planningsemantic perceptionegocentric mappingmotion planningterrain awarenesslegged robotstrajectory optimization

0 comments

The pith

QuadPiPS plans quadruped footsteps by encoding geometry and semantics in an ego-centric map to select safe footholds in constrained terrain.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces QuadPiPS as a framework that performs foothold planning directly in perception space for quadrupedal robots. It builds a legged egocan representation that jointly encodes geometric structure and semantic affordances to label surfaces as suitable or unsafe for foot placement. The method over-segments this map with superpixels to generate candidate regions, then applies search and nonlinear trajectory optimization to produce kinodynamically feasible swing trajectories and whole-body references. These are tracked via model predictive control and whole-body control. Benchmark results on the ANYmal C in simulation and the Unitree Go2 on hardware show improved performance over baselines when safe footholds are scarce.

Core claim

QuadPiPS extends the ALEF planning approach by synthesizing perception-informed, real-time, kinodynamically-feasible reference trajectories in the legged egocan through discrete search over superpixel regions combined with continuous trajectory optimization, enabling terrain-aware locomotion that excels in safety-critical settings with limited available footholds.

What carries the argument

The legged egocan, an ego-centric local environment representation that jointly encodes geometry and semantic affordances to support foothold planning and control.

If this is right

The planner generates long-horizon whole-body reference motions that are tracked under model predictive control.
It supports exhaustive searching over discrete foothold candidates while maintaining continuous feasibility through optimization.
Real-world deployment is enabled by real-time synthesis of perception-informed trajectories on hardware.
Performance gains appear specifically in safety-critical scenes where the number of viable footholds is restricted.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same egocan-plus-superpixel structure could be tested on bipeds or hexapods to check whether the discrete-continuous partition generalizes beyond quadrupeds.
Replacing the current semantic network with an online-updating version might allow the planner to adapt when terrain properties change during a mission.
Coupling the generated reference trajectories with learned whole-body controllers could reduce the reliance on explicit model predictive control for tracking.

Load-bearing premise

The semantic affordance prediction network accurately labels surfaces as safe or unsafe and the superpixel segmentation reliably produces planar regions suitable as candidate footholds.

What would settle it

A controlled test environment in which the semantic network mislabels a safe planar surface as unsafe or an unsafe surface as safe, resulting in either missed feasible plans or selection of unstable footholds that cause falls during execution.

Figures

Figures reproduced from arXiv: 2501.00112 by Max Asselmeier, Patricio A. Vela, Ye Zhao.

**Figure 2.** Figure 2: Results of training. (a) Total training loss over the training iterations. [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗

**Figure 3.** Figure 3: Example outputs of learned steppability model. Top row: input depth [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Offline planning results for Section 4.1. On the left side, results of all [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗

**Figure 5.** Figure 5: Online tracking performance of offline reference trajectory in simulation. [PITH_FULL_IMAGE:figures/full_fig_p011_5.png] view at source ↗

**Figure 6.** Figure 6: Contact planning performed over regular terrain (stairs) within an online [PITH_FULL_IMAGE:figures/full_fig_p013_6.png] view at source ↗

**Figure 7.** Figure 7: Contact planning performed in the presence of an online disturbance in [PITH_FULL_IMAGE:figures/full_fig_p014_7.png] view at source ↗

read the original abstract

This work proposes QuadPiPS, a perception-informed framework for quadrupedal foothold planning in the perception space. QuadPiPS employs a novel ego-centric local environment representation, known as the legged egocan, that is extended here to capture unique legged affordances through a joint geometric and semantic encoding that supports local motion planning and control for quadrupeds. QuadPiPS takes inspiration from the Augmented Leafs with Experience on Foliations (ALEF) planning framework to partition the foothold planning space into its discrete and continuous subspaces. To facilitate real-world deployment, QuadPiPS broadens the ALEF approach by synthesizing perception-informed, real-time, and kinodynamically-feasible reference trajectories through search and trajectory optimization techniques. To support deliberate and exhaustive searching, QuadPiPS over-segments the egocan floor via superpixels to provide a set of planar regions suitable for candidate footholds. Nonlinear trajectory optimization methods then compute swing trajectories to transition between selected footholds and provide long-horizon whole-body reference motions that are tracked under model predictive control and whole body control. Benchmarking with the ANYmal C quadruped across ten simulation environments and five baselines reveals that QuadPiPS excels in safety-critical settings with limited available footholds. Real-world validation on the Unitree Go2 quadruped equipped with a custom computational suite demonstrates that QuadPiPS enables terrain-aware locomotion on hardware.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

QuadPiPS adds a semantic layer to the legged egocan and ALEF partitioning for foothold planning, but the abstract supplies no metrics to support the safety-critical benchmarking claims.

read the letter

The one thing to know is that this paper introduces a legged egocan representation that jointly encodes geometry and semantic affordances, then feeds superpixel-segmented planar regions into an ALEF-inspired discrete search plus trajectory optimization loop for quadruped footsteps. That combination is the concrete new piece beyond the cited prior work. It also closes the loop to whole-body control and MPC for hardware deployment on the Go2. The pipeline description is clear and the motivation for perception-informed planning on constrained terrain is straightforward. The approach looks workable on paper for fusing semantics into kinodynamic planning without obvious circularity in the claims. The soft spot is exactly where the stress-test note flags it: the central result that QuadPiPS excels over five baselines in safety-critical settings rests on unshown perception performance. The abstract states benchmarking across ten simulation environments and real-world validation but gives no precision numbers, success rates, planarity errors, or failure cases for the affordance network or superpixels. That leaves the empirical superiority dependent on assumptions that are not quantified here. Minor issues include the lack of implementation details on training the semantic predictor or handling edge cases in over-segmentation. This is for readers working on perception-aware legged locomotion who want a worked example of semantic affordance feeding into real-time search and optimization. It is coherent enough on its own terms to deserve a serious referee, even if the experiments will require heavy scrutiny on the data. I would send it to peer review.

Referee Report

3 major / 2 minor

Summary. The paper proposes QuadPiPS, a perception-informed footstep planner for quadrupeds that uses a novel legged egocan representation combining geometric and semantic encoding. It extends the ALEF framework by over-segmenting the egocan floor with superpixels to generate planar candidate footholds, then applies discrete search followed by nonlinear trajectory optimization to produce kinodynamically feasible swing trajectories tracked under MPC and whole-body control. The central empirical claims are that benchmarking on ANYmal C across ten simulation environments against five baselines shows superiority in safety-critical settings with limited footholds, and real-world tests on Unitree Go2 demonstrate terrain-aware locomotion.

Significance. If the perception components prove reliable, the integration of semantic affordance prediction directly into an ALEF-style discrete-continuous planner could improve safety for legged robots in foothold-limited terrains. The explicit use of superpixels for exhaustive search and the hardware deployment on a custom computational suite are concrete strengths that could support reproducible follow-on work.

major comments (3)

[Abstract] Abstract (benchmarking paragraph): the claim that QuadPiPS 'excels in safety-critical settings with limited available footholds' is the headline result, yet the abstract supplies no success rates, collision counts, traversal times, or statistical comparisons against the five baselines across the ten environments; without these numbers the superiority assertion cannot be evaluated.
[Abstract] Abstract (egocan and superpixels paragraph): the semantic affordance prediction network and superpixel over-segmentation are the direct inputs to the ALEF-inspired discrete search; the manuscript provides no precision/recall, planarity error, or failure-case statistics for these modules on held-out data, rendering the safety-critical benchmarking claim dependent on unverified perceptual preconditions.
[Abstract] Real-world validation paragraph: the statement that QuadPiPS 'enables terrain-aware locomotion on hardware' is presented without quantitative metrics (e.g., foothold success rate, recovery time after mislabeling, or comparison to baselines) or exclusion criteria, so the hardware claim cannot be assessed for robustness.

minor comments (2)

[Abstract] The acronym 'ALEF' is expanded on first use but the original reference is not cited in the abstract; adding the citation would improve traceability.
[Abstract] The term 'legged egocan' is introduced as novel; a one-sentence parenthetical definition or pointer to the methods section would aid readers unfamiliar with the representation.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for highlighting the need for quantitative support in the abstract. We agree that the abstract should better substantiate its claims with key metrics from the manuscript and will revise it to include representative results from the benchmarking and hardware sections.

read point-by-point responses

Referee: [Abstract] Abstract (benchmarking paragraph): the claim that QuadPiPS 'excels in safety-critical settings with limited available footholds' is the headline result, yet the abstract supplies no success rates, collision counts, traversal times, or statistical comparisons against the five baselines across the ten environments; without these numbers the superiority assertion cannot be evaluated.

Authors: We acknowledge the abstract currently omits specific numerical results. The full manuscript reports these details (success rates, collision counts, traversal times, and baseline comparisons) in the experiments section across the ten environments. We will revise the abstract to include the most representative quantitative comparisons supporting the superiority claim in limited-foothold settings. revision: yes
Referee: [Abstract] Abstract (egocan and superpixels paragraph): the semantic affordance prediction network and superpixel over-segmentation are the direct inputs to the ALEF-inspired discrete search; the manuscript provides no precision/recall, planarity error, or failure-case statistics for these modules on held-out data, rendering the safety-critical benchmarking claim dependent on unverified perceptual preconditions.

Authors: The perceptual modules' reliability is validated indirectly through end-to-end planning success in the safety-critical benchmarks. However, the manuscript does not report isolated precision/recall or planarity statistics on held-out data. We will either add any such metrics if they were computed during development or explicitly note the reliance on integrated system performance in the revised abstract. revision: partial
Referee: [Abstract] Real-world validation paragraph: the statement that QuadPiPS 'enables terrain-aware locomotion on hardware' is presented without quantitative metrics (e.g., foothold success rate, recovery time after mislabeling, or comparison to baselines) or exclusion criteria, so the hardware claim cannot be assessed for robustness.

Authors: We agree the abstract lacks quantitative hardware metrics. The manuscript describes successful terrain-aware deployment on the Unitree Go2 but does not detail numbers such as foothold success rates in the abstract. We will revise the abstract to incorporate key quantitative indicators and any exclusion criteria from the real-world experiments section. revision: yes

Circularity Check

0 steps flagged

No circularity: framework extends external ALEF with independent perception components and reports empirical benchmarks

full rationale

The paper defines QuadPiPS as an extension of the external ALEF planning framework, adding an egocan representation, semantic affordance prediction, and superpixel segmentation for foothold candidates, followed by search and trajectory optimization. No equations, fitted parameters, or self-citations are presented as load-bearing derivations that reduce to inputs by construction. Central claims rest on benchmarking across simulation environments and real-world hardware validation rather than tautological redefinitions or renamed fits. The perceptual assumptions (network labeling accuracy, superpixel planarity) are empirical preconditions, not circular steps.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 1 invented entities

Only the abstract is available, so the ledger is necessarily incomplete; the central claim rests on the unstated accuracy of the semantic prediction model and the assumption that superpixels yield usable planar foothold candidates.

invented entities (1)

legged egocan no independent evidence
purpose: ego-centric local environment representation that jointly encodes geometry and semantic affordances for legged motion planning
Introduced as a novel extension of prior egocentric maps; no independent evidence supplied in abstract.

pith-pipeline@v0.9.0 · 5788 in / 1284 out tokens · 25375 ms · 2026-05-23T05:54:26.802388+00:00 · methodology

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Task-Conditioned Uncertainty Costmaps for Legged Locomotion
cs.RO 2026-04 unverdicted novelty 5.0

Task-conditioned epistemic uncertainty in foothold predictions enables OOD detection and up to 37% lower feasibility error in uncertainty-aware costmaps for legged robot planning.

Reference graph

Works this paper leans on

36 extracted references · 36 canonical work pages · cited by 1 Pith paper

[1]

The DARPA LAGR program: Goals, challenges, methodology, and phase I results,

L. D. Jackel, E. Krotkov, M. Perschbacher, J. Pippine, and C. Sullivan, “The DARPA LAGR program: Goals, challenges, methodology, and phase I results,” Journal of Field Robotics , vol. 23, no. 11-12, pp. 945–973, 2006

work page 2006
[2]

Stereo-vision-based perception capabilities developed during the Robotics Collaborative Technology Alliances pro- gram,

A. Rankin, M. Bajracharya, A. Huertas, A. Howard, B. Moghaddam, S. Brennan, A. Ansar, B. Tang, M. Turmon, and L. Matthies, “Stereo-vision-based perception capabilities developed during the Robotics Collaborative Technology Alliances pro- gram,” G. R. Gerhart, D. W. Gage, and C. M. Shoemaker, Eds., Orlando, Florida, Apr. 2010

work page 2010
[3]

Learning Off-Road Terrain Traversability With Self- Supervisions Only,

J. Seo, S. Sim, and I. Shim, “Learning Off-Road Terrain Traversability With Self- Supervisions Only,” IEEE Robotics and Automation Letters , vol. 8, no. 8, pp. 4617–4624, Aug. 2023

work page 2023
[4]

Terrain traversability prediction through self-supervised learning and un- supervised domain adaptation on synthetic data,

G. Vecchio, S. Palazzo, D. C. Guastella, D. Giordano, G. Muscato, and C. Spamp- inato, “Terrain traversability prediction through self-supervised learning and un- supervised domain adaptation on synthetic data,” Autonomous Robots , vol. 48, no. 2, p. 4, Mar. 2024

work page 2024
[5]

Elevation Mapping for Locomotion and Navigation using GPU,

T. Miki, L. Wellhausen, R. Grandia, F. Jenelten, T. Homberger, and M. Hutter, “Elevation Mapping for Locomotion and Navigation using GPU,” Apr. 2022

work page 2022
[6]

STEP: Stochastic Traversability Evaluation and Planning for Risk-Aware Off-road Navigation; Results from the DARPA Subterranean Challenge,

A. Dixit, D. D. Fan, K. Otsu, S. Dey, A.-A. Agha-Mohammadi, and J. W. Bur- dick, “STEP: Stochastic Traversability Evaluation and Planning for Risk-Aware Off-road Navigation; Results from the DARPA Subterranean Challenge,” Field Robotics, vol. 4, no. 1, pp. 182–210, Jan. 2024

work page 2024
[7]

Mine Tunnel Exploration using Multiple Quadrupedal Robots,

I. D. Miller, F. Cladera, A. Cowley, S. S. Shivakumar, E. S. Lee, L. Jarin-Lipschitz, A. Bhat, N. Rodrigues, A. Zhou, A. Cohen, A. Kulkarni, J. Laney, C. J. Taylor, 16 Max Asselmeier et al. and V. Kumar, “Mine Tunnel Exploration using Multiple Quadrupedal Robots,” Feb. 2020

work page 2020
[8]

OHM: GPU Based Occupancy Map Generation,

K. Stepanas, J. Williams, E. Hern´ andez, F. Ruetz, and T. Hines, “OHM: GPU Based Occupancy Map Generation,”IEEE Robotics and Automation Letters, vol. 7, no. 4, pp. 11 078–11 085, Oct. 2022

work page 2022
[9]

Flexible Supervised Autonomy for Exploration in Subterranean Environments,

H. Biggie, E. R. Rush, D. G. Riley, S. Ahmad, M. T. Ohradzansky, K. Harlow, M. J. Miles, D. Torres, S. McGuire, E. W. Frew, C. Heckman, and J. S. Humbert, “Flexible Supervised Autonomy for Exploration in Subterranean Environments,” Field Robotics, vol. 3, no. 1, pp. 125–189, Jan. 2023

work page 2023
[10]

Rough Terrain Navigation for Legged Robots using Reachability Planning and Template Learning,

L. Wellhausen and M. Hutter, “Rough Terrain Navigation for Legged Robots using Reachability Planning and Template Learning,” in 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) , Sep. 2021

work page 2021
[11]

NeBula: Quest for Robotic Autonomy in Challenging Environments; TEAM CoSTAR at the DARPA Subterranean Challenge,

A. Agha, K. Otsu, B. Morrell, D. D. Fan, R. Thakker, A. Santamaria-Navarro, S.-K. Kim, A. Bouman, X. Lei, J. Edlund, M. F. Ginting, K. Ebadi, M. Ander- son, T. Pailevanian, E. Terry, M. Wolf, A. Tagliabue, T. S. Vaquero, M. Palieri, S. Tepsuporn, Y. Chang, A. Kalantari, F. Chavez, B. Lopez, N. Funabiki, G. Miles, T. Touma, A. Buscicchio, J. Tordesillas, N...

work page arXiv 2021
[12]

Heterogeneous robot teams with uni- fied perception and autonomy: How Team CSIRO Data61 tied for the top score at the DARPA Subterranean Challenge,

N. Kottege, J. Williams, B. Tidd, F. Talbot, R. Steindl, M. Cox, D. Frousheger, T. Hines, A. Pitt, B. Tam, B. Wood, L. Hanson, K. L. Surdo, T. Molnar, M. Wildie, K. Stepanas, G. Catt, L. Tychsen-Smith, D. Penfold, L. Overs, M. Ramezani, K. Khosoussi, F. Kendoul, G. Wagner, D. Palmer, J. Manderson, C. Medek, M. O’Brien, S. Chen, and R. C. Arkin, “Heterogen...

work page 2023
[13]

Perceptive Locomotion in Rough Terrain – Online Foothold Optimization,

F. Jenelten, T. Miki, A. E. Vijayan, M. Bjelonic, and M. Hutter, “Perceptive Locomotion in Rough Terrain – Online Foothold Optimization,” IEEE Robotics and Automation Letters , vol. 5, no. 4, pp. 5370–5376, Oct. 2020

work page 2020
[14]

Where Should I Walk? Predicting Terrain Properties From Images Via Self- Supervised Learning,

L. Wellhausen, A. Dosovitskiy, R. Ranftl, K. Walas, C. Cadena, and M. Hut- ter, “Where Should I Walk? Predicting Terrain Properties From Images Via Self- Supervised Learning,” IEEE Robotics and Automation Letters , vol. 4, no. 2, pp. 1509–1516, Apr. 2019

work page 2019
[15]

Prediction Maps for Real-Time 3D Footstep Planning in Dynamic Environments,

P. Karkowski and M. Bennewitz, “Prediction Maps for Real-Time 3D Footstep Planning in Dynamic Environments,” in2019 International Conference on Robotics and Automation (ICRA) , May 2019, pp. 2517–2523

work page 2019
[16]

Vision Aided Dynamic Exploration of Unstructured Terrain with a Small-Scale Quadruped Robot,

D. Kim, D. Carballo, J. Di Carlo, B. Katz, G. Bledt, B. Lim, and S. Kim, “Vision Aided Dynamic Exploration of Unstructured Terrain with a Small-Scale Quadruped Robot,” in 2020 IEEE International Conference on Robotics and Au- tomation (ICRA) , May 2020, pp. 2464–2470

work page 2020
[17]

Foot- step Planning for Autonomous Walking Over Rough Terrain,

R. J. Griffin, G. Wiedebach, S. McCrory, S. Bertrand, I. Lee, and J. Pratt, “Foot- step Planning for Autonomous Walking Over Rough Terrain,” in 2019 IEEE-RAS 19th International Conference on Humanoid Robots (Humanoids) , Oct. 2019, pp. 9–16. Steppability-informed Quadrupedal Contact Planning 17

work page 2019
[18]

Detecting Usable Planar Regions for Legged Robot Locomotion,

S. Bertrand, I. Lee, B. Mishra, D. Calvert, J. Pratt, and R. Griffin, “Detecting Usable Planar Regions for Legged Robot Locomotion,” in 2020 IEEE/RSJ Inter- national Conference on Intelligent Robots and Systems (IROS) . Las Vegas, NV, USA: IEEE, Oct. 2020, pp. 4736–4742

work page 2020
[19]

Perceptive whole-body planning for multilegged robots in confined spaces,

R. Buchanan, L. Wellhausen, M. Bjelonic, T. Bandyopadhyay, N. Kottege, and M. Hutter, “Perceptive whole-body planning for multilegged robots in confined spaces,” Journal of Field Robotics , vol. 38, no. 1, pp. 68–84, 2021

work page 2021
[20]

Perceptive Lo- comotion through Nonlinear Model Predictive Control,

R. Grandia, F. Jenelten, S. Yang, F. Farshidian, and M. Hutter, “Perceptive Lo- comotion through Nonlinear Model Predictive Control,” in arXiv, Aug. 2022

work page 2022
[21]

PiPS: Planning in perception space,

J. S. Smith and P. Vela, “PiPS: Planning in perception space,” in 2017 IEEE International Conference on Robotics and Automation (ICRA) , 2017, pp. 6204– 6209

work page 2017
[22]

Potential Gap: A Gap-Informed Reactive Policy for Safe Hierarchical Navigation,

R. Xu, S. Feng, and P. A. Vela, “Potential Gap: A Gap-Informed Reactive Policy for Safe Hierarchical Navigation,” IEEE Robotics and Automation Letters , vol. 6, no. 4, pp. 8325–8332, 2021

work page 2021
[23]

Real-Time Egocentric Navigation Using 3D Sensing,

J. S. Smith, S. Feng, F. Lyu, and P. A. Vela, “Real-Time Egocentric Navigation Using 3D Sensing,” in Machine Vision and Navigation , 2020, pp. 431–484

work page 2020
[24]

egoTEB: Egocentric, Perception Space Navigation Using Timed-Elastic-Bands,

J. S. Smith, R. Xu, and P. Vela, “egoTEB: Egocentric, Perception Space Navigation Using Timed-Elastic-Bands,” in 2020 IEEE International Conference on Robotics and Automation (ICRA) , 2020, pp. 2703–2709

work page 2020
[25]

Deep Visual Reasoning: Learning to Predict Action Sequences for Task and Motion Planning from an Initial Scene Image,

D. Driess, J.-S. Ha, and M. Toussaint, “Deep Visual Reasoning: Learning to Predict Action Sequences for Task and Motion Planning from an Initial Scene Image,” in Robotics: Science and Systems XVI . Robotics: Science and Systems Foundation, Jul. 2020

work page 2020
[26]

Deep Visual Heuristics: Learning Feasibility of Mixed-Integer Programs for Manipulation Planning,

D. Driess, O. Oguz, J.-S. Ha, and M. Toussaint, “Deep Visual Heuristics: Learning Feasibility of Mixed-Integer Programs for Manipulation Planning,” in 2020 IEEE International Conference on Robotics and Automation (ICRA) , May 2020, pp. 9563–9569

work page 2020
[27]

Learning to Navigate Sidewalks in Outdoor Environments,

M. Sorokin, J. Tan, C. K. Liu, and S. Ha, “Learning to Navigate Sidewalks in Outdoor Environments,” Sep. 2021

work page 2021
[28]

Legged Locomotion in Chal- lenging Terrains using Egocentric Vision,

A. Agarwal, A. Kumar, J. Malik, and D. Pathak, “Legged Locomotion in Chal- lenging Terrains using Egocentric Vision,” Sep. 2022

work page 2022
[29]

Shape- Primitive Based Object Recognition and Grasping,

M. Nieuwenhuisen, J. Stueckler, A. Berner, R. Klein, and S. Behnke, “Shape- Primitive Based Object Recognition and Grasping,” in ROBOTIK 2012; 7th Ger- man Conference on Robotics , May 2012, pp. 1–5

work page 2012
[30]

Using synthetic data and deep net- works to recognize primitive shapes for object grasping,

Y. Lin, C. Tang, F.-J. Chu, and P. A. Vela, “Using synthetic data and deep net- works to recognize primitive shapes for object grasping,” in 2020 IEEE Interna- tional Conference on Robotics and Automation (ICRA) . IEEE, 2020, pp. 10 494– 10 501

work page 2020
[31]

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation,

L.-C. Chen, Y. Zhu, G. Papandreou, F. Schroff, and H. Adam, “Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation,” Aug. 2018

work page 2018
[32]

Detectron2,

Yuxin Wu, Alexander Kirillov, Francisco Massa, Wan-Yen Lo, and Ross Girshick, “Detectron2,” 2019

work page 2019
[33]

Hierarchical Experience-informed Navigation for Multi-modal Quadrupedal Rebar Grid Traver- sal,

M. Asselmeier, J. Ivanova, Z. Zhou, P. A. Vela, and Y. Zhao, “Hierarchical Experience-informed Navigation for Multi-modal Quadrupedal Rebar Grid Traver- sal,” 2024 IEEE International Conference on Robotics and Automation (ICRA) , 2024

work page 2024
[34]

Scaling Multimodal Planning: Using Experience and Informing Discrete Search,

Z. Kingston and L. E. Kavraki, “Scaling Multimodal Planning: Using Experience and Informing Discrete Search,” IEEE Transactions on Robotics , vol. 39, no. 1, pp. 128–146, Feb. 2023. 18 Max Asselmeier et al

work page 2023
[35]

Centroidal dynamics of a humanoid robot,

D. E. Orin, A. Goswami, and S.-H. Lee, “Centroidal dynamics of a humanoid robot,” Autonomous Robots, vol. 35, no. 2, pp. 161–176, Oct. 2013

work page 2013
[36]

A Unified MPC Framework for Whole-Body Dynamic Locomotion and Manipulation,

J.-P. Sleiman, F. Farshidian, M. V. Minniti, and M. Hutter, “A Unified MPC Framework for Whole-Body Dynamic Locomotion and Manipulation,” IEEE Robotics and Automation Letters , vol. 6, no. 3, pp. 4688–4695, Jul. 2021

work page 2021

[1] [1]

The DARPA LAGR program: Goals, challenges, methodology, and phase I results,

L. D. Jackel, E. Krotkov, M. Perschbacher, J. Pippine, and C. Sullivan, “The DARPA LAGR program: Goals, challenges, methodology, and phase I results,” Journal of Field Robotics , vol. 23, no. 11-12, pp. 945–973, 2006

work page 2006

[2] [2]

Stereo-vision-based perception capabilities developed during the Robotics Collaborative Technology Alliances pro- gram,

A. Rankin, M. Bajracharya, A. Huertas, A. Howard, B. Moghaddam, S. Brennan, A. Ansar, B. Tang, M. Turmon, and L. Matthies, “Stereo-vision-based perception capabilities developed during the Robotics Collaborative Technology Alliances pro- gram,” G. R. Gerhart, D. W. Gage, and C. M. Shoemaker, Eds., Orlando, Florida, Apr. 2010

work page 2010

[3] [3]

Learning Off-Road Terrain Traversability With Self- Supervisions Only,

J. Seo, S. Sim, and I. Shim, “Learning Off-Road Terrain Traversability With Self- Supervisions Only,” IEEE Robotics and Automation Letters , vol. 8, no. 8, pp. 4617–4624, Aug. 2023

work page 2023

[4] [4]

Terrain traversability prediction through self-supervised learning and un- supervised domain adaptation on synthetic data,

G. Vecchio, S. Palazzo, D. C. Guastella, D. Giordano, G. Muscato, and C. Spamp- inato, “Terrain traversability prediction through self-supervised learning and un- supervised domain adaptation on synthetic data,” Autonomous Robots , vol. 48, no. 2, p. 4, Mar. 2024

work page 2024

[5] [5]

Elevation Mapping for Locomotion and Navigation using GPU,

T. Miki, L. Wellhausen, R. Grandia, F. Jenelten, T. Homberger, and M. Hutter, “Elevation Mapping for Locomotion and Navigation using GPU,” Apr. 2022

work page 2022

[6] [6]

STEP: Stochastic Traversability Evaluation and Planning for Risk-Aware Off-road Navigation; Results from the DARPA Subterranean Challenge,

A. Dixit, D. D. Fan, K. Otsu, S. Dey, A.-A. Agha-Mohammadi, and J. W. Bur- dick, “STEP: Stochastic Traversability Evaluation and Planning for Risk-Aware Off-road Navigation; Results from the DARPA Subterranean Challenge,” Field Robotics, vol. 4, no. 1, pp. 182–210, Jan. 2024

work page 2024

[7] [7]

Mine Tunnel Exploration using Multiple Quadrupedal Robots,

I. D. Miller, F. Cladera, A. Cowley, S. S. Shivakumar, E. S. Lee, L. Jarin-Lipschitz, A. Bhat, N. Rodrigues, A. Zhou, A. Cohen, A. Kulkarni, J. Laney, C. J. Taylor, 16 Max Asselmeier et al. and V. Kumar, “Mine Tunnel Exploration using Multiple Quadrupedal Robots,” Feb. 2020

work page 2020

[8] [8]

OHM: GPU Based Occupancy Map Generation,

K. Stepanas, J. Williams, E. Hern´ andez, F. Ruetz, and T. Hines, “OHM: GPU Based Occupancy Map Generation,”IEEE Robotics and Automation Letters, vol. 7, no. 4, pp. 11 078–11 085, Oct. 2022

work page 2022

[9] [9]

Flexible Supervised Autonomy for Exploration in Subterranean Environments,

H. Biggie, E. R. Rush, D. G. Riley, S. Ahmad, M. T. Ohradzansky, K. Harlow, M. J. Miles, D. Torres, S. McGuire, E. W. Frew, C. Heckman, and J. S. Humbert, “Flexible Supervised Autonomy for Exploration in Subterranean Environments,” Field Robotics, vol. 3, no. 1, pp. 125–189, Jan. 2023

work page 2023

[10] [10]

Rough Terrain Navigation for Legged Robots using Reachability Planning and Template Learning,

L. Wellhausen and M. Hutter, “Rough Terrain Navigation for Legged Robots using Reachability Planning and Template Learning,” in 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) , Sep. 2021

work page 2021

[11] [11]

NeBula: Quest for Robotic Autonomy in Challenging Environments; TEAM CoSTAR at the DARPA Subterranean Challenge,

A. Agha, K. Otsu, B. Morrell, D. D. Fan, R. Thakker, A. Santamaria-Navarro, S.-K. Kim, A. Bouman, X. Lei, J. Edlund, M. F. Ginting, K. Ebadi, M. Ander- son, T. Pailevanian, E. Terry, M. Wolf, A. Tagliabue, T. S. Vaquero, M. Palieri, S. Tepsuporn, Y. Chang, A. Kalantari, F. Chavez, B. Lopez, N. Funabiki, G. Miles, T. Touma, A. Buscicchio, J. Tordesillas, N...

work page arXiv 2021

[12] [12]

Heterogeneous robot teams with uni- fied perception and autonomy: How Team CSIRO Data61 tied for the top score at the DARPA Subterranean Challenge,

N. Kottege, J. Williams, B. Tidd, F. Talbot, R. Steindl, M. Cox, D. Frousheger, T. Hines, A. Pitt, B. Tam, B. Wood, L. Hanson, K. L. Surdo, T. Molnar, M. Wildie, K. Stepanas, G. Catt, L. Tychsen-Smith, D. Penfold, L. Overs, M. Ramezani, K. Khosoussi, F. Kendoul, G. Wagner, D. Palmer, J. Manderson, C. Medek, M. O’Brien, S. Chen, and R. C. Arkin, “Heterogen...

work page 2023

[13] [13]

Perceptive Locomotion in Rough Terrain – Online Foothold Optimization,

F. Jenelten, T. Miki, A. E. Vijayan, M. Bjelonic, and M. Hutter, “Perceptive Locomotion in Rough Terrain – Online Foothold Optimization,” IEEE Robotics and Automation Letters , vol. 5, no. 4, pp. 5370–5376, Oct. 2020

work page 2020

[14] [14]

Where Should I Walk? Predicting Terrain Properties From Images Via Self- Supervised Learning,

L. Wellhausen, A. Dosovitskiy, R. Ranftl, K. Walas, C. Cadena, and M. Hut- ter, “Where Should I Walk? Predicting Terrain Properties From Images Via Self- Supervised Learning,” IEEE Robotics and Automation Letters , vol. 4, no. 2, pp. 1509–1516, Apr. 2019

work page 2019

[15] [15]

Prediction Maps for Real-Time 3D Footstep Planning in Dynamic Environments,

P. Karkowski and M. Bennewitz, “Prediction Maps for Real-Time 3D Footstep Planning in Dynamic Environments,” in2019 International Conference on Robotics and Automation (ICRA) , May 2019, pp. 2517–2523

work page 2019

[16] [16]

Vision Aided Dynamic Exploration of Unstructured Terrain with a Small-Scale Quadruped Robot,

D. Kim, D. Carballo, J. Di Carlo, B. Katz, G. Bledt, B. Lim, and S. Kim, “Vision Aided Dynamic Exploration of Unstructured Terrain with a Small-Scale Quadruped Robot,” in 2020 IEEE International Conference on Robotics and Au- tomation (ICRA) , May 2020, pp. 2464–2470

work page 2020

[17] [17]

Foot- step Planning for Autonomous Walking Over Rough Terrain,

R. J. Griffin, G. Wiedebach, S. McCrory, S. Bertrand, I. Lee, and J. Pratt, “Foot- step Planning for Autonomous Walking Over Rough Terrain,” in 2019 IEEE-RAS 19th International Conference on Humanoid Robots (Humanoids) , Oct. 2019, pp. 9–16. Steppability-informed Quadrupedal Contact Planning 17

work page 2019

[18] [18]

Detecting Usable Planar Regions for Legged Robot Locomotion,

S. Bertrand, I. Lee, B. Mishra, D. Calvert, J. Pratt, and R. Griffin, “Detecting Usable Planar Regions for Legged Robot Locomotion,” in 2020 IEEE/RSJ Inter- national Conference on Intelligent Robots and Systems (IROS) . Las Vegas, NV, USA: IEEE, Oct. 2020, pp. 4736–4742

work page 2020

[19] [19]

Perceptive whole-body planning for multilegged robots in confined spaces,

R. Buchanan, L. Wellhausen, M. Bjelonic, T. Bandyopadhyay, N. Kottege, and M. Hutter, “Perceptive whole-body planning for multilegged robots in confined spaces,” Journal of Field Robotics , vol. 38, no. 1, pp. 68–84, 2021

work page 2021

[20] [20]

Perceptive Lo- comotion through Nonlinear Model Predictive Control,

R. Grandia, F. Jenelten, S. Yang, F. Farshidian, and M. Hutter, “Perceptive Lo- comotion through Nonlinear Model Predictive Control,” in arXiv, Aug. 2022

work page 2022

[21] [21]

PiPS: Planning in perception space,

J. S. Smith and P. Vela, “PiPS: Planning in perception space,” in 2017 IEEE International Conference on Robotics and Automation (ICRA) , 2017, pp. 6204– 6209

work page 2017

[22] [22]

Potential Gap: A Gap-Informed Reactive Policy for Safe Hierarchical Navigation,

R. Xu, S. Feng, and P. A. Vela, “Potential Gap: A Gap-Informed Reactive Policy for Safe Hierarchical Navigation,” IEEE Robotics and Automation Letters , vol. 6, no. 4, pp. 8325–8332, 2021

work page 2021

[23] [23]

Real-Time Egocentric Navigation Using 3D Sensing,

J. S. Smith, S. Feng, F. Lyu, and P. A. Vela, “Real-Time Egocentric Navigation Using 3D Sensing,” in Machine Vision and Navigation , 2020, pp. 431–484

work page 2020

[24] [24]

egoTEB: Egocentric, Perception Space Navigation Using Timed-Elastic-Bands,

J. S. Smith, R. Xu, and P. Vela, “egoTEB: Egocentric, Perception Space Navigation Using Timed-Elastic-Bands,” in 2020 IEEE International Conference on Robotics and Automation (ICRA) , 2020, pp. 2703–2709

work page 2020

[25] [25]

Deep Visual Reasoning: Learning to Predict Action Sequences for Task and Motion Planning from an Initial Scene Image,

D. Driess, J.-S. Ha, and M. Toussaint, “Deep Visual Reasoning: Learning to Predict Action Sequences for Task and Motion Planning from an Initial Scene Image,” in Robotics: Science and Systems XVI . Robotics: Science and Systems Foundation, Jul. 2020

work page 2020

[26] [26]

Deep Visual Heuristics: Learning Feasibility of Mixed-Integer Programs for Manipulation Planning,

D. Driess, O. Oguz, J.-S. Ha, and M. Toussaint, “Deep Visual Heuristics: Learning Feasibility of Mixed-Integer Programs for Manipulation Planning,” in 2020 IEEE International Conference on Robotics and Automation (ICRA) , May 2020, pp. 9563–9569

work page 2020

[27] [27]

Learning to Navigate Sidewalks in Outdoor Environments,

M. Sorokin, J. Tan, C. K. Liu, and S. Ha, “Learning to Navigate Sidewalks in Outdoor Environments,” Sep. 2021

work page 2021

[28] [28]

Legged Locomotion in Chal- lenging Terrains using Egocentric Vision,

A. Agarwal, A. Kumar, J. Malik, and D. Pathak, “Legged Locomotion in Chal- lenging Terrains using Egocentric Vision,” Sep. 2022

work page 2022

[29] [29]

Shape- Primitive Based Object Recognition and Grasping,

M. Nieuwenhuisen, J. Stueckler, A. Berner, R. Klein, and S. Behnke, “Shape- Primitive Based Object Recognition and Grasping,” in ROBOTIK 2012; 7th Ger- man Conference on Robotics , May 2012, pp. 1–5

work page 2012

[30] [30]

Using synthetic data and deep net- works to recognize primitive shapes for object grasping,

Y. Lin, C. Tang, F.-J. Chu, and P. A. Vela, “Using synthetic data and deep net- works to recognize primitive shapes for object grasping,” in 2020 IEEE Interna- tional Conference on Robotics and Automation (ICRA) . IEEE, 2020, pp. 10 494– 10 501

work page 2020

[31] [31]

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation,

L.-C. Chen, Y. Zhu, G. Papandreou, F. Schroff, and H. Adam, “Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation,” Aug. 2018

work page 2018

[32] [32]

Detectron2,

Yuxin Wu, Alexander Kirillov, Francisco Massa, Wan-Yen Lo, and Ross Girshick, “Detectron2,” 2019

work page 2019

[33] [33]

Hierarchical Experience-informed Navigation for Multi-modal Quadrupedal Rebar Grid Traver- sal,

M. Asselmeier, J. Ivanova, Z. Zhou, P. A. Vela, and Y. Zhao, “Hierarchical Experience-informed Navigation for Multi-modal Quadrupedal Rebar Grid Traver- sal,” 2024 IEEE International Conference on Robotics and Automation (ICRA) , 2024

work page 2024

[34] [34]

Scaling Multimodal Planning: Using Experience and Informing Discrete Search,

Z. Kingston and L. E. Kavraki, “Scaling Multimodal Planning: Using Experience and Informing Discrete Search,” IEEE Transactions on Robotics , vol. 39, no. 1, pp. 128–146, Feb. 2023. 18 Max Asselmeier et al

work page 2023

[35] [35]

Centroidal dynamics of a humanoid robot,

D. E. Orin, A. Goswami, and S.-H. Lee, “Centroidal dynamics of a humanoid robot,” Autonomous Robots, vol. 35, no. 2, pp. 161–176, Oct. 2013

work page 2013

[36] [36]

A Unified MPC Framework for Whole-Body Dynamic Locomotion and Manipulation,

J.-P. Sleiman, F. Farshidian, M. V. Minniti, and M. Hutter, “A Unified MPC Framework for Whole-Body Dynamic Locomotion and Manipulation,” IEEE Robotics and Automation Letters , vol. 6, no. 3, pp. 4688–4695, Jul. 2021

work page 2021