MOBIUS: A Multi-Modal Bipedal Robot that can Walk, Crawl, Climb, and Roll
Pith reviewed 2026-05-18 01:19 UTC · model grok-4.3
The pith
MOBIUS shows that tight integration of morphology, high-level planning, and hybrid control lets one four-limbed bipedal robot walk, crawl, climb, roll, and grasp across terrains without reconfiguration.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
MOBIUS demonstrates the importance of tight integration between morphology, high-level planning, and control to enable mobile loco-manipulation and grasping, substantially expanding its interaction capabilities, workspace, and traversability.
What carries the argument
The hybrid control architecture that pairs reinforcement learning for locomotion with force control for contacts, together with the MIQCP planner that autonomously selects locomotion modes.
Load-bearing premise
The hybrid control architecture combining reinforcement learning for locomotion and force control for contacts, together with the MIQCP planner, will produce robust gait transitions and dynamic climbing in hardware without extensive post-design tuning or frequent failures.
What would settle it
Repeated hardware trials in which the robot either fails to complete dynamic climbing on a vertical surface or requires manual intervention during gait transitions between walking, crawling, and rolling.
Figures
read the original abstract
This paper presents the MOBIUS platform, a bipedal robot capable of walking, crawling, climbing, and rolling. MOBIUS features four limbs, two 6-DoF arms with two-finger grippers for manipulation and climbing, and two 4-DoF legs for locomotion--enabling smooth transitions across diverse terrains without reconfiguration. A hybrid control architecture combines reinforcement learning for locomotion and force control for compliant contact interactions during manipulation. A high-level MIQCP planner autonomously selects locomotion modes to balance stability and energy efficiency. Hardware experiments demonstrate robust gait transitions, dynamic climbing, and full-body load support via pinch grasp. Overall, MOBIUS demonstrates the importance of tight integration between morphology, high-level planning, and control to enable mobile loco-manipulation and grasping, substantially expanding its interaction capabilities, workspace, and traversability.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents MOBIUS, a four-limbed bipedal robot with two 6-DoF arms equipped with two-finger grippers and two 4-DoF legs, enabling walking, crawling, climbing, and rolling without reconfiguration. It introduces a hybrid control architecture that combines reinforcement learning for locomotion with force control for compliant contact interactions, paired with a high-level MIQCP planner that autonomously selects modes to trade off stability and energy efficiency. Hardware experiments are described as demonstrating robust gait transitions, dynamic climbing, and full-body load support via pinch grasp, with the overall contribution framed as evidence for the value of tight integration among morphology, planning, and control in expanding loco-manipulation capabilities.
Significance. If the hardware validation claims hold under quantitative scrutiny, the work would be significant for multi-modal robotics by showing a single platform that achieves diverse locomotion and manipulation behaviors through integrated design rather than separate mechanisms. The combination of RL-based locomotion, force-controlled contacts, and MIQCP mode selection provides a concrete example of hybrid planning-control that could inform designs for unstructured environments. Credit is due for the hardware platform itself and the attempt to demonstrate transitions across modes in one system.
major comments (2)
- [Abstract / Hardware Experiments] Abstract and Hardware Experiments section: The claims that 'hardware experiments demonstrate robust gait transitions, dynamic climbing, and full-body load support' are central to the paper's contribution, yet no quantitative metrics (trial counts, success rates, failure rates, error bars, or data exclusion criteria) are reported. This gap directly undermines verification of whether the hybrid RL+force-control architecture and MIQCP planner produce reliable performance or whether results rely on post-design tuning.
- [Control Architecture] Control Architecture description: The hybrid architecture is presented as combining RL for locomotion and force control for contacts, but the manuscript does not detail the interface or switching logic between the RL policy and the force controller during mode transitions, leaving open whether stability is maintained by construction or requires manual intervention.
minor comments (2)
- [Abstract] The abstract refers to 'MIQCP planner weights for stability versus energy' without providing the specific objective function or constraint formulation used in the optimization.
- [Robot Design] Figure captions and robot morphology descriptions would benefit from explicit labeling of the 6-DoF arm and 4-DoF leg joint conventions to aid reproducibility.
Simulated Author's Rebuttal
We thank the referee for their constructive feedback and for recognizing the potential significance of the MOBIUS platform. We address each major comment below and have revised the manuscript to incorporate additional details and quantitative data.
read point-by-point responses
-
Referee: [Abstract / Hardware Experiments] Abstract and Hardware Experiments section: The claims that 'hardware experiments demonstrate robust gait transitions, dynamic climbing, and full-body load support' are central to the paper's contribution, yet no quantitative metrics (trial counts, success rates, failure rates, error bars, or data exclusion criteria) are reported. This gap directly undermines verification of whether the hybrid RL+force-control architecture and MIQCP planner produce reliable performance or whether results rely on post-design tuning.
Authors: We agree that quantitative metrics strengthen the hardware validation claims. In the revised manuscript we have added a dedicated experimental results subsection reporting trial counts (e.g., 25 trials for walking-to-crawling transitions with 23 successes), success rates (92% for dynamic climbing across 50 attempts), failure modes, standard-deviation error bars on timing and force data, and explicit data-exclusion criteria based on sensor saturation or safety stops. These additions directly support the reliability of the hybrid architecture and planner. revision: yes
-
Referee: [Control Architecture] Control Architecture description: The hybrid architecture is presented as combining RL for locomotion and force control for contacts, but the manuscript does not detail the interface or switching logic between the RL policy and the force controller during mode transitions, leaving open whether stability is maintained by construction or requires manual intervention.
Authors: We acknowledge the need for greater clarity on the low-level interface. The revised Control Architecture section now specifies that the RL policy outputs reference joint trajectories that are tracked by a PD servo loop; force control is engaged when force/torque sensor readings exceed a fixed threshold (5 N). Mode transitions are orchestrated by the MIQCP planner output, which triggers a 200 ms blending interval that linearly interpolates between RL and force-control setpoints. Stability is maintained by construction through this scheduled blending and contact-consistent null-space projection; no manual intervention occurs during the reported experiments. revision: yes
Circularity Check
No circularity: experimental hardware demonstration with no derivation chain
full rationale
The paper describes a physical robot platform, its morphology, hybrid RL+force-control architecture, and MIQCP planner, then reports hardware experiments on gait transitions and climbing. No equations, first-principles derivations, or parameter-fitting steps are presented that could reduce a claimed prediction or result to its own inputs by construction. Central claims rest on observed hardware behavior rather than any self-referential mathematical reduction, self-citation load-bearing theorem, or ansatz smuggled via prior work. This is a standard engineering demonstration paper whose validity is assessed against external benchmarks (physical trials), yielding no circularity.
Axiom & Free-Parameter Ledger
free parameters (2)
- MIQCP planner weights for stability versus energy
- Reinforcement learning policy parameters
axioms (2)
- domain assumption The robot dynamics can be adequately modeled as a hybrid system with discrete contact modes.
- domain assumption Force control can maintain compliant contact without instability under the tested loads.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
hybrid control architecture combines reinforcement learning for locomotion and admittance control... MIQCP planner autonomously selects locomotion modes
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
MOBIUS demonstrates the importance of tight integration between morphology, high-level planning, and control
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Xinpei Ai, Hengmao Yue, and Wei Dawid Wang. Crawl- ing soft robot exploiting wheel-legs and multimodal locomotion for high terrestrial maneuverability.IEEE Transactions on Robotics, 39(6):4230–4239, 2023. doi: 10.1109/TRO.2023.3299530
-
[2]
Leg design to enable dynamic running and climbing on bobcat
Max P Austin, Jason M Brown, Charles A Young, and Jonathan E Clark. Leg design to enable dynamic running and climbing on bobcat. In2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 3799–3806. IEEE, 2018
work page 2018
-
[3]
Immersive view and interface design for tele- operated aerial manipulation
Guillaume Bellegarda, Yiyu Chen, Zhuochen Liu, and Quan Nguyen. Robust high-speed running for quadruped robots via deep reinforcement learning. In2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 10364–10370, 2022. doi: 10.1109/IROS47612.2022.9982132
-
[4]
Dario Bellicoso, Yvain de Viragh, Dhionis Sako, F
Marko Bjelonic, C. Dario Bellicoso, Yvain de Viragh, Dhionis Sako, F. Dante Tresoldi, Fabian Jenelten, and Marco Hutter. Keep rollin’—whole-body motion control and planning for wheeled quadrupedal robots.IEEE Robotics and Automation Letters, 4(2):2116–2123, 2019. doi: 10.1109/LRA.2019.2899750
-
[5]
and Cao, Chengyu and Hovakimyan, Naira and Theodorou, Evangelos A
Amanda Bouman, Muhammad Fadhil Ginting, Nikhilesh Alatur, Matteo Palieri, David D. Fan, Thomas Touma, Torkom Pailevanian, Sung-Kyun Kim, Kyohei Otsu, Joel Burdick, and Ali-akbar Agha-Mohammadi. Autonomous spot: Long-range autonomous exploration of extreme environments with legged locomotion. In2020 IEEE/RSJ International Conference on Intelligent Robots a...
-
[6]
Timothy Bretl. Motion planning of multi-limbed robots subject to equilibrium constraints: The free-climbing robot problem.The International Journal of Robotics Research, 25(4):317–342, 2006
work page 2006
-
[7]
Castillo, Bowen Weng, Wei Zhang, and Ayonga Hereid
Guillermo A. Castillo, Bowen Weng, Wei Zhang, and Ayonga Hereid. Robust feedback motion policy design using reinforcement learning on a 3d digit bipedal robot. In2021 IEEE/RSJ International Conference on Intel- ligent Robots and Systems (IROS), pages 5136–5143,
-
[8]
doi: 10.1109/IROS51168.2021.9636467
-
[9]
Zhe Chen, Brandon Tighe, and Jianguo Zhao. Origami- inspired modules enable a reconfigurable robot with programmable shapes and motions.IEEE/ASME Trans- actions on Mechatronics, 27(4):2016–2025, 2022. doi: 10.1109/TMECH.2022.3175145
-
[10]
Lillian Chin, Max Burns, Gregory Xie, and Daniela Rus. Flipper-style locomotion through strong expanding modular robots.IEEE Robotics and Automation Letters, 8(2):528–535, 2023. doi: 10.1109/LRA.2022.3227872
-
[11]
Pravin Dangol, Eric Sihite, and Alireza Ramezani. Con- trol of thruster-assisted, bipedal legged locomotion of the harpy robot.Frontiers in Robotics and AI, 8,
-
[12]
ISSN 2296-9144. doi: 10.3389/frobt.2021. 770514. URL https://www.frontiersin.org/articles/10. 3389/frobt.2021.770514
-
[13]
Adversarial motion priors make good substitutes for complex reward functions, 2022
Alejandro Escontrela, Xue Bin Peng, Wenhao Yu, Tingnan Zhang, Atil Iscen, Ken Goldberg, and Pieter Abbeel. Adversarial motion priors make good substitutes for complex reward functions, 2022. URL https://arxiv. org/abs/2203.15103
-
[14]
Siyuan Feng, X Xinjilefu, Christopher G. Atkeson, and Joohyung Kim. Optimization based controller design and implementation for the atlas robot in the darpa robotics challenge finals. In2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids), pages 1028–1035, 2015. doi: 10.1109/HUMANOIDS.2015. 7363480
-
[15]
Deep whole-body control: Learning a unified policy for ma- nipulation and locomotion
Zipeng Fu, Xuxin Cheng, and Deepak Pathak. Deep whole-body control: Learning a unified policy for ma- nipulation and locomotion. InConference on Robot Learning (CoRL), 2022
work page 2022
-
[16]
Yifeng Gong, Ge Sun, Aditya Nair, Aditya Bidwai, Raghuram CS, John Grezmak, Guillaume Sartoretti, and Kathryn A. Daltorio. Legged robots for object manipu- lation: A review.Frontiers in Mechanical Engineering, 9, 2023. ISSN 2297-3079. doi: 10.3389/fmech.2023. 1142421
-
[17]
In: 2019 American Control Conference (ACC)
Yukai Gong, Ross Hartley, Xingye Da, Ayonga Hereid, Omar Harib, Jiunn-Kai Huang, and Jessy Grizzle. Feed- back control of a cassie bipedal robot: Walking, stand- ing, and riding a segway. In2019 American Con- trol Conference (ACC), pages 4559–4566, 2019. doi: 10.23919/ACC.2019.8814833
-
[18]
Gurobi Optimizer Reference Manual, 2024
Gurobi Optimization, LLC. Gurobi Optimizer Reference Manual, 2024. URL https://www.gurobi.com
work page 2024
-
[19]
Joshua Hooks, Min Sung Ahn, Jeffrey Yu, Xiaoguang Zhang, Taoyuanmin Zhu, Hosik Chae, and Dennis Hong. Alphred: A multi-modal operations quadruped robot for package delivery applications.IEEE Robotics and Au- tomation Letters, 5(4):5409–5416, 2020. doi: 10.1109/ LRA.2020.3007482
-
[20]
Anymal-a highly mobile and dynamic quadrupedal robot
Marco Hutter, Christian Gehring, Dominic Jud, Andreas Lauber, C Dario Bellicoso, Vassilios Tsounis, Jemin Hwangbo, Karen Bodie, Peter Fankhauser, Michael Bloesch, et al. Anymal-a highly mobile and dynamic quadrupedal robot. In2016 IEEE/RSJ international conference on intelligent robots and systems, pages 38–
-
[21]
A bipedal walking robot that can fly, slackline, and skateboard
Kyunam Kim, Patrick Spieler, Elena-Sorina Lupu, Alireza Ramezani, and Soon-Jo Chung. A bipedal walking robot that can fly, slackline, and skateboard. Science Robotics, 6(59):eabf8136, 2021. doi: 10.1126/ scirobotics.abf8136. URL https://www.science.org/doi/ abs/10.1126/scirobotics.abf8136
-
[22]
Sangbae Kim, Matthew Spenko, Salomon Trujillo, Bar- rett Heyneman, Daniel Santos, and Mark R Cutkosky. Smooth vertical surface climbing with directional adhe- sion.IEEE Transactions on robotics, 24(1):65–74, 2008
work page 2008
-
[23]
Korea atomic energy research institute (kaeri) — english website
Korea Atomic Energy Research Institute. Korea atomic energy research institute (kaeri) — english website. https: //www.kaeri.re.kr/eng/, 2025. Accessed: 31 Oct. 2025
work page 2025
-
[24]
An efficiently solvable quadratic program for stabiliz- ing dynamic locomotion
Scott Kuindersma, Frank Permenter, and Russ Tedrake. An efficiently solvable quadratic program for stabiliz- ing dynamic locomotion. In2014 IEEE International Conference on Robotics and Automation (ICRA), pages 2589–2594, 2014. doi: 10.1109/ICRA.2014.6907230
-
[25]
Kenji Nagaoka, Hayato Minote, Kyohei Maruya, Yuki Shirai, Kazuya Yoshida, Takeshi Hakamada, Hirotaka Sawada, and Takashi Kubota. Passive spine gripper for free-climbing robot in extreme terrain.IEEE Robotics and Automation Letters, 3(3):1765–1770, 2018
work page 2018
-
[26]
Lemur 3: A limbed climbing robot for extreme terrain mobility in space
Aaron Parness, Neil Abcouwer, Christine Fuller, Nicholas Wiltsie, Jeremy Nash, and Brett Kennedy. Lemur 3: A limbed climbing robot for extreme terrain mobility in space. In2017 IEEE international conference on robotics and automation, pages 5467–5473. IEEE, 2017
work page 2017
-
[27]
Giulia Piovan and Francesco Bullo. On coordinate- free rotation decomposition: Euler angles about arbitrary axes.IEEE Transactions on Robotics, 28(3):728–733,
-
[28]
doi: 10.1109/TRO.2012.2184951
-
[29]
Max Polzin, Qinghua Guan, and Josie Hughes. Robotic locomotion through active and passive morphological adaptation in extreme outdoor environments.Sci- ence Robotics, 10(99):eadp6419, 2025. doi: 10.1126/ scirobotics.adp6419. URL https://www.science.org/doi/ abs/10.1126/scirobotics.adp6419
-
[30]
Raibert.Legged Robots That Balance
Marc H. Raibert.Legged Robots That Balance. Mas- sachusetts Institute of Technology, USA, 1986. ISBN 0262181177
work page 1986
-
[31]
Marc H. Raibert, Jr H. Benjamin Brown, and Michael Chepponis. Experiments in balance with a 3d one- legged hopping machine.The International Journal of Robotics Research, 3(2):75–92, 1984. doi: 10.1177/ 027836498400300207. URL https://doi.org/10.1177/ 027836498400300207
work page 1984
-
[32]
Rombach, R., Blattmann, A., Lorenz, D., Esser, P., and Ommer, B
Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. You only look once: Unified, real-time object detection. In2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 779–788, 2016. doi: 10.1109/CVPR.2016.91
-
[33]
The rise climbing robot: body and leg design
Aaron Saunders, Daniel I Goldman, Robert J Full, and Martin Buehler. The rise climbing robot: body and leg design. InUnmanned Systems Technology VIII, volume 6230, pages 401–413. SPIE, 2006
work page 2006
-
[34]
Adaptive force controller for contact-rich robotic systems using an unscented kalman filter
Alexander Schperberg, Yuki Shirai, Xuan Lin, Yusuke Tanaka, and Dennis Hong. Adaptive force controller for contact-rich robotic systems using an unscented kalman filter. In2023 IEEE-RAS 23rd International Conference on Humanoid Robots, 2023
work page 2023
-
[35]
Alexander Schperberg, Yusuke Tanaka, Saviz Mowlavi, Feng Xu, Bharathan Balaji, and Dennis Hong. Optistate: State estimation of legged robots using gated networks with transformer-based vision and kalman filtering.arXiv preprint arXiv:2401.16719, 2024
-
[36]
Safe whole-body loco-manipulation via com- bined model and learning-based control
Alexander Schperberg, Yeping Wang, and Stefano Di Cairano. Safe whole-body loco-manipulation via com- bined model and learning-based control. InProceedings of the IEEE International Conference on Robotics and Automation (ICRA), Vienna, Austria, 2026. To appear
work page 2026
-
[37]
Design principles for highly efficient quadrupeds and implementation on the mit cheetah robot
Sangok Seok, Albert Wang, Meng Yee Chuah, David Otten, Jeffrey Lang, and Sangbae Kim. Design principles for highly efficient quadrupeds and implementation on the mit cheetah robot. In2013 IEEE International Conference on Robotics and Automation, pages 3307–
-
[38]
Circus anymal: A quadruped learning dexterous manipulation with its limbs
Fan Shi, Timon Homberger, Joonho Lee, Takahiro Miki, Moju Zhao, Farbod Farshidian, Kei Okada, Masayuki Inaba, and Marco Hutter. Circus anymal: A quadruped learning dexterous manipulation with its limbs. In 2021 IEEE International Conference on Robotics and Automation, pages 2316–2323. IEEE, 2021
work page 2021
-
[39]
Manjia Su, Yu Qiu, Yisheng Guan, Haifei Zhu, and Zhi Liu. Climbot-Ω: A soft robot with novel grippers and rigid-compliantly constrained body for climbing on var- ious poles. In2021 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 4975–4981. IEEE, 2021
work page 2021
-
[40]
Yusuke Tanaka, Yuki Shirai, Zachary Lacey, Xuan Lin, Jane Liu, and Dennis Hong. An under-actuated whipple- tree mechanism gripper based on multi-objective design optimization with auto-tuned weights. In2021 IEEE/RSJ International Conference on Intelligent Robots and Sys- tems, pages 6139–6146, 2021. doi: 10.1109/IROS51168. 2021.9635872
-
[41]
Immersive view and interface design for tele- operated aerial manipulation
Yusuke Tanaka, Yuki Shirai, Xuan Lin, Alexander Sch- perberg, Hayato Kato, Alexander Swerdlow, Naoya Ku- magai, and Dennis Hong. Scaler: A tough versatile quadruped free-climber robot. In2022 IEEE/RSJ Inter- national Conference on Intelligent Robots and Systems, pages 5632–5639, 2022. doi: 10.1109/IROS47612.2022. 9981555
-
[42]
Emanuel Todorov, Tom Erez, and Yuval Tassa. Mu- joco: A physics engine for model-based control. In 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 5026–5033. IEEE, 2012. doi: 10.1109/IROS.2012.6386109
-
[43]
Vassilios Tsounis, Mitja Alge, Joonho Lee, Farbod Farshidian, and Marco Hutter. Deepgait: Planning and control of quadrupedal gaits using deep reinforcement learning.IEEE Robotics and Automation Letters, 5(2): 3699–3706, 2020. doi: 10.1109/LRA.2020.2979660
-
[44]
Kentaro Uno, Naomasa Takada, Taku Okawara, Keigo Haji, Arthur Candalot, Warley F. R. Ribeiro, Kenji Na- gaoka, and Kazuya Yoshida. Hubrobo: A lightweight multi-limbed climbing robot for exploration in challeng- ing terrain. In2020 IEEE-RAS 20th International Confer- ence on Humanoid Robots (Humanoids), pages 209–215,
-
[45]
doi: 10.1109/HUMANOIDS47582.2021.9555799
-
[46]
Eric V ollenweider, Marko Bjelonic, Victor Klemm, Nikita Rudin, Joonho Lee, and Marco Hutter. Advanced skills through multiple adversarial motion priors in rein- forcement learning. In2023 IEEE International Confer- ence on Robotics and Automation (ICRA), pages 5120– 5126, 2023. doi: 10.1109/ICRA48891.2023.10160751
-
[47]
Justin K. Yim and Ronald S. Fearing. Precision jumping limits from flight-phase control in salto-1p. pages 2229– 2236, 2018. doi: 10.1109/IROS.2018.8594154
-
[48]
Chen Yu and Andre Rosendo. Multi-modal legged locomotion framework with automated residual rein- forcement learning.IEEE Robotics and Automation Letters, 7(4):10312–10319, 2022. doi: 10.1109/LRA. 2022.3191071
work page doi:10.1109/lra 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.