DART: Learning-Enhanced Model Predictive Control for Dual-Arm Non-Prehensile Manipulation

Arun Kumar Singh; Autrio Das; Keshab Patra; K Madhava Krishna; Madala Venkata Renu Jeevesh; Nagamanikandan Govindan; Shreya Bollimuntha; Tashmoy Ghosh

arxiv: 2604.17833 · v2 · submitted 2026-04-20 · 💻 cs.RO

DART: Learning-Enhanced Model Predictive Control for Dual-Arm Non-Prehensile Manipulation

Autrio Das , Shreya Bollimuntha , Madala Venkata Renu Jeevesh , Keshab Patra , Tashmoy Ghosh , Nagamanikandan Govindan , Arun Kumar Singh , K Madhava Krishna This is my paper

Pith reviewed 2026-05-10 05:00 UTC · model grok-4.3

classification 💻 cs.RO

keywords dual-arm manipulationnon-prehensile manipulationmodel predictive controlimpedance controlreinforcement learningdynamics modelingservice robotics

0 comments

The pith

DART integrates nonlinear MPC with an impedance controller and three tray-object dynamics models to achieve precise non-prehensile dual-arm manipulation on a moving tray.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces DART to let two robot arms transport and reposition objects on a tray without grasping them, a task that expands service-robot capabilities in settings like hotels. It embeds nonlinear model predictive control inside an optimization-based impedance controller so the object stays in accurate motion relative to the tray. The central evaluation compares three ways to predict how the tray and object interact inside the MPC: a fixed physics model, a model that fits fresh data online, and a reinforcement-learning model trained to handle varied object properties. Simulation tests across objects of different mass, shape, and friction show measurable differences in settling time, steady-state error, control effort, and generalization. The authors position the work as the first complete framework for this class of dual-arm non-prehensile tray tasks.

Core claim

DART achieves accurate object motion relative to a dynamically controlled tray by coupling nonlinear model predictive control with an optimization-based impedance controller. The MPC uses one of three complementary tray-object dynamics models as its state-transition function: a physics-based analytical model, an online regression-based identification model that adapts in real time, or a reinforcement-learning-based model that generalizes across object properties. Validation in simulation with objects of varying mass, geometry, and friction coefficients reveals the performance trade-offs among the three modeling choices.

What carries the argument

Nonlinear MPC whose state-transition function is supplied by one of three tray-object dynamics models (physics analytical, online regression, or RL), paired with an optimization-based impedance controller that enforces relative motion accuracy.

If this is right

Accurate relative motion between object and tray is maintained in simulation for objects differing in mass, geometry, and friction.
Clear performance trade-offs appear among the three modeling strategies on settling time, steady-state error, control effort, and cross-object generalization.
The resulting controller supports precise non-prehensile transport tasks relevant to hospitality and service robotics.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If sim-to-real transfer succeeds, the same MPC structure could be reused for other sliding or pushing tasks that do not require grasping.
The RL dynamics model could let the system handle previously unseen object properties without redesigning the controller.
Extending the impedance layer to multiple trays or moving bases would test whether the relative-motion formulation scales beyond a single tray.

Load-bearing premise

The three dynamics models inside the MPC will transfer from simulation to real hardware with comparable settling time, error, and generalization.

What would settle it

A hardware experiment on the same tray and objects that shows one or more of the three dynamics models producing settling times or steady-state errors at least twice as large as the simulation results, or complete loss of generalization to a new object.

Figures

Figures reproduced from arXiv: 2604.17833 by Arun Kumar Singh, Autrio Das, Keshab Patra, K Madhava Krishna, Madala Venkata Renu Jeevesh, Nagamanikandan Govindan, Shreya Bollimuntha, Tashmoy Ghosh.

**Figure 1.** Figure 1: Non-Prehensile Manipulation using a Dual xArm7 setup. Our framework transports a bowl of fruits to the goal location on the tray through a sequence of tilts given by MPC executed by the dual-arm system. affects object trajectories while maintaining stable, compliant control of the two manipulators. When two robotic arms coordinate to manipulate a tray for controlling object motion on its surface, several … view at source ↗

**Figure 2.** Figure 2: Task Setup: Two robotic arms are placed with the bases fixed to the ground. The tray is rigidly grasped and an object is placed on the tray. The world frame is represented by {W}, body frame by {B} and object frame by {O}. A. Object–Tray Model The pose of the object in {W} is defined as x = [p ⊤, θ ⊤] ⊤, where p ∈ R 3 is the position and θ ∈ R 3 is the orientation (roll-pitch-yaw). The linear and angular v… view at source ↗

**Figure 3.** Figure 3: DART Framework: Our proposed framework takes the current object state X and the desired target state Xref. We choose ν ref as 06×1 as inputs. These are fed into a nonlinear MPC, which computes the optimal tray-tilt commands (u). These commands are then passed to an optimization-based impedance controller, which computes the torques required to realize the tilts. Feedback from the simulator updates the obje… view at source ↗

**Figure 4.** Figure 4: Performance of two objects, Cube on the left and Cylinder [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: Visualizations of the trajectories: This figure depicts the trajectories followed by a cube(green), sphere(red), and a cylinder(purple) with mass 1kg and friction 0.2 , manipulated by using 3 different MPC methods. The curves shows only the path of the object taken on the tray, longer path doesn’t mean longer convergence. The color legend corresponds to the three methods as shown on top. genaralization ov… view at source ↗

read the original abstract

What appears effortless to a human waiter remains a major challenge for robots. Manipulating objects nonprehensilely on a tray is inherently difficult, and the complexity is amplified in dual-arm settings. Such tasks are highly relevant to service robotics in domains such as hotels and hospitality, where robots must transport and reposition diverse objects with precision. We present DART, a novel dual-arm framework that integrates nonlinear Model Predictive Control (MPC) with an optimization-based impedance controller to achieve accurate object motion relative to a dynamically controlled tray. The framework systematically evaluates three complementary strategies for modeling tray-object dynamics as the state transition function within our MPC formulation: (i) a physics-based analytical model, (ii) an online regression based identification model that adapts in real-time, and (iii) a reinforcement learning-based dynamics model that generalizes across object properties. Our pipeline is validated in simulation with objects of varying mass, geometry, and friction coefficients. Extensive evaluations highlight the trade-offs among the three modeling strategies in terms of settling time, steady-state error, control effort, and generalization across objects. To the best of our knowledge, DART constitutes the first framework for non-prehensile dual-arm manipulation of objects on a tray. Project Link: https://dart-icra.github.io/dart/

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

DART is a clean simulation comparison of three dynamics models inside MPC for dual-arm tray carrying, but all results stay in sim with no hardware transfer shown.

read the letter

The paper's core contribution is a side-by-side look at three different state-transition models inside nonlinear MPC for keeping objects stable on a moving tray held by two arms. One is a standard physics model, one does online regression to fit parameters on the fly, and one uses an RL-learned dynamics model. They wrap this with an impedance controller and run it on varied object masses, shapes, and friction values in simulation, reporting trade-offs in settling time, steady-state error, control effort, and how well each generalizes when object properties change. That comparison is the part that feels useful; it gives a reader concrete numbers on where each modeling choice wins or loses within the same task and controller setup. The abstract's claim of being the first such dual-arm non-prehensile tray framework is stated plainly, though I would check the related-work section against prior single-arm or prehensile tray papers to see how much is truly new versus an extension. The evaluations appear systematic within simulation, with no obvious circularity in how the models are built or tested. The main limitation is that everything stops at simulation. Robotics papers on contact-rich tasks routinely find that unmodeled effects like stochastic friction, arm flexibility, and vision delay change exactly the metrics being optimized here, yet there is no real-robot data, no hardware description, and no sim-to-real gap analysis. That leaves the practical claims about service-robotics applicability resting on an untested transfer assumption. The work is aimed at people already working on MPC or learning-based control for non-prehensile manipulation; a reader in that niche can extract the modeling trade-offs without needing the full service-robot story. It is coherent on its own terms and shows clear thinking about the modeling choices, so it is worth sending to peer review. Reviewers will almost certainly ask for hardware results or a stronger justification for why simulation alone is enough at this stage, but the comparative structure gives them something concrete to engage with.

Referee Report

2 major / 1 minor

Summary. The paper claims to introduce the first framework, DART, for non-prehensile dual-arm manipulation of objects on a tray using nonlinear MPC combined with impedance control. It evaluates three complementary dynamics modeling strategies (physics-based analytical, online regression identification, and RL-based) as the state transition function in MPC, and validates them in simulation across objects with varying mass, geometry, and friction, highlighting trade-offs in settling time, steady-state error, control effort, and generalization.

Significance. If the simulation-based results on the three modeling strategies hold and transfer to hardware, the work would be significant for service robotics by providing a systematic comparison of dynamics models for MPC in dual-arm non-prehensile tasks. The RL-based model for generalization is particularly promising. However, the current lack of real-world validation limits the assessed impact.

major comments (2)

[Abstract] The central performance claims (trade-offs in settling time, steady-state error, control effort, and generalization) rest entirely on simulation results, as stated in the abstract. No real-robot experiments, hardware description, or analysis of sim-to-real gaps (e.g., friction stochasticity, vision latency) are mentioned, despite the practical relevance emphasized. This is load-bearing for the generalization and accuracy claims.
[Evaluation] The manuscript validates the pipeline 'in simulation with objects of varying mass, geometry, and friction coefficients' but the abstract provides no quantitative metrics, error bars, or statistical analysis, and the evaluation appears to lack ablation studies isolating the contribution of the impedance controller or tests for statistical significance of the reported trade-offs.

minor comments (1)

The term 'nonprehensilely' in the abstract should be hyphenated as 'non-prehensilely' for standard terminology.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for their constructive and detailed review. We appreciate the recognition of the potential significance of comparing dynamics modeling strategies for dual-arm non-prehensile manipulation. Below we provide point-by-point responses to the major comments and describe the revisions we will make.

read point-by-point responses

Referee: [Abstract] The central performance claims (trade-offs in settling time, steady-state error, control effort, and generalization) rest entirely on simulation results, as stated in the abstract. No real-robot experiments, hardware description, or analysis of sim-to-real gaps (e.g., friction stochasticity, vision latency) are mentioned, despite the practical relevance emphasized. This is load-bearing for the generalization and accuracy claims.

Authors: We agree that all reported results are obtained in simulation, as explicitly stated throughout the manuscript, and that this constrains the strength of claims about real-world generalization and accuracy. The simulation environment enables systematic variation of object mass, geometry, and friction that would be difficult to control on hardware. To address the concern, we will revise the abstract to include key quantitative metrics from the simulations and add a dedicated limitations section that discusses sim-to-real gaps, including friction stochasticity, vision latency, and other unmodeled effects, along with planned future hardware validation. We cannot, however, add real-robot experiments within the scope of this revision. revision: partial
Referee: [Evaluation] The manuscript validates the pipeline 'in simulation with objects of varying mass, geometry, and friction coefficients' but the abstract provides no quantitative metrics, error bars, or statistical analysis, and the evaluation appears to lack ablation studies isolating the contribution of the impedance controller or tests for statistical significance of the reported trade-offs.

Authors: We thank the referee for this observation. The current evaluation section reports results across multiple objects and scenarios but does not include error bars, formal statistical tests, or explicit ablations isolating the impedance controller. We will update the abstract to report quantitative metrics with variability measures. In the revised evaluation, we will add error bars and standard deviations to all plots, include ablation studies that disable the impedance controller to quantify its contribution, and perform statistical significance tests (e.g., paired t-tests) on the observed differences in settling time, steady-state error, and control effort. These changes will be incorporated in the next version. revision: yes

standing simulated objections not resolved

Real-robot experiments and hardware validation, which are outside the current simulation-focused scope of the manuscript and cannot be added without new physical experiments.

Circularity Check

0 steps flagged

No circularity in derivation chain; three dynamics models are independent alternatives

full rationale

The manuscript presents DART as a framework integrating three distinct dynamics modeling strategies (physics-based analytical, online regression identification, RL-based) as alternatives inside nonlinear MPC plus impedance control. These are described as complementary and comparatively evaluated on simulation metrics without any reduction of one to another by construction, fitted-parameter renaming, or self-citation chains. No equations or claims equate a prediction to its own inputs; the novelty claim is a standard 'first framework' statement unsupported by load-bearing self-citations. The derivation remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no explicit free parameters, axioms, or invented entities; full manuscript would be required to identify any fitted scales, domain assumptions about tray-object contact, or new control primitives.

pith-pipeline@v0.9.0 · 5572 in / 1062 out tokens · 44161 ms · 2026-05-10T05:00:46.762337+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

35 extracted references · 35 canonical work pages · 1 internal anchor

[1]

Mario Tr¨ obinger, Christoph J¨ ahne, Zheng Qu, Jean Elsner, Anton Reindl, Sebastian Getz, Thore Goll, Benjamin Loinger, Tamara Loibl, Christoph Kugler, et al. Introducing garmi- a service robotics platform to support the elderly at home: Design philosophy, system overview and first results.IEEE Robotics and Automation Letters, 6(3):5857–5864, 2021

work page 2021
[2]

Hacman: Learning hybrid actor-critic maps for 6d non-prehensile manipulation.arXiv preprint arXiv:2305.03942, 2023

Wenxuan Zhou, Bowen Jiang, Fan Yang, Chris Paxton, and David Held. Hacman: Learning hybrid actor-critic maps for 6d non-prehensile manipulation.arXiv preprint arXiv:2305.03942, 2023

work page arXiv 2023
[3]

Progress in nonprehensile manipulation

Matthew T Mason. Progress in nonprehensile manipulation. The International Journal of Robotics Research, 18(11):1129– 1141, 1999

work page 1999
[4]

Closed-loop control of a nonpre- hensile manipulation system inspired by the pizza-peel mech- anism

Alejandro Guti´ errez-Giles, Fabio Ruggiero, Vincenzo Lip- piello, and Bruno Siciliano. Closed-loop control of a nonpre- hensile manipulation system inspired by the pizza-peel mech- anism. In2019 18th European Control Conference (ECC), pages 1580–1585. IEEE, 2019

work page 2019
[5]

A coordinate-free framework for robotic pizza tossing and catching

Aykut C Satici, Fabio Ruggiero, Vincenzo Lippiello, and Bruno Siciliano. A coordinate-free framework for robotic pizza tossing and catching. In2016 IEEE international conference on robotics and automation (ICRA), pages 3932–3939. IEEE, 2016

work page 2016
[6]

Control of nonprehensile planar rolling manipulation: A passivity- based approach.IEEE Transactions on Robotics, 35(2):317– 329, 2019

Diana Serra, Fabio Ruggiero, Alejandro Donaire, Luca Rosario Buonocore, Vincenzo Lippiello, and Bruno Siciliano. Control of nonprehensile planar rolling manipulation: A passivity- based approach.IEEE Transactions on Robotics, 35(2):317– 329, 2019

work page 2019
[7]

More than a million ways to be pushed

Kuan-Ting Yu, Maria Bauza, Nima Fazeli, and Alberto Ro- driguez. More than a million ways to be pushed. a high- fidelity experimental dataset of planar pushing. In2016 IEEE/RSJ international conference on intelligent robots and systems (IROS), pages 30–37. IEEE, 2016

work page 2016
[8]

A re- inforcement learning approach to non-prehensile manipulation through sliding.IEEE Robotics and Automation Letters, 2025

Hamidreza Raei, Elena De Momi, and Arash Ajoudani. A re- inforcement learning approach to non-prehensile manipulation through sliding.IEEE Robotics and Automation Letters, 2025

work page 2025
[9]

Dynamic non-prehensile object transport via model-predictive reinforcement learning, 2024

Neel Jawale, Byron Boots, Balakumar Sundaralingam, and Mohak Bhardwaj. Dynamic non-prehensile object transport via model-predictive reinforcement learning, 2024

work page 2024
[10]

Dual arm manipulation—a survey.Robotics and Autonomous systems, 60(10):1340–1353, 2012

Christian Smith, Yiannis Karayiannidis, Lazaros Nalpantidis, Xavi Gratal, Peng Qi, Dimos V Dimarogonas, and Danica Kragic. Dual arm manipulation—a survey.Robotics and Autonomous systems, 60(10):1340–1353, 2012

work page 2012
[11]

Adam Heins and Angela P Schoellig. Keep it upright: Model predictive control for nonprehensile object transportation with obstacle avoidance on a mobile manipulator.IEEE Robotics and Automation Letters, 8(12):7986–7993, 2023

work page 2023
[12]

Non-prehensile object transportation via model predictive non-sliding manipulation control.IEEE Transactions on Control Systems Technology, 31(5):2231– 2244, 2023

Mario Selvaggio, Akash Garg, Fabio Ruggiero, Giuseppe Ori- olo, and Bruno Siciliano. Non-prehensile object transportation via model predictive non-sliding manipulation control.IEEE Transactions on Control Systems Technology, 31(5):2231– 2244, 2023

work page 2023
[13]

Peract2: Benchmarking and learning for robotic bimanual manipulation tasks, 2024

Markus Grotz, Mohit Shridhar, Tamim Asfour, and Dieter Fox. Peract2: Benchmarking and learning for robotic bimanual manipulation tasks, 2024

work page 2024
[14]

A bimanual manipu- lation taxonomy.IEEE Robotics and Automation Letters, 7(4):11031–11038, 2022

Franziska Krebs and Tamim Asfour. A bimanual manipu- lation taxonomy.IEEE Robotics and Automation Letters, 7(4):11031–11038, 2022

work page 2022
[15]

Da 2 dataset: Toward dexterity-aware dual-arm grasp- ing.IEEE Robotics and Automation Letters, 7(4):8941–8948, 2022

Guangyao Zhai, Yu Zheng, Ziwei Xu, Xin Kong, Yong Liu, Benjamin Busam, Yi Ren, Nassir Navab, and Zhengyou Zhang. Da 2 dataset: Toward dexterity-aware dual-arm grasp- ing.IEEE Robotics and Automation Letters, 7(4):8941–8948, 2022

work page 2022
[16]

Dg16m: A large-scale dataset for dual-arm grasping with force-optimized grasps.arXiv preprint arXiv:2503.08358, 2025

Md Faizal Karim, Mohammed Saad Hashmi, Shreya Bol- limuntha, Mahesh Reddy Tapeti, Gaurav Singh, Naga- manikandan Govindan, and K Madhava Krishna. Dg16m: A large-scale dataset for dual-arm grasping with force-optimized grasps.arXiv preprint arXiv:2503.08358, 2025

work page arXiv 2025
[17]

Push-to-see: Learning non-prehensile manipulation to enhance instance segmentation via deep q-learning

Baris Serhan, Harit Pandya, Ayse Kucukyilmaz, and Gerhard Neumann. Push-to-see: Learning non-prehensile manipulation to enhance instance segmentation via deep q-learning. In 2022 International Conference on Robotics and Automation (ICRA), pages 1513–1519, 2022

work page 2022
[18]

Dynamic in-hand sliding manipulation.IEEE Transactions on Robotics, 33(4):778–795, 2017

Jian Shi, J Zachary Woodruff, Paul B Umbanhowar, and Kevin M Lynch. Dynamic in-hand sliding manipulation.IEEE Transactions on Robotics, 33(4):778–795, 2017

work page 2017
[19]

Two arms are better than one: A behavior based control system for assistive bi- manual manipulation

Aaron Edsinger and Charles C Kemp. Two arms are better than one: A behavior based control system for assistive bi- manual manipulation. InRecent Progress in Robotics: Viable Robotic Service to Human: An Edition of the Selected Papers from the 13th International Conference on Advanced Robotics, pages 345–355. Springer, 2007

work page 2007
[20]

From single to dual-arm collaborative robotic assembly: A case study at i-labs

Simone Pantanetti, Federico Emiliani, Daniele Costa, Gia- como Palmieri, and Albin Bajrami. From single to dual-arm collaborative robotic assembly: A case study at i-labs. In2024 20th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA), pages 1–6, 2024

work page 2024
[21]

Six-dof impedance control of dual-arm cooperative manipulators.IEEE/ASME Transactions on Mechatronics, 13(5):576–586, 2008

Fabrizio Caccavale, Pasquale Chiacchio, Alessandro Marino, and Luigi Villani. Six-dof impedance control of dual-arm cooperative manipulators.IEEE/ASME Transactions on Mechatronics, 13(5):576–586, 2008

work page 2008
[22]

Relative impedance control for dual-arm robots performing asymmetric bimanual tasks.IEEE transactions on industrial electronics, 61(7):3786–3796, 2013

Jinoh Lee, Pyung Hun Chang, and Rodrigo S Jamisola. Relative impedance control for dual-arm robots performing asymmetric bimanual tasks.IEEE transactions on industrial electronics, 61(7):3786–3796, 2013

work page 2013
[23]

Zhao, Vikash Kumar, Sergey Levine, and Chelsea Finn

Tony Z. Zhao, Vikash Kumar, Sergey Levine, and Chelsea Finn. Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware. InProceedings of Robotics: Science and Systems, Daegu, Republic of Korea, July 2023

work page 2023
[24]

Bi-vla: Vision-language-action model-based system for bimanual robotic dexterous manipulations.arXiv preprint arXiv:2405.06039, 2024

Koffivi Fid` ele Gbagbe, Miguel Altamirano Cabrera, Ali Al- abbas, Oussama Alyunes, Artem Lykov, and Dzmitry Tset- serukou. Bi-vla: Vision-language-action model-based system for bimanual robotic dexterous manipulations.arXiv preprint arXiv:2405.06039, 2024

work page arXiv 2024
[25]

Towards human-level bimanual dexterous manipulation with reinforcement learning

Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuan Jiang, Zongqing Lu, Stephen McAleer, Hao Dong, Song-Chun Zhu, and Yaodong Yang. Towards human-level bimanual dexterous manipulation with reinforcement learning. Advances in Neural Information Processing Systems, 35:5150– 5163, 2022

work page 2022
[26]

Da-vil: Adaptive dual-arm manipulation with rein- forcement learning and variable impedance control

Md Faizal Karim, Shreya Bollimuntha, Mohammed Saad Hashmi, Autrio Das, Gaurav Singh, Srinath Sridhar, Arun Ku- mar Singh, Nagamanikandan Govindan, and K Madhava Kr- ishna. Da-vil: Adaptive dual-arm manipulation with rein- forcement learning and variable impedance control. In2025 IEEE International Conference on Robotics and Automation (ICRA), pages 11896–...

work page 2025
[27]

Adaptive variable impedance control of dual-arm robots for slabstone installation.ISA transactions, 128:397–408, 2022

Heyu Hu and Jianfu Cao. Adaptive variable impedance control of dual-arm robots for slabstone installation.ISA transactions, 128:397–408, 2022

work page 2022
[28]

Quadratic programming- based reference spreading control for dual-arm robotic manip- ulation with planned simultaneous impacts.IEEE Transac- tions on Robotics, 40:3341–3355, 2024

Jari van Steen, Gijs van den Brandt, Nathan van de Wouw, Jens Kober, and Alessandro Saccon. Quadratic programming- based reference spreading control for dual-arm robotic manip- ulation with planned simultaneous impacts.IEEE Transac- tions on Robotics, 40:3341–3355, 2024

work page 2024
[29]

A non-prehensile object transportation framework with adaptive tilting based on quadratic programming.IEEE Robotics and Automation Letters, 8(6):3581–3588, 2023

Rajesh Subburaman, Mario Selvaggio, and Fabio Ruggiero. A non-prehensile object transportation framework with adaptive tilting based on quadratic programming.IEEE Robotics and Automation Letters, 8(6):3581–3588, 2023

work page 2023
[30]

Armstrong-Helouvry

B. Armstrong-Helouvry. Stick-slip arising from stribeck fric- tion. InProceedings., IEEE International Conference on Robotics and Automation, pages 1377–1382 vol.2, 1990

work page 1990
[31]

Stochastic motion under nonlinear friction representing shear thinning.Physical Review E, 108(6):064606, 2023

Theo Lequy and Andreas M Menzel. Stochastic motion under nonlinear friction representing shear thinning.Physical Review E, 108(6):064606, 2023

work page 2023
[32]

Proximal Policy Optimization Algorithms

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Rad- ford, and Oleg Klimov. Proximal policy optimization algo- rithms.arXiv preprint arXiv:1707.06347, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[33]

Casadi: a software framework for nonlinear optimization and optimal control.Mathematical Programming Computation, 11(1):1–36, 2019

Joel AE Andersson, Joris Gillis, Greg Horn, James B Rawl- ings, and Moritz Diehl. Casadi: a software framework for nonlinear optimization and optimal control.Mathematical Programming Computation, 11(1):1–36, 2019

work page 2019
[34]

Prentice hall, 2010

Katsuhiko Ogata.Modern control engineering. Prentice hall, 2010

work page 2010
[35]

john Wiley & sons, 2005

Sigurd Skogestad and Ian Postlethwaite.Multivariable feed- back control: analysis and design. john Wiley & sons, 2005

work page 2005

[1] [1]

Mario Tr¨ obinger, Christoph J¨ ahne, Zheng Qu, Jean Elsner, Anton Reindl, Sebastian Getz, Thore Goll, Benjamin Loinger, Tamara Loibl, Christoph Kugler, et al. Introducing garmi- a service robotics platform to support the elderly at home: Design philosophy, system overview and first results.IEEE Robotics and Automation Letters, 6(3):5857–5864, 2021

work page 2021

[2] [2]

Hacman: Learning hybrid actor-critic maps for 6d non-prehensile manipulation.arXiv preprint arXiv:2305.03942, 2023

Wenxuan Zhou, Bowen Jiang, Fan Yang, Chris Paxton, and David Held. Hacman: Learning hybrid actor-critic maps for 6d non-prehensile manipulation.arXiv preprint arXiv:2305.03942, 2023

work page arXiv 2023

[3] [3]

Progress in nonprehensile manipulation

Matthew T Mason. Progress in nonprehensile manipulation. The International Journal of Robotics Research, 18(11):1129– 1141, 1999

work page 1999

[4] [4]

Closed-loop control of a nonpre- hensile manipulation system inspired by the pizza-peel mech- anism

Alejandro Guti´ errez-Giles, Fabio Ruggiero, Vincenzo Lip- piello, and Bruno Siciliano. Closed-loop control of a nonpre- hensile manipulation system inspired by the pizza-peel mech- anism. In2019 18th European Control Conference (ECC), pages 1580–1585. IEEE, 2019

work page 2019

[5] [5]

A coordinate-free framework for robotic pizza tossing and catching

Aykut C Satici, Fabio Ruggiero, Vincenzo Lippiello, and Bruno Siciliano. A coordinate-free framework for robotic pizza tossing and catching. In2016 IEEE international conference on robotics and automation (ICRA), pages 3932–3939. IEEE, 2016

work page 2016

[6] [6]

Control of nonprehensile planar rolling manipulation: A passivity- based approach.IEEE Transactions on Robotics, 35(2):317– 329, 2019

Diana Serra, Fabio Ruggiero, Alejandro Donaire, Luca Rosario Buonocore, Vincenzo Lippiello, and Bruno Siciliano. Control of nonprehensile planar rolling manipulation: A passivity- based approach.IEEE Transactions on Robotics, 35(2):317– 329, 2019

work page 2019

[7] [7]

More than a million ways to be pushed

Kuan-Ting Yu, Maria Bauza, Nima Fazeli, and Alberto Ro- driguez. More than a million ways to be pushed. a high- fidelity experimental dataset of planar pushing. In2016 IEEE/RSJ international conference on intelligent robots and systems (IROS), pages 30–37. IEEE, 2016

work page 2016

[8] [8]

A re- inforcement learning approach to non-prehensile manipulation through sliding.IEEE Robotics and Automation Letters, 2025

Hamidreza Raei, Elena De Momi, and Arash Ajoudani. A re- inforcement learning approach to non-prehensile manipulation through sliding.IEEE Robotics and Automation Letters, 2025

work page 2025

[9] [9]

Dynamic non-prehensile object transport via model-predictive reinforcement learning, 2024

Neel Jawale, Byron Boots, Balakumar Sundaralingam, and Mohak Bhardwaj. Dynamic non-prehensile object transport via model-predictive reinforcement learning, 2024

work page 2024

[10] [10]

Dual arm manipulation—a survey.Robotics and Autonomous systems, 60(10):1340–1353, 2012

Christian Smith, Yiannis Karayiannidis, Lazaros Nalpantidis, Xavi Gratal, Peng Qi, Dimos V Dimarogonas, and Danica Kragic. Dual arm manipulation—a survey.Robotics and Autonomous systems, 60(10):1340–1353, 2012

work page 2012

[11] [11]

Adam Heins and Angela P Schoellig. Keep it upright: Model predictive control for nonprehensile object transportation with obstacle avoidance on a mobile manipulator.IEEE Robotics and Automation Letters, 8(12):7986–7993, 2023

work page 2023

[12] [12]

Non-prehensile object transportation via model predictive non-sliding manipulation control.IEEE Transactions on Control Systems Technology, 31(5):2231– 2244, 2023

Mario Selvaggio, Akash Garg, Fabio Ruggiero, Giuseppe Ori- olo, and Bruno Siciliano. Non-prehensile object transportation via model predictive non-sliding manipulation control.IEEE Transactions on Control Systems Technology, 31(5):2231– 2244, 2023

work page 2023

[13] [13]

Peract2: Benchmarking and learning for robotic bimanual manipulation tasks, 2024

Markus Grotz, Mohit Shridhar, Tamim Asfour, and Dieter Fox. Peract2: Benchmarking and learning for robotic bimanual manipulation tasks, 2024

work page 2024

[14] [14]

A bimanual manipu- lation taxonomy.IEEE Robotics and Automation Letters, 7(4):11031–11038, 2022

Franziska Krebs and Tamim Asfour. A bimanual manipu- lation taxonomy.IEEE Robotics and Automation Letters, 7(4):11031–11038, 2022

work page 2022

[15] [15]

Da 2 dataset: Toward dexterity-aware dual-arm grasp- ing.IEEE Robotics and Automation Letters, 7(4):8941–8948, 2022

Guangyao Zhai, Yu Zheng, Ziwei Xu, Xin Kong, Yong Liu, Benjamin Busam, Yi Ren, Nassir Navab, and Zhengyou Zhang. Da 2 dataset: Toward dexterity-aware dual-arm grasp- ing.IEEE Robotics and Automation Letters, 7(4):8941–8948, 2022

work page 2022

[16] [16]

Dg16m: A large-scale dataset for dual-arm grasping with force-optimized grasps.arXiv preprint arXiv:2503.08358, 2025

Md Faizal Karim, Mohammed Saad Hashmi, Shreya Bol- limuntha, Mahesh Reddy Tapeti, Gaurav Singh, Naga- manikandan Govindan, and K Madhava Krishna. Dg16m: A large-scale dataset for dual-arm grasping with force-optimized grasps.arXiv preprint arXiv:2503.08358, 2025

work page arXiv 2025

[17] [17]

Push-to-see: Learning non-prehensile manipulation to enhance instance segmentation via deep q-learning

Baris Serhan, Harit Pandya, Ayse Kucukyilmaz, and Gerhard Neumann. Push-to-see: Learning non-prehensile manipulation to enhance instance segmentation via deep q-learning. In 2022 International Conference on Robotics and Automation (ICRA), pages 1513–1519, 2022

work page 2022

[18] [18]

Dynamic in-hand sliding manipulation.IEEE Transactions on Robotics, 33(4):778–795, 2017

Jian Shi, J Zachary Woodruff, Paul B Umbanhowar, and Kevin M Lynch. Dynamic in-hand sliding manipulation.IEEE Transactions on Robotics, 33(4):778–795, 2017

work page 2017

[19] [19]

Two arms are better than one: A behavior based control system for assistive bi- manual manipulation

Aaron Edsinger and Charles C Kemp. Two arms are better than one: A behavior based control system for assistive bi- manual manipulation. InRecent Progress in Robotics: Viable Robotic Service to Human: An Edition of the Selected Papers from the 13th International Conference on Advanced Robotics, pages 345–355. Springer, 2007

work page 2007

[20] [20]

From single to dual-arm collaborative robotic assembly: A case study at i-labs

Simone Pantanetti, Federico Emiliani, Daniele Costa, Gia- como Palmieri, and Albin Bajrami. From single to dual-arm collaborative robotic assembly: A case study at i-labs. In2024 20th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA), pages 1–6, 2024

work page 2024

[21] [21]

Six-dof impedance control of dual-arm cooperative manipulators.IEEE/ASME Transactions on Mechatronics, 13(5):576–586, 2008

Fabrizio Caccavale, Pasquale Chiacchio, Alessandro Marino, and Luigi Villani. Six-dof impedance control of dual-arm cooperative manipulators.IEEE/ASME Transactions on Mechatronics, 13(5):576–586, 2008

work page 2008

[22] [22]

Relative impedance control for dual-arm robots performing asymmetric bimanual tasks.IEEE transactions on industrial electronics, 61(7):3786–3796, 2013

Jinoh Lee, Pyung Hun Chang, and Rodrigo S Jamisola. Relative impedance control for dual-arm robots performing asymmetric bimanual tasks.IEEE transactions on industrial electronics, 61(7):3786–3796, 2013

work page 2013

[23] [23]

Zhao, Vikash Kumar, Sergey Levine, and Chelsea Finn

Tony Z. Zhao, Vikash Kumar, Sergey Levine, and Chelsea Finn. Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware. InProceedings of Robotics: Science and Systems, Daegu, Republic of Korea, July 2023

work page 2023

[24] [24]

Bi-vla: Vision-language-action model-based system for bimanual robotic dexterous manipulations.arXiv preprint arXiv:2405.06039, 2024

Koffivi Fid` ele Gbagbe, Miguel Altamirano Cabrera, Ali Al- abbas, Oussama Alyunes, Artem Lykov, and Dzmitry Tset- serukou. Bi-vla: Vision-language-action model-based system for bimanual robotic dexterous manipulations.arXiv preprint arXiv:2405.06039, 2024

work page arXiv 2024

[25] [25]

Towards human-level bimanual dexterous manipulation with reinforcement learning

Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuan Jiang, Zongqing Lu, Stephen McAleer, Hao Dong, Song-Chun Zhu, and Yaodong Yang. Towards human-level bimanual dexterous manipulation with reinforcement learning. Advances in Neural Information Processing Systems, 35:5150– 5163, 2022

work page 2022

[26] [26]

Da-vil: Adaptive dual-arm manipulation with rein- forcement learning and variable impedance control

Md Faizal Karim, Shreya Bollimuntha, Mohammed Saad Hashmi, Autrio Das, Gaurav Singh, Srinath Sridhar, Arun Ku- mar Singh, Nagamanikandan Govindan, and K Madhava Kr- ishna. Da-vil: Adaptive dual-arm manipulation with rein- forcement learning and variable impedance control. In2025 IEEE International Conference on Robotics and Automation (ICRA), pages 11896–...

work page 2025

[27] [27]

Adaptive variable impedance control of dual-arm robots for slabstone installation.ISA transactions, 128:397–408, 2022

Heyu Hu and Jianfu Cao. Adaptive variable impedance control of dual-arm robots for slabstone installation.ISA transactions, 128:397–408, 2022

work page 2022

[28] [28]

Quadratic programming- based reference spreading control for dual-arm robotic manip- ulation with planned simultaneous impacts.IEEE Transac- tions on Robotics, 40:3341–3355, 2024

Jari van Steen, Gijs van den Brandt, Nathan van de Wouw, Jens Kober, and Alessandro Saccon. Quadratic programming- based reference spreading control for dual-arm robotic manip- ulation with planned simultaneous impacts.IEEE Transac- tions on Robotics, 40:3341–3355, 2024

work page 2024

[29] [29]

A non-prehensile object transportation framework with adaptive tilting based on quadratic programming.IEEE Robotics and Automation Letters, 8(6):3581–3588, 2023

Rajesh Subburaman, Mario Selvaggio, and Fabio Ruggiero. A non-prehensile object transportation framework with adaptive tilting based on quadratic programming.IEEE Robotics and Automation Letters, 8(6):3581–3588, 2023

work page 2023

[30] [30]

Armstrong-Helouvry

B. Armstrong-Helouvry. Stick-slip arising from stribeck fric- tion. InProceedings., IEEE International Conference on Robotics and Automation, pages 1377–1382 vol.2, 1990

work page 1990

[31] [31]

Stochastic motion under nonlinear friction representing shear thinning.Physical Review E, 108(6):064606, 2023

Theo Lequy and Andreas M Menzel. Stochastic motion under nonlinear friction representing shear thinning.Physical Review E, 108(6):064606, 2023

work page 2023

[32] [32]

Proximal Policy Optimization Algorithms

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Rad- ford, and Oleg Klimov. Proximal policy optimization algo- rithms.arXiv preprint arXiv:1707.06347, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[33] [33]

Casadi: a software framework for nonlinear optimization and optimal control.Mathematical Programming Computation, 11(1):1–36, 2019

Joel AE Andersson, Joris Gillis, Greg Horn, James B Rawl- ings, and Moritz Diehl. Casadi: a software framework for nonlinear optimization and optimal control.Mathematical Programming Computation, 11(1):1–36, 2019

work page 2019

[34] [34]

Prentice hall, 2010

Katsuhiko Ogata.Modern control engineering. Prentice hall, 2010

work page 2010

[35] [35]

john Wiley & sons, 2005

Sigurd Skogestad and Ian Postlethwaite.Multivariable feed- back control: analysis and design. john Wiley & sons, 2005

work page 2005