pith. machine review for the scientific record. sign in

arxiv: 2605.02699 · v1 · submitted 2026-05-04 · 💻 cs.RO · cs.AI· cs.CV· cs.LG

Recognition: 3 theorem links

· Lean Theorem

Learning Equivariant Neural-Augmented Object Dynamics From Few Interactions

Authors on Pith no claims yet

Pith reviewed 2026-05-08 18:09 UTC · model grok-4.3

classification 💻 cs.RO cs.AIcs.CVcs.LG
keywords object dynamicsgraph neural networksequivariant modelsspring-mass systemsrobotic manipulationdeformable objectsdata-efficient learningparticle-based models
0
0 comments X

The pith

A spring-mass analytical model paired with an equivariant graph neural network learns accurate dynamics for both rigid and deformable objects from limited real-world interactions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces PIEGraph to learn object dynamics for robotic manipulation in a data-efficient way. It pairs a spring-mass analytical model that keeps particle motion physically feasible with an equivariant graph neural network that uses symmetries in particle interactions plus a novel action representation to steer the analytical predictions. The combination targets both rigid bodies and deformable ones such as ropes, cloth, and stuffed animals. A reader would care because pure neural models often drift into unrealistic states over long horizons and require large datasets, while this hybrid keeps predictions grounded and supports downstream planning tasks like reorientation and repositioning. If the claim holds, robots could handle varied objects after only a handful of real interactions instead of needing extensive training.

Core claim

PIEGraph consists of a physically informed particle-based analytical model implemented as a spring-mass system to enforce physically feasible motion, and an equivariant graph neural network with a novel action representation that exploits symmetries in particle interactions to guide the analytical model. Evaluated in simulation and on robot hardware for reorientation and repositioning tasks with ropes, cloth, stuffed animals and rigid objects, the method produces accurate dynamics prediction and reliable downstream robotic manipulation planning while outperforming state of the art baselines using only limited real-world interaction data.

What carries the argument

PIEGraph, a hybrid that runs a spring-mass analytical model to enforce feasible particle motion while an equivariant graph neural network with novel action representation uses interaction symmetries to correct and guide the analytical component for sets of particles representing objects.

If this is right

  • Dynamics predictions remain physically realistic and accurate over extended time horizons for both rigid and deformable objects.
  • Robotic planning succeeds at higher rates on reorientation and repositioning tasks than with existing baselines.
  • Only limited real-world interaction data is needed to train effective models across multiple object types.
  • The same architecture works in both simulation and on physical robot hardware without additional adaptation.
  • Physical feasibility constraints from the analytical model prevent drift that pure neural predictors typically show.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The symmetry handling in the graph network could extend to multi-object scenes if the particle graph is updated to include inter-object edges.
  • Learning directly from few real trials may reduce dependence on simulation-to-real techniques that currently dominate deformable-object robotics.
  • Tuning the spring-mass parameters separately could let the framework apply to related domains such as granular or fluid-like materials.

Load-bearing premise

The spring-mass analytical model together with the equivariant network and new action representation can capture the true dynamics of both rigid and deformable objects from sparse real interaction data while keeping predictions stable over long horizons.

What would settle it

Measure particle trajectory error on real robot hardware for a cloth or rope during repeated manipulation sequences; rapid growth in error beyond a few steps or lower task success rates than baselines would show the hybrid model does not deliver the claimed accuracy.

Figures

Figures reproduced from arXiv: 2605.02699 by Brandon May, George Konidaris, Laura Herlant, Sergio Orozco, Tushar Kusnur.

Figure 1
Figure 1. Figure 1: General Overview. We guide physics models toward particle￾based neural outputs to guarantee physical plausibility and realistic object motion over long horizons. Predicting future states from data is challenging because for many objects and interactions, the physical process is complex and requires a high-dimensional computation. We build on prior work which has identified possible options for representing… view at source ↗
Figure 2
Figure 2. Figure 2: Symmetry and Action Canoncialization (a) view at source ↗
Figure 3
Figure 3. Figure 3: Illustrative system diagram—training: We train an action-conditioned equivariant graph dynamics model (E1) using MSE loss (F) from human interaction data captured as an RGBD video. We initialize (C1) a spring mass system from an object point cloud at time t = 0 and track it (D) over multiple actions. We track the human’s hands at each time step to construct a representation of the action (B). III. RELATED … view at source ↗
Figure 4
Figure 4. Figure 4: Illustrative system diagram—planning. We use a learned equivariant action-conditioned graph dynamics model (E2) to guide (H) a spring mass system constructed (C2) from an initial point cloud of an object at time t = 0. This guidance process is used to plan (G) for robot actions that reach a specified goal configuration (J) implemented as a point cloud view at source ↗
Figure 5
Figure 5. Figure 5: Simulated Planning Results (Quantitative). view at source ↗
Figure 6
Figure 6. Figure 6: Simulated Planning Results (Qualitative). view at source ↗
Figure 7
Figure 7. Figure 7: Robot Planning Results. For each object, we plan action sequences to reach a goal configuration implemented as a point cloud. We plan using MPC for 3 separate goals and repeat each experiment 3 times. On the left, we hand-selected qualitative planning results. On the right, we visualize the chamfer distance over time (up to 13 pushes) and task success with varying goal thresholds. We also display the 40th … view at source ↗
Figure 8
Figure 8. Figure 8: Robot Planning with Different Data Fidelities. view at source ↗
read the original abstract

Learning data-efficient object dynamics models for robotic manipulation remains challenging, especially for deformable objects. A popular approach is to model objects as sets of 3D particles and learn their motion using graph neural networks. In practice, this is not enough to maintain physical feasibility over long horizons and may require large amounts of interaction data to learn. We introduce PIEGraph, a novel approach to combining analytical physics and data-driven models to capture object dynamics for both rigid and deformable bodies using limited real-world interaction data. PIEGraph consists of two components: (1) a \textbf{P}hysically \textbf{I}nformed particle-based analytical model (implemented as a spring--mass system) to enforce physically feasible motion, and (2) an \textbf{E}quivariant \textbf{Graph} Neural Network with a novel action representation that exploits symmetries in particle interactions to guide the analytical model. We evaluate PIEGraph in simulation and on robot hardware for reorientation and repositioning tasks with ropes, cloth, stuffed animals and rigid objects. We show that our method enables accurate dynamics prediction and reliable downstream robotic manipulation planning, which outperforms state of the art baselines.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper proposes PIEGraph, a hybrid approach that augments a particle-based spring-mass analytical model (PIE) with an equivariant graph neural network and a novel action representation to learn object dynamics for both rigid and deformable bodies from limited real-world interaction data. It evaluates the method in simulation and on physical robot hardware for reorientation and repositioning tasks involving ropes, cloth, stuffed animals, and rigid objects, claiming more accurate long-horizon dynamics prediction and better downstream manipulation planning than state-of-the-art baselines.

Significance. If the hybrid construction reliably enforces physical feasibility across object types while remaining data-efficient, the work would advance data-efficient dynamics modeling in robotics, particularly for deformable objects where pure GNN approaches often fail over long horizons. The explicit use of an analytical prior plus equivariant symmetries is a concrete strength that could reduce reliance on large interaction datasets.

major comments (3)
  1. [§3.1] §3.1 (PIE analytical model): The central claim that a single spring-mass formulation enforces physical feasibility for both rigid and deformable objects is load-bearing, yet the description provides no explicit mechanism (e.g., stiffness scheduling, additional rigidity constraints, or regime-switching) for handling rigid bodies. Without this, the model risks either numerical stiffness or the GNN having to cancel large residual forces, undermining the “few interactions” regime.
  2. [§5] §5 (experimental results): The abstract and results sections assert outperformance on simulation and hardware tasks, but report no error bars, number of random seeds/trials, data-exclusion criteria, or long-horizon metrics (e.g., rollout error at 50+ steps). This leaves the quantitative superiority claim only partially supported and difficult to reproduce or compare.
  3. [§4.2] §4.2 (action representation): The novel action representation is presented as key to guiding the analytical model, but the paper does not include an ablation that isolates its contribution versus a standard particle-velocity input; without it, the necessity of the invented component for the reported gains cannot be assessed.
minor comments (2)
  1. Notation for particle states and edge features is introduced without a consolidated table; adding one would improve readability.
  2. Figure captions for hardware experiments should explicitly state the number of real-world interactions used per object type.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their constructive and detailed feedback. We address each major comment below, indicating where revisions will be made to strengthen the manuscript.

read point-by-point responses
  1. Referee: [§3.1] §3.1 (PIE analytical model): The central claim that a single spring-mass formulation enforces physical feasibility for both rigid and deformable objects is load-bearing, yet the description provides no explicit mechanism (e.g., stiffness scheduling, additional rigidity constraints, or regime-switching) for handling rigid bodies. Without this, the model risks either numerical stiffness or the GNN having to cancel large residual forces, undermining the “few interactions” regime.

    Authors: We appreciate this observation on the core modeling choice. The PIE component uses a single spring-mass formulation in which rigid-body behavior is approximated by assigning high stiffness coefficients to inter-particle springs (determined from object category and mass properties), while lower values permit deformation; the equivariant GNN supplies learned residual forces that keep trajectories physically plausible. We acknowledge that the manuscript does not explicitly describe the stiffness-selection procedure or safeguards against numerical stiffness. In the revised version we will expand §3.1 with this parameterization, including how stiffness values are set from object metadata and any clipping or damping applied to prevent instability. revision: yes

  2. Referee: [§5] §5 (experimental results): The abstract and results sections assert outperformance on simulation and hardware tasks, but report no error bars, number of random seeds/trials, data-exclusion criteria, or long-horizon metrics (e.g., rollout error at 50+ steps). This leaves the quantitative superiority claim only partially supported and difficult to reproduce or compare.

    Authors: We agree that the quantitative claims would be more robust with additional statistical detail. The original experiments were run with multiple random seeds and trials, yet these were not fully reported. We will revise §5 (and the corresponding tables/figures) to include error bars computed over at least five random seeds, state the exact number of trials per condition, clarify data-exclusion criteria, and add long-horizon rollout metrics (MSE at 50 and 100 steps) for both simulation and real-robot experiments. revision: yes

  3. Referee: [§4.2] §4.2 (action representation): The novel action representation is presented as key to guiding the analytical model, but the paper does not include an ablation that isolates its contribution versus a standard particle-velocity input; without it, the necessity of the invented component for the reported gains cannot be assessed.

    Authors: Thank you for this suggestion. The action representation encodes robot actions in a symmetry-equivariant particle-centric form to better couple with the analytical prior. While the full system outperforms baselines, we did not isolate this component. We will add an ablation study in the revised manuscript that directly compares the proposed representation against a standard particle-velocity input, quantifying its contribution to prediction accuracy and planning success. revision: yes

Circularity Check

0 steps flagged

No circularity: hybrid model grounds predictions in external analytical prior and independent data.

full rationale

The paper constructs PIEGraph by combining a fixed spring-mass analytical model (to enforce physical feasibility) with a separately trained equivariant GNN (to learn data-driven corrections from limited interaction data). No step reduces a claimed prediction to a fitted parameter by construction, nor does any load-bearing claim rely on a self-citation chain whose validity is internal to the paper. The analytical component is imported as an external physics prior rather than derived from the GNN outputs, and evaluation occurs on held-out simulation and hardware tasks. This satisfies the criteria for a self-contained derivation.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 1 invented entities

The central claim rests on modeling objects as particle sets connected by springs for physical feasibility and on the existence of exploitable symmetries in particle interactions that the GNN can learn from limited data.

free parameters (1)
  • spring stiffness and damping coefficients
    The analytical spring-mass component requires these parameters to enforce feasible motion; they are not derived from first principles in the abstract.
axioms (2)
  • domain assumption Object dynamics can be approximated by a spring-mass system on 3D particles while preserving physical feasibility over long horizons.
    Invoked to address limitations of pure data-driven models.
  • domain assumption Symmetries exist in particle interactions that can be exploited by an equivariant network.
    Used to justify the GNN component and novel action representation.
invented entities (1)
  • novel action representation no independent evidence
    purpose: To guide the analytical model by exploiting symmetries in particle interactions.
    Introduced as part of the equivariant GNN component.

pith-pipeline@v0.9.0 · 5518 in / 1375 out tokens · 53364 ms · 2026-05-08T18:09:19.017390+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

40 extracted references · 2 canonical work pages · 1 internal anchor

  1. [1]

    A review of learning-based dynam- ics models for robotic manipulation.Science Robotics, 10(106):eadt1497, 2025

    Bo Ai, Stephen Tian, Haochen Shi, Yixuan Wang, Tobias Pfaff, Cheston Tan, Henrik I Christensen, Hao Su, Jiajun Wu, and Yunzhu Li. A review of learning-based dynam- ics models for robotic manipulation.Science Robotics, 10(106):eadt1497, 2025

  2. [2]

    Combining physical simulators and object- based networks for control

    Anurag Ajay, Maria Bauza, Jiajun Wu, Nima Fazeli, Joshua B Tenenbaum, Alberto Rodriguez, and Leslie P Kaelbling. Combining physical simulators and object- based networks for control. In2019 International Confer- ence on Robotics and Automation (ICRA), pages 3217–

  3. [3]

    Victor Blomqvist. Pymunk. URL https://pymunk.org

  4. [4]

    Hesselink, Elise van der Pol, Erik J

    Johannes Brandstetter, Rob D. Hesselink, Elise van der Pol, Erik J. Bekkers, and Max Welling. Geometric and physical quantities improve e(3) equivariant message passing.International Conference on Learning Represen- tations (ICLR), 2021

  5. [5]

    Daxbench: Benchmarking deformable object manipulation with dif- ferentiable physics

    Siwei Chen*, Yiqing Xu*, Cunjun Yu*, Linfeng Li, Xiao Ma, Zhongwen Xu, and David Hsu. Daxbench: Benchmarking deformable object manipulation with dif- ferentiable physics. InICLR, 2023

  6. [6]

    The cross-entropy method: A uni- fied approach to combinatorial optimization, monte-carlo simulation, and machine learning.Technometrics, 48(1): 147–148, 2006

    Lih-Yuan Deng. The cross-entropy method: A uni- fied approach to combinatorial optimization, monte-carlo simulation, and machine learning.Technometrics, 48(1): 147–148, 2006

  7. [7]

    Sim-to-real of soft robots with learned residual physics.IEEE Robotics and Automation Letters, 2024

    Junpeng Gao, Mike Y Michelis, Andrew Spielberg, and Robert K Katzschmann. Sim-to-real of soft robots with learned residual physics.IEEE Robotics and Automation Letters, 2024

  8. [8]

    Learning latent dynamics for planning from pixels

    Danijar Hafner, Timothy Lillicrap, Ian Fischer, Ruben Villegas, David Ha, Honglak Lee, and James Davidson. Learning latent dynamics for planning from pixels. In Kamalika Chaudhuri and Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Ma- chine Learning, volume 97 ofProceedings of Machine Learning Research, pages 2555–2565....

  9. [9]

    URL https://proceedings.mlr.press/v97/hafner19a. html

  10. [10]

    Mastering diverse domains through world models.Nature, 2023

    Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, and Timo- thy Lillicrap. Mastering diverse domains through world models.Nature, 2023

  11. [11]

    Learning physical dynamics with subequivariant graph neural networks

    Jiaqi Han, Wenbing Huang, Hengbo Ma, Jiachen Li, Josh Tenenbaum, and Chuang Gan. Learning physical dynamics with subequivariant graph neural networks. In S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh, editors,Advances in Neural Informa- tion Processing Systems, volume 35, pages 26256–26268. Curran Associates, Inc., 2022

  12. [12]

    Mesh-based dynamics with occlusion reasoning for cloth manipula- tion

    Zixuan Huang, Xingyu Lin, and David Held. Mesh-based dynamics with occlusion reasoning for cloth manipula- tion. InRobotics: Science and Systems (RSS), 2022

  13. [13]

    gradsim: Differen- tiable simulation for system identification and visuo- motor control.International Conference on Learning Representations (ICLR), 2021

    Krishna Murthy Jatavallabhula, Miles Macklin, Florian Golemo, Vikram V oleti, Linda Petrini, Martin Weiss, Breandan Considine, Jerome Parent-Levesque, Kevin Xie, Kenny Erleben, Liam Paull, Florian Shkurti, Derek Nowrouzezahrai, and Sanja Fidler. gradsim: Differen- tiable simulation for system identification and visuo- motor control.International Conferenc...

  14. [14]

    PhysTwin: Physics- informed reconstruction and simulation of deformable objects from videos.ICCV, 2025

    Hanxiao Jiang, Hao-Yu Hsu, Kaifeng Zhang, Hsin-Ni Yu, Shenlong Wang, and Yunzhu Li. PhysTwin: Physics- informed reconstruction and simulation of deformable objects from videos.ICCV, 2025

  15. [15]

    3D gaussian splatting for real-time radiance field rendering.ACM Transactions on Graphics, 42(4), July 2023

    Bernhard Kerbl, Georgios Kopanas, Thomas Leimk ¨uhler, and George Drettakis. 3D gaussian splatting for real-time radiance field rendering.ACM Transactions on Graphics, 42(4), July 2023

  16. [16]

    Se (2)-equivariant pushing dynamics models for tabletop object manipulations

    Seungyeon Kim, Byeongdo Lim, Yonghyeon Lee, and Frank C Park. Se (2)-equivariant pushing dynamics models for tabletop object manipulations. InConference on Robot Learning, pages 427–436. PMLR, 2023

  17. [17]

    Segment Anything

    Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Doll ´ar, and Ross Girshick. Segment anything. arXiv:2304.02643, 2023

  18. [18]

    Context-aware dynamics model for generalization in model-based reinforcement learning

    Kimin Lee, Younggyo Seo, Seunghyun Lee, Honglak Lee, and Jinwoo Shin. Context-aware dynamics model for generalization in model-based reinforcement learning. InInternational Conference on Machine Learning, pages 5757–5766. PMLR, 2020

  19. [19]

    Learning particle dynamics for manipulating rigid bodies, deformable objects, and fluids

    Yunzhu Li, Jiajun Wu, Russ Tedrake, Joshua B Tenen- baum, and Antonio Torralba. Learning particle dynamics for manipulating rigid bodies, deformable objects, and fluids. InICLR, 2019

  20. [20]

    Propagation networks for model-based control under partial observa- tion

    Yunzhu Li, Jiajun Wu, Jun-Yan Zhu, Joshua B Tenen- baum, Antonio Torralba, and Russ Tedrake. Propagation networks for model-based control under partial observa- tion. InICRA, 2019

  21. [21]

    Fei Liu, Entong Su, Jingpei Lu, Mingen Li, and Michael C. Yip. Robotic manipulation of deformable rope-like objects using differentiable compliant position- based dynamics.IEEE Robotics and Automation Letters, 8(7):3964–3971, 2023

  22. [22]

    Soft- MAC: Differentiable soft body simulation with forecast- based contact model and two-way coupling with articu- lated rigid bodies and clothes

    Min Liu, Gang Yang, Siyuan Luo, and Lin Shao. Soft- MAC: Differentiable soft body simulation with forecast- based contact model and two-way coupling with articu- lated rigid bodies and clothes. InIEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024

  23. [23]

    Warp: A high-performance python frame- work for gpu simulation and graphics, March 2022

    Miles Macklin. Warp: A high-performance python frame- work for gpu simulation and graphics, March 2022. NVIDIA GPU Technology Conference (GTC)

  24. [24]

    Focused adaptation of dynamics mod- els for deformable object manipulation

    Peter Mitrano, Alex LaGrassa, Oliver Kroemer, and Dmitry Berenson. Focused adaptation of dynamics mod- els for deformable object manipulation. In2023 IEEE International Conference on Robotics and Automation (ICRA), pages 5931–5937, 2023

  25. [25]

    Hierarchical foresight: Self- supervised learning of long-horizon tasks via visual sub- goal generation.International Conference on Learning Representations (ICLR), 2019

    Suraj Nair and Chelsea Finn. Hierarchical foresight: Self- supervised learning of long-horizon tasks via visual sub- goal generation.International Conference on Learning Representations (ICLR), 2019

  26. [26]

    Learning to simulate complex physics with graph net- works

    Alvaro Sanchez-Gonzalez, Jonathan Godwin, Tobias Pfaff, Rex Ying, Jure Leskovec, and Peter Battaglia. Learning to simulate complex physics with graph net- works. InInternational conference on machine learning, pages 8459–8468. PMLR, 2020

  27. [27]

    E (n) equivariant graph neural networks

    Vıctor Garcia Satorras, Emiel Hoogeboom, and Max Welling. E (n) equivariant graph neural networks. In International conference on machine learning, pages 9323–9332. PMLR, 2021

  28. [28]

    The graph neural network model.IEEE transactions on neural networks, 20(1):61–80, 2008

    Franco Scarselli, Marco Gori, Ah Chung Tsoi, Markus Hagenbuchner, and Gabriele Monfardini. The graph neural network model.IEEE transactions on neural networks, 20(1):61–80, 2008

  29. [29]

    Pugs: Zero-shot physi- cal understanding with gaussian splatting.2025 IEEE International Conference on Robotics and Automation (ICRA), 2025

    Yinghao Shuai, Ran Yu, Yuantao Chen, Zijian Jiang, Xiaowei Song, Nan Wang, Jv Zheng, Jianzhu Ma, Meng Yang, Zhicheng Wang, et al. Pugs: Zero-shot physi- cal understanding with gaussian splatting.2025 IEEE International Conference on Robotics and Automation (ICRA), 2025

  30. [30]

    MIT press Cam- bridge, 1998

    Richard S Sutton, Andrew G Barto, et al.Reinforcement learning: An introduction, volume 1. MIT press Cam- bridge, 1998

  31. [31]

    Mediapipe hands: On-device real- time hand tracking

    Andrey Vakunov, Chuo-Ling Chang, Fan Zhang, George Sung, Matthias Grundmann, and Valentin Bazarevsky. Mediapipe hands: On-device real- time hand tracking. InCV4ARVR, 2020. https://mixedreality.cs.cornell.edu/workshop

  32. [32]

    Offline-online learning of deformation model for cable manipulation with graph neural networks.IEEE Robotics and Automation Letters, 7(2):5544–5551, 2022

    Changhao Wang, Yuyou Zhang, Xiang Zhang, Zheng Wu, Xinghao Zhu, Shiyu Jin, Te Tang, and Masayoshi Tomizuka. Offline-online learning of deformation model for cable manipulation with graph neural networks.IEEE Robotics and Automation Letters, 7(2):5544–5551, 2022

  33. [33]

    Equivariant $q$ learning in spatial action spaces

    Dian Wang, Robin Walters, Xupeng Zhu, and Robert Platt. Equivariant $q$ learning in spatial action spaces. In 5th Annual Conference on Robot Learning, 2021. URL https://openreview.net/forum?id=IScz42A3iCI

  34. [34]

    arXiv preprint arXiv:2211.02222 , year=

    Kenny Young, Aditya Ramesh, Louis Kirsch, and J ¨urgen Schmidhuber. The benefits of model-based gener- alization in reinforcement learning.arXiv preprint arXiv:2211.02222, 2022

  35. [35]

    Tossingbot: Learning to throw arbitrary objects with residual physics

    Andy Zeng, Shuran Song, Johnny Lee, Alberto Ro- driguez, and Thomas Funkhouser. Tossingbot: Learning to throw arbitrary objects with residual physics. In Robotics: Science and Systems (RSS), 2019

  36. [36]

    Adaptigraph: Material-adaptive graph-based neural dynamics for robotic manipulation

    Kaifeng Zhang, Baoyu Li, Kris Hauser, and Yunzhu Li. Adaptigraph: Material-adaptive graph-based neural dynamics for robotic manipulation. InProceedings of Robotics: Science and Systems (RSS), 2024

  37. [37]

    Particle-grid neural dynamics for learning deformable object models from rgb-d videos

    Kaifeng Zhang, Baoyu Li, Kris Hauser, and Yunzhu Li. Particle-grid neural dynamics for learning deformable object models from rgb-d videos. InProceedings of Robotics: Science and Systems (RSS), 2025

  38. [38]

    Dy- namic 3d gaussian tracking for graph-based neural dy- namics modeling

    Mingtong Zhang, Kaifeng Zhang, and Yunzhu Li. Dy- namic 3d gaussian tracking for graph-based neural dy- namics modeling. In8th Annual Conference on Robot Learning, 2024

  39. [39]

    Reconstruction and simulation of elastic objects with spring-mass 3D gaussians

    Licheng Zhong, Hong-Xing Yu, Jiajun Wu, and Yunzhu Li. Reconstruction and simulation of elastic objects with spring-mass 3D gaussians. InEuropean Conference on Computer Vision, pages 407–423. Springer, 2024. VIII. APPENDIX A. Invariant Action Space Letxbe our input state,sbe our initial end-effector position, andebe our final end-effector position. We nee...

  40. [40]

    We need to prove the following equivalence a=R −(atan2(e−s)+2π)(x−e) =R −(atan2(Rθe+g−(Rθs+g)+2π)(Rθx+g−(R θe+g))

    Proof:Let’s define our action like so: a=R −(atan2(e−s)+2π)(x−e). We need to prove the following equivalence a=R −(atan2(e−s)+2π)(x−e) =R −(atan2(Rθe+g−(Rθs+g)+2π)(Rθx+g−(R θe+g)). We begin by simplifying, a=R −(atan2(Rθ(v))+2π)(Rθ(x−e)), wherev=e−s. We show thatatan2(R θ(v)) =θ+atan2(v)by first convert- ingvinto polar coordinates like so: v=r. cos(ϕ) sin...