A Compositional Object-Based Approach to Learning Physical Dynamics

Antonio Torralba; Joshua B. Tenenbaum; Michael B. Chang; Tomer Ullman

arxiv: 1612.00341 · v2 · pith:7Q5MI6E6new · submitted 2016-12-01 · 💻 cs.AI · cs.LG

A Compositional Object-Based Approach to Learning Physical Dynamics

Michael B. Chang , Tomer Ullman , Antonio Torralba , Joshua B. Tenenbaum This is my paper

classification 💻 cs.AI cs.LG

keywords dynamicsobjectcompositionaldifferentinteractionsneuralphysicalphysics

0 comments

read the original abstract

We present the Neural Physics Engine (NPE), a framework for learning simulators of intuitive physics that naturally generalize across variable object count and different scene configurations. We propose a factorization of a physical scene into composable object-based representations and a neural network architecture whose compositional structure factorizes object dynamics into pairwise interactions. Like a symbolic physics engine, the NPE is endowed with generic notions of objects and their interactions; realized as a neural network, it can be trained via stochastic gradient descent to adapt to specific object properties and dynamics of different worlds. We evaluate the efficacy of our approach on simple rigid body dynamics in two-dimensional worlds. By comparing to less structured architectures, we show that the NPE's compositional representation of the structure in physical interactions improves its ability to predict movement, generalize across variable object count and different scene configurations, and infer latent properties of objects such as mass.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 9 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Do generative video models understand physical principles?
cs.CV 2025-01 unverdicted novelty 8.0

Physics-IQ benchmark reveals that generative video models exhibit limited physical understanding unrelated to their visual quality.
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
cs.RO 2024-09 conditional novelty 7.0

ReKep encodes robotic tasks as optimizable Python functions over 3D keypoints that are generated automatically from language and RGB-D input, enabling real-time hierarchical planning on single- and dual-arm platforms ...
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
cs.RO 2023-07 unverdicted novelty 7.0

VoxPoser uses LLMs to compose 3D value maps via VLM interaction for model-based synthesis of robust robot trajectories on open-set language-specified manipulation tasks.
RigidFormer: Learning Rigid Dynamics using Transformers
cs.CV 2026-05 unverdicted novelty 6.0

RigidFormer learns mesh-free rigid dynamics from point clouds using object-centric anchors, Anchor-Vertex Pooling, Anchor-based RoPE, and differentiable Kabsch alignment to enforce rigidity.
Ensemble Distributionally Robust Bayesian Optimisation
cs.LG 2026-05 unverdicted novelty 6.0

A tractable ensemble distributionally robust Bayesian optimization method achieves improved sublinear regret bounds under context uncertainty.
Learning to Theorize the World from Observation
cs.LG 2026-05 unverdicted novelty 6.0

NEO induces compositional latent programs as world theories from observations and executes them to enable explanation-driven generalization.
Order Matters: Shuffling Sequence Generation for Video Prediction
cs.CV 2019-07 unverdicted novelty 6.0

SEE-Net improves video prediction by using frame shuffling to enforce learning of natural temporal order, reporting state-of-the-art results on three synthetic and real-world datasets.
Rapid training of Hamiltonian graph networks using random features
cs.LG 2025-06 unverdicted novelty 5.0

Hamiltonian Graph Networks achieve 150-600x faster training via random feature parameter construction while retaining comparable accuracy and physical invariances on N-body systems up to 10,000 particles.
Intrinsic Motivation Driven Intuitive Physics Learning using Deep Reinforcement Learning with Intrinsic Reward Normalization
cs.LG 2019-07 unverdicted novelty 5.0

Graphical physics network integrated with DRL and intrinsic reward normalization lets an agent improve its intuitive physics model via intrinsic motivation in stationary and non-stationary 3D environments.