OA-WAM uses persistent address vectors and dynamic content vectors in object slots to enable addressable world-action prediction, improving robustness on manipulation benchmarks under scene changes.
Slotformer: Unsupervised visual dynamics simulation with object-centric models
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
background 1polarities
background 1representative citing papers
UniSim learns a universal real-world simulator from orchestrated diverse datasets, enabling zero-shot deployment of policies trained purely in simulation.
ChronoSC projects video temporal dynamics into a compact chrono-image via color stacking, transmits it with lightweight DeepJSCC, reconstructs explicitly, and applies a pre-trained BLIP model for VideoQA answers, delivering 192x bandwidth savings on CLEVRER.
citing papers explorer
-
OA-WAM: Object-Addressable World Action Model for Robust Robot Manipulation
OA-WAM uses persistent address vectors and dynamic content vectors in object slots to enable addressable world-action prediction, improving robustness on manipulation benchmarks under scene changes.
-
Learning Interactive Real-World Simulators
UniSim learns a universal real-world simulator from orchestrated diverse datasets, enabling zero-shot deployment of policies trained purely in simulation.
-
ChronoSC: Task-Oriented Semantic Communication via Temporal-to-Color Encoding
ChronoSC projects video temporal dynamics into a compact chrono-image via color stacking, transmits it with lightweight DeepJSCC, reconstructs explicitly, and applies a pre-trained BLIP model for VideoQA answers, delivering 192x bandwidth savings on CLEVRER.