pith. sign in

arxiv: 2605.18872 · v1 · pith:26V5AMYWnew · submitted 2026-05-15 · 💻 cs.LG · cs.AI· cs.RO

EUPHORIA: Efficient Universal Planning via Hybrid Optimization for Robust Industrial Robotic Assembly

Pith reviewed 2026-05-20 19:57 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.RO
keywords robotic assemblyfew-shot adaptationmeta-learningphysics-informed planninggraph hypernetworkshybrid optimizationindustrial roboticssim-to-real
0
0 comments X

The pith

EUPHORIA uses graph hypernetworks and physics-biased attention to adapt robotic assembly plans to new complex geometries from few examples without retraining.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces EUPHORIA to solve the problem that existing robotic assembly planners are either specialized to one design or inefficient because they separate structure sequencing from motion planning. It proposes a Meta-Geometric Encoder that uses Graph Hypernetworks to create policy parameters directly from a small set of example geometries. A Physics-Informed Graph Transformer then applies attention modulated by contact forces from particle simulations to prioritize load-bearing steps. The system adds energy penalties during sequencing and a final differentiable correction step to improve real-world stability. If these pieces work together, planners could handle varied architectural forms like arches or domes with far less setup time and lower power draw.

Core claim

EUPHORIA achieves universal few-shot adaptability and dynamic efficiency through a hybrid optimization strategy. The Meta-Geometric Encoder based on Graph Hypernetworks dynamically generates policy parameters from a minimal support set, enabling parameter-level adaptation to complex topologies without gradient-based retraining. The Physics-Informed Graph Transformer trained via Soft Actor-Critic uses a Physics-Bias Attention mechanism that modulates attention scores with contact forces from Discrete Element Model simulations. Kinematics-Aware Sequencing penalizes high-energy transitions, and Residual Stability Correction fine-tunes coarse actions by minimizing a joint energy-stability cost.

What carries the argument

Meta-Geometric Encoder based on Graph Hypernetworks that dynamically generates policy parameters from a minimal support set for parameter-level adaptation to new topologies.

If this is right

  • Energy consumption drops compared with planners that treat sequencing and motion as separate stages.
  • Success rates reach state-of-the-art levels on previously unseen non-standard geometries using only minimal few-shot examples.
  • New designs can be handled without running gradient updates or collecting large retraining datasets.
  • Coarse simulation plans can be corrected on the fly to reduce sim-to-real failures before physical execution.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same hypernetwork-plus-physics-attention pattern could be tested on other contact-rich robotic tasks such as furniture assembly or debris sorting.
  • Replacing the discrete-element simulator with faster analytic contact models might trade some accuracy for speed in time-critical settings.
  • Collecting the support set on-site from a few manual demonstrations could let the planner adapt to site-specific materials without central retraining.
  • Combining the residual correction layer with real-time force sensing might further close the gap when material properties vary from simulation.

Load-bearing premise

The hypernetwork can produce working policy parameters for unseen complex shapes from only a few examples, and contact forces from particle simulations accurately point to the connections that matter for structural integrity.

What would settle it

Testing the system on a novel irregular geometry when the few-shot support set contains only standard shapes and measuring whether assembly success rate falls below that of a fully retrained baseline.

Figures

Figures reproduced from arXiv: 2605.18872 by Bing-Yu Chen, Chia-Ching Yen, Peter Yichen Chen, Shih-Yu Lai, Yang-Ting Shen, Yu-Lun Liu.

Figure 1
Figure 1. Figure 1: We present Euphoria, a robotic assembly system that closes the gap between parametric architectural design and physical construction. Unlike prior learning-based planners that require per-geometry retraining and decouple sequencing from motion, a designer-authored brick CAD with contact interfaces and DEM-derived forces (left) is realized by a mobile manipulator (right) via a physics-informed and design-aw… view at source ↗
Figure 2
Figure 2. Figure 2: The bricks’ graph modeling in supported case, un [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: EUPHORIA architecture. To overcome retraining for customized designs and the operational inefficiency of brick sequencing and motion planning, we unify CAD/CAM with robotic assembly via five stages: (1) Physics-Augmented Graph Construction converts a CAD design into a Physio-Geometric graph via a DEM solver with contrastive/CAD sampling; (2) a Meta-Geometric Encoder (Siamese + Graph Hypernetwork) generates… view at source ↗
Figure 4
Figure 4. Figure 4: Topological Generalization. Visual comparison be [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Dataset Taxonomy and Graph Representation. We curate a diverse parametric CAD models categorized into four [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: System Validation. (1) Ablations: Comparing w/ and w/o Human Priors shows that lacking priors results in unsupported, [PITH_FULL_IMAGE:figures/full_fig_p009_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Analysis of Optimization Dynamics and Energy Efficiency. [PITH_FULL_IMAGE:figures/full_fig_p010_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Training and Evaluation.(a) Universal Training Curves: The overall learning progress of the universal policy on the mixed-design dataset (excluding evaluation dataset), showing mean episode reward ± standard deviation over 200 epochs. EUPHORIA (Ours) demonstrates faster convergence and higher final rewards compared to baselines, validating the efficiency of its hybrid optimization strategy. Ablating Human … view at source ↗
Figure 10
Figure 10. Figure 10: Physio-Geometric Graph Construction Pipeline. [PITH_FULL_IMAGE:figures/full_fig_p016_10.png] view at source ↗
read the original abstract

Robotic assembly in architectural construction faces a persistent bottleneck: existing planners are either highly specialized, requiring prohibitive retraining for every new geometric design, or operationally inefficient, treating structural sequencing and kinematic motion as disjoint processes. We present EUPHORIA, a unified framework that achieves universal few-shot adaptability and dynamic efficiency through a hybrid optimization strategy. To overcome the retraining bottleneck, we propose a Meta-Geometric Encoder based on Graph Hypernetworks: unlike standard contrastive learning, which performs only feature-level recognition, our hypernetwork dynamically generates policy parameters from a minimal support set, enabling parameter-level adaptation to complex topologies (e.g., domes, arches) without gradient-based retraining. For structural reasoning, we introduce a Physics-Informed Graph Transformer trained via Soft Actor-Critic (SAC), with a Physics-Bias Attention mechanism that modulates attention scores using contact forces from Discrete Element Model (DEM) simulations, guiding the planner toward structurally critical connections. We further ensure operational efficiency through Kinematics-Aware Sequencing, where the SAC objective penalizes high-energy transitions. Finally, we bridge the Sim2Real gap via Residual Stability Correction, a differentiable optimization layer that fine-tunes coarse assembly actions by minimizing a joint energy-stability cost prior to execution. Experiments show that EUPHORIA significantly reduces energy consumption over decoupled baselines and achieves state-of-the-art success rates on unseen, non-standard geometries with minimal few-shot examples, fusing meta-learning, physics-informed attention, and residual optimization into a cohesive, generalized planner.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes EUPHORIA, a unified framework for robotic assembly planning in architectural construction. It integrates a Meta-Geometric Encoder based on Graph Hypernetworks to dynamically generate policy parameters from a minimal support set for few-shot adaptation to complex topologies without retraining; a Physics-Informed Graph Transformer trained with Soft Actor-Critic that incorporates a Physics-Bias Attention mechanism modulating scores via contact forces from DEM simulations; Kinematics-Aware Sequencing that penalizes high-energy transitions; and a Residual Stability Correction layer for sim-to-real fine-tuning via joint energy-stability optimization. The central claims are universal few-shot adaptability, significant energy reduction over decoupled baselines, and state-of-the-art success rates on unseen non-standard geometries.

Significance. If the empirical claims are substantiated with rigorous quantitative validation, the work could meaningfully advance industrial robotics by addressing the retraining bottleneck and operational inefficiency of existing planners through a hybrid meta-learning and physics-informed optimization approach. The parameter-level adaptation via hypernetworks and the integration of simulation-derived structural priors represent potentially useful directions if the components prove robust beyond the described simulation settings.

major comments (2)
  1. [Abstract] Abstract: the claims of 'significantly reduces energy consumption over decoupled baselines' and 'state-of-the-art success rates on unseen, non-standard geometries with minimal few-shot examples' are presented without any quantitative results, specific baseline methods, error bars, number of trials, dataset details, or ablation studies. This absence prevents assessment of whether the hybrid strategy delivers the asserted gains in adaptability and efficiency.
  2. [Physics-Bias Attention mechanism] Physics-Bias Attention mechanism: the mechanism modulates attention scores using contact forces from DEM simulations to identify structurally critical connections. DEM is formulated for granular/particulate media with many-body collisions and flow dynamics, whereas the target domain involves rigid polyhedral parts under precise kinematic constraints and static stability. Without explicit verification that the extracted forces correspond to actual contact normals or torques in the assembly graph, the bias may supply incorrect structural priors, reducing the approach to a standard graph transformer and undermining both the few-shot and energy-reduction claims.
minor comments (2)
  1. [Meta-Geometric Encoder] Clarify the exact definition and selection procedure for the 'minimal support set size' and 'physics-bias scaling factor', which appear as free parameters whose sensitivity is not discussed.
  2. [Experiments] Provide the full experimental protocol, including simulation environment details, number of random seeds, and exact comparison methods, to support reproducibility of the reported improvements.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback. We address each major comment point by point below, indicating where we agree and the specific revisions we will make to strengthen the manuscript.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the claims of 'significantly reduces energy consumption over decoupled baselines' and 'state-of-the-art success rates on unseen, non-standard geometries with minimal few-shot examples' are presented without any quantitative results, specific baseline methods, error bars, number of trials, dataset details, or ablation studies. This absence prevents assessment of whether the hybrid strategy delivers the asserted gains in adaptability and efficiency.

    Authors: We agree that the abstract would be improved by including quantitative support for the high-level claims. In the revised version, we will update the abstract to reference specific experimental results, including energy reduction percentages with error bars, success rates on unseen geometries, the decoupled baselines used for comparison, the number of trials, and a brief mention of key ablation findings from the results section. revision: yes

  2. Referee: [Physics-Bias Attention mechanism] Physics-Bias Attention mechanism: the mechanism modulates attention scores using contact forces from DEM simulations to identify structurally critical connections. DEM is formulated for granular/particulate media with many-body collisions and flow dynamics, whereas the target domain involves rigid polyhedral parts under precise kinematic constraints and static stability. Without explicit verification that the extracted forces correspond to actual contact normals or torques in the assembly graph, the bias may supply incorrect structural priors, reducing the approach to a standard graph transformer and undermining both the few-shot and energy-reduction claims.

    Authors: We acknowledge the referee's valid concern about the applicability of standard DEM formulations to rigid polyhedral assemblies. Our use of DEM is intended to derive approximate contact force magnitudes for biasing attention in the graph transformer toward structurally relevant connections. To address this directly, we will add a new subsection in the methods that explains the adaptation for rigid bodies and includes validation comparing DEM-extracted forces against contact normals and torques from our rigid-body kinematic simulator. This clarification will confirm that the physics bias supplies meaningful priors. revision: partial

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The provided manuscript text describes a hybrid framework combining a Meta-Geometric Encoder (Graph Hypernetworks), Physics-Informed Graph Transformer with Physics-Bias Attention (modulated by DEM contact forces), SAC training, Kinematics-Aware Sequencing, and Residual Stability Correction. No equations, derivations, or self-referential definitions are exhibited that reduce a claimed prediction or first-principles result to its own inputs by construction. Claims of few-shot adaptability and energy reduction rest on experimental outcomes rather than tautological fits or load-bearing self-citations, rendering the derivation self-contained against external benchmarks such as success rates on unseen geometries.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 2 invented entities

The central claims rest on several unverified modeling choices and new components whose effectiveness is asserted without independent evidence in the provided abstract.

free parameters (2)
  • minimal support set size
    The hypernetwork is said to adapt from a minimal support set, but the exact size and selection criteria are not specified and would need fitting or tuning.
  • physics-bias scaling factor
    Attention scores are modulated by contact forces; the weighting strength between physics signal and learned features is a tunable parameter.
axioms (2)
  • domain assumption Discrete Element Model simulations produce contact forces that accurately identify structurally critical connections for real-world assembly.
    Invoked to justify the Physics-Bias Attention mechanism.
  • domain assumption The joint energy-stability cost minimized in the residual layer is a faithful proxy for successful real-robot execution.
    Used to bridge Sim2Real gap via differentiable optimization.
invented entities (2)
  • Meta-Geometric Encoder no independent evidence
    purpose: Dynamically generates policy parameters from few-shot support sets for new topologies
    New component introduced to achieve parameter-level adaptation without retraining.
  • Physics-Bias Attention mechanism no independent evidence
    purpose: Modulates transformer attention using DEM contact forces
    New attention variant proposed to incorporate structural physics.

pith-pipeline@v0.9.0 · 5827 in / 1694 out tokens · 60623 ms · 2026-05-20T19:57:51.344537+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

64 extracted references · 64 canonical work pages · 2 internal anchors

  1. [1]

    Rie Ando and Tong Zhang. 2006. Learning on graph with Laplacian regularization. Advances in neural information processing systems19 (2006)

  2. [2]

    Haritsa, Kevin Han, and John-Paul Ore

    Khashayar Asadi, Varun R. Haritsa, Kevin Han, and John-Paul Ore. 2021. Au- tomated Object Manipulation Using Vision-Based Mobile Robotic System for Construction Applications.Journal of Computing in Civil Engineering35, 1 (2021), 04020058. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000946

  3. [3]

    Matan Atad, Jianxiang Feng, Ismael Rodr’iguez, Maximilian Durner, and Rudolph Triebel. 2023. Efficient and Feasible Robotic Assembly Sequence Planning via Graph Representation Learning.2023 IEEE/RSJ International Conference on Intel- ligent Robots and Systems (IROS)(2023), 8262–8269. https://api.semanticscholar. org/CorpusID:257623086

  4. [4]

    2021.Siamese Neural Networks: An Overview

    Davide Chicco. 2021.Siamese Neural Networks: An Overview. Springer US, New York, NY, 73–94. https://doi.org/10.1007/978-1-0716-0826-5_3

  5. [5]

    Chung, J

    H. Chung, J. Kim, B. Knyazev, J. Lee, G. Taylor, J. Park, and M. Cho. 2021. Brick- by-Brick: Combinatorial construction with deep reinforcement learning.arXiv preprint(Oct. 2021). https://doi.org/10.48550/arXiv.2110.15481

  6. [6]

    Jimmy Envall, Roi Poranne, and Stelian Coros. 2023. Differentiable Task Assign- ment and Motion Planning. In2023 IEEE/RSJ International Conference on Intelli- gent Robots and Systems (IROS). 2049–2056. https://doi.org/10.1109/IROS55552. 2023.10341602

  7. [7]

    Chelsea Finn, Pieter Abbeel, and Sergey Levine. 2017. Model-Agnostic Meta- Learning for Fast Adaptation of Deep Networks. InProceedings of the 34th In- ternational Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 70), Doina Precup and Yee Whye Teh (Eds.). PMLR, 1126–1135. https://proceedings.mlr.press/v70/finn17a.html

  8. [8]

    Niklas Funk, Georgia Chalvatzaki, Boris Belousov, and Jan Peters. 2022. Learn2assemble with structured representations and search for robotic architec- tural construction. InConference on Robot Learning. PMLR, 1401–1411

  9. [9]

    Niklas Funk, Svenja Menzenbach, Georgia Chalvatzaki, and Jan Peters. 2022. Graph-based Reinforcement Learning meets Mixed Integer Programs: An appli- cation to 3D robot assembly discovery. In2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). 10215–10222. https://doi.org/10.1109/ IROS47612.2022.9981784

  10. [10]

    Ghasemipour, D

    K. Ghasemipour, D. Freeman, B. David, S. Gu, S. Kataoka, and I. Mordatch. 2022. Blocks Assemble! Learning to assemble with large-scale structured reinforcement learning.arXiv preprint(Mar. 2022). https://doi.org/10.48550/arXiv.2203.13733

  11. [11]

    Andrew Goldberg, Kavish Kondap, Tianshuang Qiu, Zehan Ma, Letian Fu, Justin Kerr, Huang Huang, Kaiyuan Chen, Kuan Fang, and Ken Goldberg. 2025. Blox- Net: Generative Design-for-Robot-Assembly using VLM supervision, Physics, Simulation, and a Robot with Reset. In2025 International Conference on Robotics and Automation (ICRA). IEEE

  12. [12]

    Udesh Gunarathna, Renata Borovica-Gajic, Shanika Karunasekera, and Ege- men Tanin. 2022. Dynamic graph combinatorial optimization with multi- attention deep reinforcement learning. InProceedings of the 30th Interna- tional Conference on Advances in Geographic Information Systems(<conf-loc>, <city>Seattle</city>, <state>Washington</state>, </conf-loc>)(SIG...

  13. [13]

    Shengnan Guo, Youfang Lin, Ning Feng, Chao Song, and Huaiyu Wan. 2019. Attention Based Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting.Proceedings of the AAAI Conference on Artificial Intelligence 33, 01 (Jul. 2019), 922–929. https://doi.org/10.1609/aaai.v33i01.3301922

  14. [14]

    Hamilton, Rex Ying, and Jure Leskovec

    William L. Hamilton, Rex Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. InProceedings of the 31st International Conference on Neural Information Processing Systems(Long Beach, California, USA)(NIPS’17). Curran Associates Inc., Red Hook, NY, USA, 1025–1035

  15. [15]

    Hamrick, K

    J. Hamrick, K. Allen, V. Bapst, T. Zhu, K. McKee, J. Tenenbaum, and P. Battaglia

  16. [16]

    Relational inductive bias for physical construction in humans and machines

    Relational inductive bias for physical construction in humans and machines. arXiv preprint(Jun. 2018). https://doi.org/10.48550/arXiv.1806.01203

  17. [17]

    Hartmann, Andreas Orthey, Danny Driess, Ozgur S

    Valentin N. Hartmann, Andreas Orthey, Danny Driess, Ozgur S. Oguz, and Marc Toussaint. 2023. Long-Horizon Multi-Robot Rearrangement Planning for Construction Assembly.IEEE Transactions on Robotics39, 1 (2023), 239–252. https://doi.org/10.1109/TRO.2022.3198020

  18. [18]

    Masatoshi Hatano, Mamoru Minami, Toshiyuki Asakura, and Tsuyoshi Ohsumi

  19. [19]

    InProceedings of the 16th IAARC/IFAC/IEEE International Symposium on Automation and Robotics in Construction, Carlos Balaguer (Ed.)

    Construction Robot Systems Using Mobile Manipulators - Piling up Blocks with Real Systems and Positioning Accuracy. InProceedings of the 16th IAARC/IFAC/IEEE International Symposium on Automation and Robotics in Construction, Carlos Balaguer (Ed.). International Association for Automa- tion and Robotics in Construction (IAARC), Madrid, Spain, 549–554. htt...

  20. [20]

    Wenlong Huang, Hao Su, et al. 2023. Differentiable Visual Learning for Physics- Based Motion Estimation. InAdvances in Neural Information Processing Systems (NeurIPS). https://openreview.net/forum?id=ecRaDicXxw

  21. [21]

    Yijiang Huang, Pok Yin Victor Leung, Caelan Garrett, Fabio Gramazio, Matthias Kohler, and Caitlin Mueller. 2021. The new analog: A protocol for linking design and construction intent with algorithmic planning for robotic assembly of complex structures(SCF ’21). Association for Computing Machinery, New York, NY, USA, Article 9, 17 pages. https://doi.org/10...

  22. [22]

    Wei Jin, Yao Ma, Xiaorui Liu, Xianfeng Tang, Suhang Wang, and Jiliang Tang. 2020. Graph Structure Learning for Robust Graph Neural Networks. InProceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining(Virtual Event, CA, USA)(KDD ’20). Association for Computing Machinery, New York, NY, USA, 66–74. https://doi.org/10.1...

  23. [23]

    Parnian Kassraie, Andreas Krause, and Ilija Bogunovic. 2022. Graph Neural Network Bandits. InAdvances in Neural Information Processing Systems, Alice H. Oh, Alekh Agarwal, Danielle Belgrave, and Kyunghyun Cho (Eds.). https: //openreview.net/forum?id=BWa5IUE3L4

  24. [24]

    Thomas Kipf and Max Welling. 2016. Semi-Supervised Classification with Graph Convolutional Networks. (09 2016)

  25. [25]

    Manav Kulshrestha and Ahmed H Qureshi. 2023. Structural concept learning via graph attention for multi-level rearrangement planning. InConference on Robot Learning. PMLR, 3180–3193

  26. [26]

    Shih-Yu Lai, Chia-Ching Yen, Yang-Ting Shen, and Bing-Yu Chen. 2025. Architec- tural CAD/CAM in Robotic Fabrication via Differentiable Energy Optimization and Few-Shot Spatiotemporal Graph RL Policy. InProceedings of the SIGGRAPH Asia 2025 Technical Communications (SA Technical Communications ’25). As- sociation for Computing Machinery, New York, NY, USA,...

  27. [27]

    2025.Graph Q-Learning for Automatic Assembly in Design-to-Construction

    Shih-Yu Lai, Chia-Ching Yen, Yang-Ting Shen, and Bing-Yu Chen. 2025.Graph Q-Learning for Automatic Assembly in Design-to-Construction. Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3757374. 3771881

  28. [28]

    Quentin Le Lidec, Louis Montaut, Yann de Mont-Marin, and Justin Carpentier

  29. [29]

    End-to-End and Highly-Efficient Differentiable Simulation for Robotics. (2024). https://doi.org/10.48550/arXiv.2409.07107 arXiv:2409.07107 [cs.RO]

  30. [30]

    Pok Yin Leung, Aleksandra Apolinarska, Davide Tanadini, Fabio Gramazio, and Matthias Kohler. 2021. Automatic Assembly of Jointed Timber Structure using Distributed Robotic Clamps. https://doi.org/10.52842/conf.caadria.2021.1.583

  31. [31]

    Fuxian Li, Jie Feng, Huan Yan, Guangyin Jin, Fan Yang, Funing Sun, Depeng Jin, and Yong Li. 2023. Dynamic Graph Convolutional Recurrent Network for Traffic Prediction: Benchmark and Solution.ACM Trans. Knowl. Discov. Data17, 1, Article 9 (feb 2023), 21 pages. https://doi.org/10.1145/3532611

  32. [32]

    Continuous control with deep reinforcement learning

    Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2019. Continuous control with deep reinforcement learning. arXiv:1509.02971 [cs.LG] https://arxiv.org/ abs/1509.02971

  33. [33]

    Wang, and Akshara Rai

    Yixin Lin, Austin S. Wang, and Akshara Rai. 2021. Efficient and Interpretable Robot Manipulation with Graph Neural Networks.IEEE Robotics and Automation LettersPP (2021), 1–1. https://api.semanticscholar.org/CorpusID:232068621

  34. [34]

    Ruixuan Liu, Philip Huang, Ava Pun, Kangle Deng, Shobhit Aggarwal, Kevin Tang, Michelle Liu, Deva Ramanan, Jun-Yan Zhu, Jiaoyang Li, and Changliu Liu. 2025. Prompt-to-Product: Generative Assembly via Bimanual Manipulation. arXiv:2508.21063 [cs.RO] https://arxiv.org/abs/2508.21063

  35. [35]

    Yue Liu, Ke Liang, Jun Xia, Xihong Yang, Sihang Zhou, Meng Liu, Xinwang Liu, and Stan Z. Li. 2023. Reinforcement Graph Clustering with Unknown Cluster Number. InProceedings of the 31st ACM International Conference on Multimedia (<conf-loc>, <city>Ottawa ON</city>, <country>Canada</country>, </conf- loc>)(MM ’23). Association for Computing Machinery, New Y...

  36. [36]

    Lomas-Barrie, R

    V. Lomas-Barrie, R. Silva-Flores, A. Neme, and M. Pena-Cabrera. 2022. A Mul- tiview Recognition Method of Predefined Objects for Robot Assembly Using Deep Learning and Its Implementation on an FPGA.Electronics11, 5 (2022), 696. https://doi.org/10.3390/electronics11050696

  37. [37]

    Dongsheng Luo, Wei Cheng, Wenchao Yu, Bo Zong, Jingchao Ni, Haifeng Chen, and Xiang Zhang. 2021. Learning to Drop: Robust Graph Neural Network via Topological Denoising. InProceedings of the 14th ACM International Conference on Web Search and Data Mining(Virtual Event, Israel)(WSDM ’21). Association for Computing Machinery, New York, NY, USA, 779–787. htt...

  38. [38]

    Volodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Tim- othy Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asyn- chronous Methods for Deep Reinforcement Learning. InProceedings of The 33rd International Conference on Machine Learning (Proceedings of Machine Learn- ing Research, Vol. 48), Maria Florina Balcan and Kilian...

  39. [39]

    Louis Montaut, Quentin Le Lidec, Antoine Bambade, Vladimir Petrik, Josef Sivic, and Justin Carpentier. 2023. Differentiable Collision Detection: a Randomized Smoothing Approach. InIEEE International Conference on Robotics and Automa- tion (ICRA). https://doi.org/10.48550/arXiv.2209.09012

  40. [40]

    Mingshuo Nie, Dongming Chen, and Dongqi Wang. 2023. Reinforcement Learn- ing on Graphs: A Survey.IEEE Transactions on Emerging Topics in Computational Intelligence7, 4 (2023), 1065–1082. https://doi.org/10.1109/TETCI.2022.3222545 EUPHORIA: Efficient Universal Planning via Hybrid Optimization for Robust Industrial Robotic Assembly

  41. [41]

    Dunlu Peng and Yongsheng Zhang. 2023. MA-GCN: A Memory Augmented Graph Convolutional Network for traffic prediction.Engineering Applications of Artificial Intelligence121 (2023), 106046. https://doi.org/10.1016/j.engappai.2023. 106046

  42. [42]

    Yu Rong, Wenbing Huang, Tingyang Xu, and Junzhou Huang. 2019. Dropedge: Towards deep graph convolutional networks on node classification.arXiv preprint arXiv:1907.10903(2019)

  43. [43]

    Rusu, Dushyant Rao, Jakub Sygnowski, Oriol Vinyals, Razvan Pascanu, Simon Osindero, and Raia Hadsell

    Andrei A. Rusu, Dushyant Rao, Jakub Sygnowski, Oriol Vinyals, Razvan Pascanu, Simon Osindero, and Raia Hadsell. 2019. Meta-Learning with Latent Embedding Optimization. InInternational Conference on Learning Representations. https: //openreview.net/forum?id=BJgklhAcK7

  44. [44]

    Zilin Si, Gu Zhang, Qingwei Ben, Branden Romero, Zhou Xian, Chao Liu, and Chuang Gan. 2024. DIFFTACTILE: A Physics-based Differentiable Tactile Simu- lator for Contact-rich Robotic Manipulation. (2024). https://doi.org/10.48550/ arXiv.2403.08716 arXiv:2403.08716 [cs.RO]

  45. [45]

    Jake Snell, Kevin Swersky, and Richard Zemel. 2017. Prototypical Networks for Few-shot Learning. InAdvances in Neural Information Processing Systems, I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.), Vol. 30. Curran Associates, Inc. https://proceedings.neurips. cc/paper_files/paper/2017/file/cb8da6767461f2...

  46. [46]

    Sven Stumm, Johannes Braumann, and Sigrid Brell-Çokcan. 2016. Human- Machine Interaction for Intuitive Programming of Assembly Tasks in Construc- tion.Procedia CIRP44 (2016), 269–274. https://api.semanticscholar.org/CorpusID: 63263344

  47. [47]

    Sven Stumm, Johannes Braumann, Martin von Hilchen, and Sigrid Brell-Çokcan

  48. [48]

    InInternational Conference on Robotics in Alpe-Adria-Danube Region

    On-Site Robotic Construction Assistance for Assembly Using A-Priori Knowledge and Human-Robot Collaboration. InInternational Conference on Robotics in Alpe-Adria-Danube Region. https://api.semanticscholar.org/CorpusID: 2825045

  49. [49]

    Yunsheng Tian, Joshua Jacob, Yijiang Huang, Jialiang Zhao, Edward Li Gu, Pingchuan Ma, Annan Zhang, Farhad Javid, Branden Romero, Sachin Chitta, Shinjiro Sueda, Hui Li, and Wojciech Matusik. 2025. Fabrica: Dual-Arm Assembly of General Multi-Part Objects via Integrated Planning and Learning. In9th Annual Conference on Robot Learning. https://openreview.net...

  50. [50]

    Yunsheng Tian, Jie Xu, Yichen Li, Jieliang Luo, Shinjiro Sueda, Hui Li, Karl D. D. Willis, and Wojciech Matusik. 2022. Assemble Them All: Physics-Based Planning for Generalizable Assembly by Disassembly.ACM Trans. Graph.41, 6, Article 278 (nov 2022), 11 pages. https://doi.org/10.1145/3550454.3555525

  51. [51]

    Petar Veličković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, and Yoshua Bengio. 2018. Graph Attention Networks. InInternational Confer- ence on Learning Representations. https://openreview.net/forum?id=rJXMpikCZ

  52. [52]

    I Vidovszky and A Pém. 2022. Analyses of Automated Bricklaying Workflow Regarding Time and Arrangement.IOP Conference Series: Materials Science and Engineering1218, 1 (jan 2022), 012004. https://doi.org/10.1088/1757-899X/1218/ 1/012004

  53. [53]

    Ziqi Wang, Wenjun Liu, Jingwen Wang, Gabriel Vallat, Fan Shi, Stefana Parascho, and Maryam Kamgarpour. 2025. Learning to Assemble with Alternative Plans. ACM Trans. Graph.44, 4, Article 101 (July 2025), 16 pages. https://doi.org/10. 1145/3730824

  54. [54]

    Hengxu You, Yang Ye, Tianyu Zhou, Qi Zhu, and Jing Du. 2023. Robot-Enabled Construction Assembly with Automated Sequence Planning based on ChatGPT: RoboGPT. arXiv:2304.11018 [cs.RO] https://arxiv.org/abs/2304.11018

  55. [55]

    Bing Yu, Haoteng Yin, and Zhanxing Zhu. 2018. Spatio-temporal graph convolu- tional networks: a deep learning framework for traffic forecasting. InProceedings of the 27th International Joint Conference on Artificial Intelligence(Stockholm, Sweden)(IJCAI’18). AAAI Press, 3634–3640

  56. [56]

    Mingxin Yu, Lin Shao, Zhehuan Chen, Tianhao Wu, Qingnan Fan, Kaichun Mo, and Hao Dong. 2021. RoboAssembly: Learning Generalizable Furniture Assembly Policy in a Novel Multi-robot Contact-rich Simulation Environment.ArXiv abs/2112.10143 (2021). https://api.semanticscholar.org/CorpusID:245334838

  57. [57]

    Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, and Viktor Prasanna. 2019. Graphsaint: Graph sampling based inductive learning method. arXiv preprint arXiv:1907.04931(2019)

  58. [58]

    Chris Zhang, Mengye Ren, and Raquel Urtasun. 2019. Graph HyperNetworks for Neural Architecture Search. InInternational Conference on Learning Repre- sentations. https://openreview.net/forum?id=rkgW0oA9FX

  59. [59]

    Wenshuai Zhao, Eetu Rantala, Joni Pajarinen, and Jorge Peña Queralta. 2023. Less Is More: Robust Robot Learning via Partially Observable Multi-Agent Rein- forcement Learning

  60. [60]

    Chuanpan Zheng, Xiaoliang Fan, Cheng Wang, and Jianzhong Qi. 2020. GMAN: A Graph Multi-Attention Network for Traffic Prediction.Proceedings of the AAAI Conference on Artificial Intelligence34, 01 (Apr. 2020), 1234–1241. https: //doi.org/10.1609/aaai.v34i01.5477

  61. [61]

    Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, and Maosong Sun. 2020. Graph neural networks: A review of methods and applications.AI Open1 (2020), 57–81. https://doi.org/ 10.1016/j.aiopen.2021.01.001

  62. [62]

    hallucinates

    Yueqing Zhuang, Li Tao, Fan Yang, Cong Ma, Ziwei Zhang, Huizhu Jia, and Xiaodong Xie. 2018. RelationNet: Learning Deep-Aligned Representation for Semantic Image Segmentation. In2018 24th International Conference on Pattern Recognition (ICPR). 1506–1511. https://doi.org/10.1109/ICPR.2018.8545708 Supplementary Materials A Algorithm Details This section prov...

  63. [63]

    Critic Update:The parameters 𝜔 of the soft Q-function are updated by minimizing the Bellman residual: 𝐽𝑄(𝜔) =E B h𝑄𝜔(𝑠, 𝑎)− 𝑟+𝛾E 𝑎′∼𝜋𝜃 [𝑄 ¯𝜔(𝑠′, 𝑎′)−𝛼 ent log𝜋 𝜃 (𝑎′ |𝑠 ′)] 2i . (16)

  64. [64]

    Hypernetwork Update:Unlike standard SAC where policy pa- rameters 𝜃 are optimized directly, in EUPHORIA, 𝜃 is generated by H𝜙(𝑧). Thus, we optimize the hypernetwork parameters 𝜙 by backpropagating the policy loss through the generation process: 𝐽𝜋 (𝜙) =E 𝑠∼B,𝜖∼N h 𝛼ent log𝜋 H𝜙 (𝑧)(𝑓𝜙(𝜖, 𝑠)|𝑠)−𝑄 𝜔(𝑠, 𝑓𝜙(𝜖, 𝑠)) i , (17) where 𝑓𝜙(𝜖, 𝑠)denotes the reparameter...