Functionalization via Structure Completion and Motion Rectification

Ali Mahdavi-Amiri; Angel X. Chang; Duc Anh Nguyen; Hao Zhang; Jiayi Peng; Kai Wang; Manolis Savva; Mingrui Zhao; Ruiqi Wang; Sai Raj Kishore Perla

arxiv: 2605.18010 · v1 · pith:UI2QCWC4new · submitted 2026-05-18 · 💻 cs.CV · cs.GR

Functionalization via Structure Completion and Motion Rectification

Mingrui Zhao , Sai Raj Kishore Perla , Kai Wang , Sauradip Nag , Duc Anh Nguyen , Jiayi Peng , Ruiqi Wang , Angel X. Chang

show 3 more authors

Manolis Savva Ali Mahdavi-Amiri Hao Zhang

This is my paper

Pith reviewed 2026-05-20 12:20 UTC · model grok-4.3

classification 💻 cs.CV cs.GR

keywords object functionalizationfunctional graphgraph completionmotion rectification3D geometry realizationfurniture models3D asset repairphysical operability

0 comments

The pith

Object functionalization uses graph completion to add missing structures and rectify motions in 3D models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces object functionalization, a task that converts visually plausible but non-functional 3D models into ones that can operate physically. It frames the problem as completing a functional graph whose nodes represent object parts with motion attributes and whose edges capture functional and contact relations. A neural Graph Functionalizer called GraFu predicts the missing nodes and corrects edges. The completed graph then guides a geometry realization step that adds connectors and other elements in 3D, which also fixes erroneous motion annotations. Tests on furniture models show gains in collision and connectivity measures while matching existing motion prediction accuracy.

Core claim

Object functionalization is solved by representing a non-functional 3D object as an incomplete functional graph, completing that graph with a neural model to predict missing parts and relations, and using the completed graph to realize added 3D geometry while rectifying motions, with the result that the output models exhibit improved physical operability.

What carries the argument

The functional graph, a representation whose labeled nodes stand for object parts carrying motion attributes and whose labeled edges encode functional and contact relations, which is completed by the neural Graph Functionalizer to drive subsequent 3D geometry realization.

If this is right

The completed graph directly instantiates predicted connectors and structural elements as 3D geometry.
Erroneous human-annotated and predicted motions are rectified as a side effect of the geometry realization stage.
Motion prediction accuracy matches state-of-the-art methods on PartNet-Mobility zero-shot and HSSD test sets.
Functionality improves substantially on collision and connectivity metrics for furniture models.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same graph-completion approach could be applied to 3D models from other categories such as tools or mechanical assemblies without retraining on furniture-specific data.
Large-scale automated production of functional 3D assets becomes feasible once the graph completion step is integrated with existing generative pipelines.
Physics-based feedback loops could be added to the geometry realization stage to further reduce residual collisions after graph completion.

Load-bearing premise

Structural and functional deficiencies in a 3D model can be fully captured as missing nodes or wrong edges in a labeled functional graph, so that completing the graph is enough to produce correct 3D elements and fixed motions.

What would settle it

A test in a physics simulator that applies the predicted motions to the functionalized models and measures whether collisions and structural failures are eliminated compared with the original non-functional versions.

Figures

Figures reproduced from arXiv: 2605.18010 by Ali Mahdavi-Amiri, Angel X. Chang, Duc Anh Nguyen, Hao Zhang, Jiayi Peng, Kai Wang, Manolis Savva, Mingrui Zhao, Ruiqi Wang, Sai Raj Kishore Perla, Sauradip Nag.

**Figure 1.** Figure 1: Our method functionalizes a cabinet model that cannot function as it should, due to a) missing connectors for its door (hinge) or drawer (rails) to open and be held in place, and b) a collision between the door and the side panel. Our solution pipeline consists of a neural graph functionalizer (GraFu) for functional graph completion, e.g., adding missing connectors (new graph edges), as well as drawer hand… view at source ↗

**Figure 2.** Figure 2: Our Pipeline. Given a non-functional 3D furniture asset, the system encodes each part node using category, point cloud, and bounding box centroid features. A Graph Transformer then models inter-node relationships across the fully-connected graph. The resulting node features are passed to a DETR-style Slot Decoder, which predicts per-slot attributes (category, bounding box, motion axis) and per-slot-pair ed… view at source ↗

**Figure 3.** Figure 3: Various application conditions for Contact-face snapping, Flushing [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: Limitations and failure cases. Our method is bounded by the available [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 5.** Figure 5: Qualitative results and comparisons to Particulate and SINGAPO on test samples from PN-M (top 7 rows) and HSSD (bottom 2 rows). For our results, [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗

**Figure 6.** Figure 6: A gallery of qualitative results from our object functionalization tool. Dynamic functionalization is shown in (a-g), where in each, the top is before [PITH_FULL_IMAGE:figures/full_fig_p010_6.png] view at source ↗

**Figure 7.** Figure 7: Dynamic and static functionalizations applied to a 3D model obtained from single-view 3D reconstruction by Omniparts [Yang et al. 2025]. [PITH_FULL_IMAGE:figures/full_fig_p010_7.png] view at source ↗

**Figure 8.** Figure 8: Mechanical part annotation. Row 1: interior mount hinge with non [PITH_FULL_IMAGE:figures/full_fig_p013_8.png] view at source ↗

**Figure 9.** Figure 9: 100 randomly sampled PN-M models with missing tops. Top row, GT render, bottom row: ours functionalized. Please zoom in for details. [PITH_FULL_IMAGE:figures/full_fig_p016_9.png] view at source ↗

**Figure 10.** Figure 10: Comparison against BlenderMCP. Left: BlenderMCP, right: Ours. (a) Adding rails to the drawer. (b)(d) Adding hinges to the cabinet. (c) Adding [PITH_FULL_IMAGE:figures/full_fig_p018_10.png] view at source ↗

read the original abstract

Acquisition and creation of 3D assets have been largely view- or appearance-driven. As a result, existing digital 3D models often lack the requisite structural components to function as intended, such as joints, supports, interiors, or interaction elements. At the same time, even human-annotated motions are frequently error-prone, leading to physically implausible behavior. We introduce object functionalization, a novel task aimed at transforming visually plausible but non-functional 3D models into functional and physically operable ones. We formulate functionalization as a graph completion problem over a new functional graph representation, where labeled nodes represent object parts, labeled edges encode functional and contact relations, and movable nodes carry motion attributes, so that structural functional deficiencies manifest as missing nodes or incorrect edges. We develop a neural Graph Functionalizer (GraFu) to complete an incomplete graph representing a non-functional 3D object. The completed graph then drives a geometry realization stage that instantiates predicted connectors and structural elements in 3D, with the compelling side effect of rectifying erroneous human-annotated and predicted motions. To support training and evaluation, focusing on furniture as a rich and challenging target category, we introduce FurFun-233, a dataset of 233 paired non-functional and functionalized furniture models. On PartNet-Mobility ("zero-shot") and HSSD test sets, our method matches state-of-the-art methods in motion prediction accuracy while substantially improving functionality in terms of collision and connectivity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper frames object functionalization as graph completion on a new functional graph to add missing structure and fix motions in 3D models, with a new furniture dataset, but the geometry realization step looks under-supported.

read the letter

The main thing to know is that this work defines a new task of turning visually plausible but non-functional 3D models into operable ones by completing a labeled graph of parts, contact relations, and motion attributes, then realizing the geometry from the completed graph. They also release FurFun-233, a set of 233 paired non-functional and fixed furniture models, and test zero-shot on PartNet-Mobility and HSSD where motion accuracy stays at SOTA levels while collision and connectivity metrics improve.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces the task of object functionalization to convert visually plausible but non-functional 3D models into physically operable ones. It formulates the problem as graph completion over a novel functional graph representation (labeled nodes for parts, edges for functional/contact relations, and motion attributes on movable nodes). A neural Graph Functionalizer (GraFu) completes the incomplete input graph; the completed graph then drives a geometry realization stage that instantiates predicted connectors and structural elements in 3D, with the side effect of rectifying erroneous motions. The authors introduce the FurFun-233 paired furniture dataset and report that the method matches SOTA motion-prediction accuracy on zero-shot PartNet-Mobility and HSSD evaluations while substantially improving collision and connectivity functionality metrics.

Significance. If the central claims hold, the work is significant because it directly targets the gap between appearance-driven 3D asset creation and physical operability, which is relevant for robotics, simulation, and interactive applications. The graph-completion framing and the incidental motion-rectification effect are conceptually clean; the release of FurFun-233 provides a concrete benchmark for future work. These elements would strengthen the paper's contribution provided the geometric validity of the realization stage is rigorously demonstrated.

major comments (2)

[§4.2] §4.2 (Geometry Realization): The central claim that graph completion is sufficient to produce collision-free, physically operable 3D models rests on the unstated assumption that every completed edge relation admits a unique, stable 3D embedding. The manuscript does not provide a constraint solver or disambiguation procedure; if realization uses heuristic placement, small errors in predicted edges could still produce intersecting geometry or violated joint limits, undermining the reported collision/connectivity gains.
[§5.3] §5.3 and Table 3: The zero-shot results on PartNet-Mobility and HSSD claim improved functionality metrics, yet no ablation isolates the contribution of graph completion versus the downstream realization heuristics. Without such controls, it is unclear whether the functionality improvements are robust to the discrete abstraction or merely artifacts of the particular instantiation procedure.

minor comments (2)

The abstract would be strengthened by including one or two key quantitative numbers (e.g., collision-rate reduction or connectivity score) rather than qualitative statements of improvement.
[§3.1] Notation for motion attributes on movable nodes should be defined explicitly in §3.1 to avoid ambiguity when readers compare the functional graph to standard scene graphs.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their insightful comments on our manuscript. We are pleased that the referee recognizes the significance of the object functionalization task and the introduction of the FurFun-233 dataset. We address each major comment below, providing clarifications and outlining revisions to strengthen the paper.

read point-by-point responses

Referee: [§4.2] §4.2 (Geometry Realization): The central claim that graph completion is sufficient to produce collision-free, physically operable 3D models rests on the unstated assumption that every completed edge relation admits a unique, stable 3D embedding. The manuscript does not provide a constraint solver or disambiguation procedure; if realization uses heuristic placement, small errors in predicted edges could still produce intersecting geometry or violated joint limits, undermining the reported collision/connectivity gains.

Authors: We thank the referee for highlighting this important aspect of the geometry realization stage. The realization procedure is indeed heuristic in nature, using the completed functional graph to determine attachment points, orientations, and structural additions based on the predicted relations and motion attributes. Specifically, contact edges define spatial constraints for part placement, while motion attributes on nodes specify joint axes and limits that are enforced during instantiation. Although we do not employ a general-purpose constraint solver, the graph structure ensures that the embedding is consistent with the functional specifications by design. Our quantitative results demonstrate that this approach leads to measurable improvements in collision avoidance and connectivity, indicating practical stability for the furniture category. To address the concern, we will expand §4.2 with a detailed description of the instantiation algorithm, including how potential ambiguities are resolved through priority rules derived from the graph labels. This will make the assumptions more explicit. revision: partial
Referee: [§5.3] §5.3 and Table 3: The zero-shot results on PartNet-Mobility and HSSD claim improved functionality metrics, yet no ablation isolates the contribution of graph completion versus the downstream realization heuristics. Without such controls, it is unclear whether the functionality improvements are robust to the discrete abstraction or merely artifacts of the particular instantiation procedure.

Authors: We agree that an ablation isolating the graph completion from the realization would strengthen the claims. The realization stage is deterministic given the input graph, and we apply the same procedure to both our completed graphs and those from baseline methods where applicable. The functionality gains are tied to the accuracy of the completed functional relations, as poorer graph predictions lead to more collisions in our tests. In the revised version, we will add an ablation in §5.3 that evaluates functionality metrics using the realization on (i) ground-truth graphs, (ii) our predicted graphs, and (iii) graphs from alternative completion approaches, to demonstrate that the improvements stem from better graph completion rather than the realization heuristics alone. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained

full rationale

The paper formulates functionalization as a graph completion problem on a functional graph (nodes for parts, edges for relations, motion attributes on movable nodes), trains a neural Graph Functionalizer (GraFu) to complete incomplete graphs from non-functional inputs, and then applies a separate geometry realization stage to instantiate connectors and rectify motions. This chain does not reduce any claimed output to its inputs by construction: graph completion is a learned prediction from data, realization is a downstream geometric process, and evaluations on FurFun-233, PartNet-Mobility, and HSSD are external. No equations, fitted parameters renamed as predictions, or load-bearing self-citations appear in the provided text that would create definitional equivalence. The approach is therefore independently verifiable against the reported metrics.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim rests on the premise that functional deficiencies are expressible as graph incompleteness and that graph completion plus geometry realization yields physically operable models; no free parameters or invented physical entities are named in the abstract.

axioms (1)

domain assumption Structural and functional deficiencies of a 3D object manifest as missing nodes or incorrect edges in a functional graph
Explicitly stated in the abstract as the modeling choice that turns functionalization into a graph-completion problem.

invented entities (1)

functional graph representation no independent evidence
purpose: To encode parts, functional/contact relations, and motion attributes so that deficiencies appear as missing or wrong graph elements
New representation introduced to support the graph-completion formulation; no independent evidence outside the paper is provided in the abstract.

pith-pipeline@v0.9.0 · 5827 in / 1441 out tokens · 37410 ms · 2026-05-20T12:20:32.036377+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

296 extracted references · 296 canonical work pages · 33 internal anchors

[1]

Design and Fabrication by Example , journal = TOG, author =

work page
[2]

Stackabilization , author =

work page
[3]

Foldabilizing furniture , author =

work page
[4]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops , month =

Arora, Himanshu and Mishra, Saurabh and Peng, Shichong and Li, Ke and Mahdavi-Amiri, Ali , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops , month =. 2022 , pages =

work page 2022
[5]

Proceedings of the 2003 Eurographics/ACM SIGGRAPH symposium on Geometry processing , pages=

Filling holes in meshes , author=. Proceedings of the 2003 Eurographics/ACM SIGGRAPH symposium on Geometry processing , pages=

work page 2003
[6]

School of Computing, University of Utah, UUCS-04-019, UT, USA , volume=

A hole-filling algorithm for triangular meshes , author=. School of Computing, University of Utah, UUCS-04-019, UT, USA , volume=

work page
[7]

ACM Transactions on Graphics (TOG) , volume=

Robust repair of polygonal models , author=. ACM Transactions on Graphics (TOG) , volume=. 2004 , publisher=

work page 2004
[8]

Proceedings of the fourth Eurographics symposium on Geometry processing , volume=

Poisson surface reconstruction , author=. Proceedings of the fourth Eurographics symposium on Geometry processing , volume=

work page
[9]

The Visual Computer , volume=

A robust hole-filling algorithm for triangular mesh , author=. The Visual Computer , volume=. 2007 , publisher=

work page 2007
[10]

ACM Transactions on Graphics (ToG) , volume=

Screened poisson surface reconstruction , author=. ACM Transactions on Graphics (ToG) , volume=. 2013 , publisher=

work page 2013
[11]

Computer Aided Geometric Design , volume=

Poisson-driven seamless completion of triangular meshes , author=. Computer Aided Geometric Design , volume=. 2015 , publisher=

work page 2015
[12]

IEEE Transactions on Visualization and Computer Graphics , volume=

Point cloud completion: A survey , author=. IEEE Transactions on Visualization and Computer Graphics , volume=. 2023 , publisher=

work page 2023
[13]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Shapeformer: Transformer-based shape completion via sparse representation , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page
[14]

Make it stand: balancing shapes for 3D fabrication , author =

work page
[15]

Build-to-Last: Strength to Weight 3D Printed Objects , journal = TOG, year = 2014, author =

work page 2014
[16]

2025 , eprint=

Particulate: Feed-Forward 3D Object Articulation , author=. 2025 , eprint=

work page 2025
[17]

2025 , eprint=

Articulate That Object Part (ATOP): 3D Part Articulation via Text and Motion Personalization , author=. 2025 , eprint=

work page 2025
[18]

arXiv preprint arXiv:2502.02590 , year=

Articulate AnyMesh: Open-vocabulary 3D Articulated Objects Modeling , author=. arXiv preprint arXiv:2502.02590 , year=

work page arXiv
[19]

3D Gaussian Splatting for Real-Time Radiance Field Rendering , journal =

Kerbl, Bernhard and Kopanas, Georgios and Leimk. 3D Gaussian Splatting for Real-Time Radiance Field Rendering , journal =. 2023 , url =

work page 2023
[20]

2020 , booktitle=

NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis , author=. 2020 , booktitle=

work page 2020
[21]

Weikai Chen and Cheng Lin and Weiyang Li and Bo Yang , title =

work page
[22]

Yizhi Wang and Wallace Lira and Wenqi Wang and Ali Mahdavi-Amiri and and Hao Zhang , title =

work page
[23]

Hongchi Xia and Entong Su and Marius Memmel and Arhan Jain and Raymond Yu and Numfor Mbiziwo-Tiapo and Ali Farhadi and Abhishek Gupta and Shenlong Wang and Wei-Chiu Ma , title =

work page
[24]

Interaction-Driven Active 3D Reconstruction with Object Interiors , author =

work page
[25]

Survey on Modeling of Human-made Articulated Objects , author =

work page
[26]

Symmetrization , author =

work page
[27]

Eurographics State-of-the-art Report (STAR) , year =

Structure-aware shape processing , author =. Eurographics State-of-the-art Report (STAR) , year =

work page
[28]

Computers & Graphics , volume = 33, issue = 1, pages =

Sketch-based modeling: A survey , author =. Computers & Graphics , volume = 33, issue = 1, pages =

work page
[29]

Mario Botsch and Olga Sorkine , title =

work page
[30]

Guibas and Antonio Torralba and Joshua B

Yining Hong and Kaichun Mo and Li Yi and Leonidas J. Guibas and Antonio Torralba and Joshua B. Tenenbaum and Chuang Gan , title =

work page
[31]

Structured 3D Latents for Scalable and Versatile 3D Generation , author =

work page
[32]

2024 , eprint=

Deep Learning Based 3D Segmentation: A Survey , author=. 2024 , eprint=

work page 2024
[33]

2023 , eprint=

Objaverse-XL: A Universe of 10M+ 3D Objects , author=. 2023 , eprint=

work page 2023
[34]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Neumap: Neural coordinate mapping by auto-transdecoder for camera localization , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[35]

ACM Transactions on Graphics (TOG) , volume=

3dshape2vecset: A 3d shape representation for neural fields and generative diffusion models , author=. ACM Transactions on Graphics (TOG) , volume=. 2023 , publisher=

work page 2023
[36]

Advances in neural information processing systems , volume=

Pytorch: An imperative style, high-performance deep learning library , author=. Advances in neural information processing systems , volume=

work page
[37]

Adam: A Method for Stochastic Optimization

Adam: A method for stochastic optimization , author=. arXiv preprint arXiv:1412.6980 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[38]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Pla: Language-driven open-vocabulary 3d scene understanding , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[39]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

3d highlighter: Localizing regions on 3d shapes via text descriptions , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[40]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Satr: Zero-shot semantic segmentation of 3d shapes , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page
[41]

Proceedings IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , year=

Self-supervised Neural Articulated Shape and Appearance Models , author =. Proceedings IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , year=

work page
[42]

arXiv preprint arXiv:2403.14937 , year=

Survey on Modeling of Articulated Objects , author=. arXiv preprint arXiv:2403.14937 , year=

work page arXiv
[43]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Paris: Part-level reconstruction and motion analysis for articulated objects , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page
[44]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

REACTO: Reconstructing Articulated Objects from a Single Video , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[45]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Tune-a-video: One-shot tuning of image diffusion models for text-to-video generation , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page
[46]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Fatezero: Fusing attentions for zero-shot text-based video editing , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page
[47]

ACM Computing Surveys , volume=

Diffusion models: A comprehensive survey of methods and applications , author=. ACM Computing Surveys , volume=. 2023 , publisher=

work page 2023
[48]

2023 , eprint=

Zero-1-to-3: Zero-shot One Image to 3D Object , author=. 2023 , eprint=

work page 2023
[49]

MVDream: Multi-view Diffusion for 3D Generation

Mvdream: Multi-view diffusion for 3d generation , author=. arXiv preprint arXiv:2308.16512 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[50]

arXiv preprint arXiv:2312.02201 , year=

Imagedream: Image-prompt multi-view diffusion for 3d generation , author=. arXiv preprint arXiv:2312.02201 , year=

work page arXiv
[51]

ModelScope Text-to-Video Technical Report

Modelscope text-to-video technical report , author=. arXiv preprint arXiv:2308.06571 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[52]

Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets

Stable video diffusion: Scaling latent video diffusion models to large datasets , author=. arXiv preprint arXiv:2311.15127 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[53]

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Animatediff: Animate your personalized text-to-image diffusion models without specific tuning , author=. arXiv preprint arXiv:2307.04725 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[54]

European Conference on Computer Vision , pages=

Dynamicrafter: Animating open-domain images with video diffusion priors , author=. European Conference on Computer Vision , pages=. 2024 , organization=

work page 2024
[55]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Align your latents: High-resolution video synthesis with latent diffusion models , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[56]

Puppet-master: Scaling interactive video generation as a motion prior for part-level dynamics , author=

work page
[57]

Articulate-anything: Auto- matic modeling of articulated objects via a vision-language foundation model.arXiv preprint arXiv:2410.13882, 2024

Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model , author=. arXiv preprint arXiv:2410.13882 , year=

work page arXiv
[58]

European Conference on Computer Vision , pages=

Motiondirector: Motion customization of text-to-video diffusion models , author=. European Conference on Computer Vision , pages=. 2024 , organization=

work page 2024
[59]

arXiv preprint arXiv:2312.05288 , year=

Motioncrafter: One-shot motion customization of diffusion models , author=. arXiv preprint arXiv:2312.05288 , year=

work page arXiv
[60]

arXiv preprint arXiv:2402.14780 , year=

Customize-a-video: One-shot motion customization of text-to-video diffusion models , author=. arXiv preprint arXiv:2402.14780 , year=

work page arXiv
[61]

arXiv preprint arXiv:2312.04966 , year=

Customizing motion in text-to-video diffusion models , author=. arXiv preprint arXiv:2312.04966 , year=

work page arXiv
[62]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Dreamvideo: Composing your dream videos with customized subject and motion , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[63]

arXiv preprint arXiv:2405.20155 , year=

MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models , author=. arXiv preprint arXiv:2405.20155 , year=

work page arXiv
[64]

Xiang Wang and Hangjie Yuan and Shiwei Zhang and Dayou Chen and Jiuniu Wang and Yingya Zhang and Yujun Shen and Deli Zhao and Jingren Zhou , title =

work page
[65]

Ruiqi Wang and Akshay Patil and Fenggen Yu and Hao Zhang , title =

work page
[66]

2022 , eprint=

Imagen Video: High Definition Video Generation with Diffusion Models , author=. 2022 , eprint=

work page 2022
[67]

European Conference on Computer Vision , pages=

Sv3d: Novel multi-view synthesis and 3d generation from a single image using latent video diffusion , author=. European Conference on Computer Vision , pages=. 2025 , organization=

work page 2025
[68]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Wonder3d: Single image to 3d using cross-domain diffusion , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[69]

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

Ip-adapter: Text compatible image prompt adapter for text-to-image diffusion models , author=. arXiv preprint arXiv:2308.06721 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[70]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Sapien: A simulated part-based interactive environment , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page
[71]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

SSSien: A simulated part-based interactive environment , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page
[72]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Shape2motion: Joint analysis of motion parts and attributes from 3d shapes , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page
[73]

Proceedings of the 3rd Conference on Robot Learning , year=

Learning to generalize kinematic models to novel objects , author=. Proceedings of the 3rd Conference on Robot Learning , year=

work page
[74]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Where2act: From pixels to actions for articulated 3d objects , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page
[75]

2021 IEEE International Conference on Robotics and Automation (ICRA) , pages=

Screwnet: Category-independent articulation model estimation from depth images using screw theory , author=. 2021 IEEE International Conference on Robotics and Automation (ICRA) , pages=. 2021 , organization=

work page 2021
[76]

ACM Transactions On Graphics (TOG) , volume=

Learning to predict part mobility from a single static snapshot , author=. ACM Transactions On Graphics (TOG) , volume=. 2017 , publisher=

work page 2017
[77]

and Mildenhall, Ben , title =

Poole, Ben and Jain, Ajay and Barron, Jonathan T. and Mildenhall, Ben , title =. arXiv , year =

work page
[78]

Advances in neural information processing systems , volume=

Denoising diffusion probabilistic models , author=. Advances in neural information processing systems , volume=

work page
[79]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

High-resolution image synthesis with latent diffusion models , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page
[80]

Deep Unsupervised Learning using Nonequilibrium Thermodynamics , booktitle =

Jascha Sohl. Deep Unsupervised Learning using Nonequilibrium Thermodynamics , booktitle =

work page

Showing first 80 references.

[1] [1]

Design and Fabrication by Example , journal = TOG, author =

work page

[2] [2]

Stackabilization , author =

work page

[3] [3]

Foldabilizing furniture , author =

work page

[4] [4]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops , month =

Arora, Himanshu and Mishra, Saurabh and Peng, Shichong and Li, Ke and Mahdavi-Amiri, Ali , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops , month =. 2022 , pages =

work page 2022

[5] [5]

Proceedings of the 2003 Eurographics/ACM SIGGRAPH symposium on Geometry processing , pages=

Filling holes in meshes , author=. Proceedings of the 2003 Eurographics/ACM SIGGRAPH symposium on Geometry processing , pages=

work page 2003

[6] [6]

School of Computing, University of Utah, UUCS-04-019, UT, USA , volume=

A hole-filling algorithm for triangular meshes , author=. School of Computing, University of Utah, UUCS-04-019, UT, USA , volume=

work page

[7] [7]

ACM Transactions on Graphics (TOG) , volume=

Robust repair of polygonal models , author=. ACM Transactions on Graphics (TOG) , volume=. 2004 , publisher=

work page 2004

[8] [8]

Proceedings of the fourth Eurographics symposium on Geometry processing , volume=

Poisson surface reconstruction , author=. Proceedings of the fourth Eurographics symposium on Geometry processing , volume=

work page

[9] [9]

The Visual Computer , volume=

A robust hole-filling algorithm for triangular mesh , author=. The Visual Computer , volume=. 2007 , publisher=

work page 2007

[10] [10]

ACM Transactions on Graphics (ToG) , volume=

Screened poisson surface reconstruction , author=. ACM Transactions on Graphics (ToG) , volume=. 2013 , publisher=

work page 2013

[11] [11]

Computer Aided Geometric Design , volume=

Poisson-driven seamless completion of triangular meshes , author=. Computer Aided Geometric Design , volume=. 2015 , publisher=

work page 2015

[12] [12]

IEEE Transactions on Visualization and Computer Graphics , volume=

Point cloud completion: A survey , author=. IEEE Transactions on Visualization and Computer Graphics , volume=. 2023 , publisher=

work page 2023

[13] [13]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Shapeformer: Transformer-based shape completion via sparse representation , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page

[14] [14]

Make it stand: balancing shapes for 3D fabrication , author =

work page

[15] [15]

Build-to-Last: Strength to Weight 3D Printed Objects , journal = TOG, year = 2014, author =

work page 2014

[16] [16]

2025 , eprint=

Particulate: Feed-Forward 3D Object Articulation , author=. 2025 , eprint=

work page 2025

[17] [17]

2025 , eprint=

Articulate That Object Part (ATOP): 3D Part Articulation via Text and Motion Personalization , author=. 2025 , eprint=

work page 2025

[18] [18]

arXiv preprint arXiv:2502.02590 , year=

Articulate AnyMesh: Open-vocabulary 3D Articulated Objects Modeling , author=. arXiv preprint arXiv:2502.02590 , year=

work page arXiv

[19] [19]

3D Gaussian Splatting for Real-Time Radiance Field Rendering , journal =

Kerbl, Bernhard and Kopanas, Georgios and Leimk. 3D Gaussian Splatting for Real-Time Radiance Field Rendering , journal =. 2023 , url =

work page 2023

[20] [20]

2020 , booktitle=

NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis , author=. 2020 , booktitle=

work page 2020

[21] [21]

Weikai Chen and Cheng Lin and Weiyang Li and Bo Yang , title =

work page

[22] [22]

Yizhi Wang and Wallace Lira and Wenqi Wang and Ali Mahdavi-Amiri and and Hao Zhang , title =

work page

[23] [23]

Hongchi Xia and Entong Su and Marius Memmel and Arhan Jain and Raymond Yu and Numfor Mbiziwo-Tiapo and Ali Farhadi and Abhishek Gupta and Shenlong Wang and Wei-Chiu Ma , title =

work page

[24] [24]

Interaction-Driven Active 3D Reconstruction with Object Interiors , author =

work page

[25] [25]

Survey on Modeling of Human-made Articulated Objects , author =

work page

[26] [26]

Symmetrization , author =

work page

[27] [27]

Eurographics State-of-the-art Report (STAR) , year =

Structure-aware shape processing , author =. Eurographics State-of-the-art Report (STAR) , year =

work page

[28] [28]

Computers & Graphics , volume = 33, issue = 1, pages =

Sketch-based modeling: A survey , author =. Computers & Graphics , volume = 33, issue = 1, pages =

work page

[29] [29]

Mario Botsch and Olga Sorkine , title =

work page

[30] [30]

Guibas and Antonio Torralba and Joshua B

Yining Hong and Kaichun Mo and Li Yi and Leonidas J. Guibas and Antonio Torralba and Joshua B. Tenenbaum and Chuang Gan , title =

work page

[31] [31]

Structured 3D Latents for Scalable and Versatile 3D Generation , author =

work page

[32] [32]

2024 , eprint=

Deep Learning Based 3D Segmentation: A Survey , author=. 2024 , eprint=

work page 2024

[33] [33]

2023 , eprint=

Objaverse-XL: A Universe of 10M+ 3D Objects , author=. 2023 , eprint=

work page 2023

[34] [34]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Neumap: Neural coordinate mapping by auto-transdecoder for camera localization , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[35] [35]

ACM Transactions on Graphics (TOG) , volume=

3dshape2vecset: A 3d shape representation for neural fields and generative diffusion models , author=. ACM Transactions on Graphics (TOG) , volume=. 2023 , publisher=

work page 2023

[36] [36]

Advances in neural information processing systems , volume=

Pytorch: An imperative style, high-performance deep learning library , author=. Advances in neural information processing systems , volume=

work page

[37] [37]

Adam: A Method for Stochastic Optimization

Adam: A method for stochastic optimization , author=. arXiv preprint arXiv:1412.6980 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[38] [38]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Pla: Language-driven open-vocabulary 3d scene understanding , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[39] [39]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

3d highlighter: Localizing regions on 3d shapes via text descriptions , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[40] [40]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Satr: Zero-shot semantic segmentation of 3d shapes , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page

[41] [41]

Proceedings IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , year=

Self-supervised Neural Articulated Shape and Appearance Models , author =. Proceedings IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , year=

work page

[42] [42]

arXiv preprint arXiv:2403.14937 , year=

Survey on Modeling of Articulated Objects , author=. arXiv preprint arXiv:2403.14937 , year=

work page arXiv

[43] [43]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Paris: Part-level reconstruction and motion analysis for articulated objects , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page

[44] [44]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

REACTO: Reconstructing Articulated Objects from a Single Video , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[45] [45]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Tune-a-video: One-shot tuning of image diffusion models for text-to-video generation , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page

[46] [46]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Fatezero: Fusing attentions for zero-shot text-based video editing , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page

[47] [47]

ACM Computing Surveys , volume=

Diffusion models: A comprehensive survey of methods and applications , author=. ACM Computing Surveys , volume=. 2023 , publisher=

work page 2023

[48] [48]

2023 , eprint=

Zero-1-to-3: Zero-shot One Image to 3D Object , author=. 2023 , eprint=

work page 2023

[49] [49]

MVDream: Multi-view Diffusion for 3D Generation

Mvdream: Multi-view diffusion for 3d generation , author=. arXiv preprint arXiv:2308.16512 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[50] [50]

arXiv preprint arXiv:2312.02201 , year=

Imagedream: Image-prompt multi-view diffusion for 3d generation , author=. arXiv preprint arXiv:2312.02201 , year=

work page arXiv

[51] [51]

ModelScope Text-to-Video Technical Report

Modelscope text-to-video technical report , author=. arXiv preprint arXiv:2308.06571 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[52] [52]

Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets

Stable video diffusion: Scaling latent video diffusion models to large datasets , author=. arXiv preprint arXiv:2311.15127 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[53] [53]

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Animatediff: Animate your personalized text-to-image diffusion models without specific tuning , author=. arXiv preprint arXiv:2307.04725 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[54] [54]

European Conference on Computer Vision , pages=

Dynamicrafter: Animating open-domain images with video diffusion priors , author=. European Conference on Computer Vision , pages=. 2024 , organization=

work page 2024

[55] [55]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Align your latents: High-resolution video synthesis with latent diffusion models , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[56] [56]

Puppet-master: Scaling interactive video generation as a motion prior for part-level dynamics , author=

work page

[57] [57]

Articulate-anything: Auto- matic modeling of articulated objects via a vision-language foundation model.arXiv preprint arXiv:2410.13882, 2024

Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model , author=. arXiv preprint arXiv:2410.13882 , year=

work page arXiv

[58] [58]

European Conference on Computer Vision , pages=

Motiondirector: Motion customization of text-to-video diffusion models , author=. European Conference on Computer Vision , pages=. 2024 , organization=

work page 2024

[59] [59]

arXiv preprint arXiv:2312.05288 , year=

Motioncrafter: One-shot motion customization of diffusion models , author=. arXiv preprint arXiv:2312.05288 , year=

work page arXiv

[60] [60]

arXiv preprint arXiv:2402.14780 , year=

Customize-a-video: One-shot motion customization of text-to-video diffusion models , author=. arXiv preprint arXiv:2402.14780 , year=

work page arXiv

[61] [61]

arXiv preprint arXiv:2312.04966 , year=

Customizing motion in text-to-video diffusion models , author=. arXiv preprint arXiv:2312.04966 , year=

work page arXiv

[62] [62]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Dreamvideo: Composing your dream videos with customized subject and motion , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[63] [63]

arXiv preprint arXiv:2405.20155 , year=

MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models , author=. arXiv preprint arXiv:2405.20155 , year=

work page arXiv

[64] [64]

Xiang Wang and Hangjie Yuan and Shiwei Zhang and Dayou Chen and Jiuniu Wang and Yingya Zhang and Yujun Shen and Deli Zhao and Jingren Zhou , title =

work page

[65] [65]

Ruiqi Wang and Akshay Patil and Fenggen Yu and Hao Zhang , title =

work page

[66] [66]

2022 , eprint=

Imagen Video: High Definition Video Generation with Diffusion Models , author=. 2022 , eprint=

work page 2022

[67] [67]

European Conference on Computer Vision , pages=

Sv3d: Novel multi-view synthesis and 3d generation from a single image using latent video diffusion , author=. European Conference on Computer Vision , pages=. 2025 , organization=

work page 2025

[68] [68]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Wonder3d: Single image to 3d using cross-domain diffusion , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[69] [69]

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

Ip-adapter: Text compatible image prompt adapter for text-to-image diffusion models , author=. arXiv preprint arXiv:2308.06721 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[70] [70]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Sapien: A simulated part-based interactive environment , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page

[71] [71]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

SSSien: A simulated part-based interactive environment , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page

[72] [72]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Shape2motion: Joint analysis of motion parts and attributes from 3d shapes , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

work page

[73] [73]

Proceedings of the 3rd Conference on Robot Learning , year=

Learning to generalize kinematic models to novel objects , author=. Proceedings of the 3rd Conference on Robot Learning , year=

work page

[74] [74]

Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

Where2act: From pixels to actions for articulated 3d objects , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

work page

[75] [75]

2021 IEEE International Conference on Robotics and Automation (ICRA) , pages=

Screwnet: Category-independent articulation model estimation from depth images using screw theory , author=. 2021 IEEE International Conference on Robotics and Automation (ICRA) , pages=. 2021 , organization=

work page 2021

[76] [76]

ACM Transactions On Graphics (TOG) , volume=

Learning to predict part mobility from a single static snapshot , author=. ACM Transactions On Graphics (TOG) , volume=. 2017 , publisher=

work page 2017

[77] [77]

and Mildenhall, Ben , title =

Poole, Ben and Jain, Ajay and Barron, Jonathan T. and Mildenhall, Ben , title =. arXiv , year =

work page

[78] [78]

Advances in neural information processing systems , volume=

Denoising diffusion probabilistic models , author=. Advances in neural information processing systems , volume=

work page

[79] [79]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

High-resolution image synthesis with latent diffusion models , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page

[80] [80]

Deep Unsupervised Learning using Nonequilibrium Thermodynamics , booktitle =

Jascha Sohl. Deep Unsupervised Learning using Nonequilibrium Thermodynamics , booktitle =

work page