Articraft: An Agentic System for Scalable Articulated 3D Asset Generation

Andrea Vedaldi; Christian Rupprecht; Chuanxia Zheng; Matt Zhou; Ruining Li; Shangzhe Wu; Xiaoyang Lyu; Zhaomou Song; Zhening Huang

arxiv: 2605.15187 · v1 · pith:NICLPOVGnew · submitted 2026-05-14 · 💻 cs.CV · cs.GR· cs.RO

Articraft: An Agentic System for Scalable Articulated 3D Asset Generation

Matt Zhou , Ruining Li , Xiaoyang Lyu , Zhaomou Song , Zhening Huang , Chuanxia Zheng , Christian Rupprecht , Andrea Vedaldi

show 1 more author

Shangzhe Wu

This is my paper

classification 💻 cs.CV cs.GRcs.RO

keywords articulatedassetsarticraftagenticassetharnessinterfacelarge

0 comments

read the original abstract

A bottleneck in learning to understand articulated 3D objects is the lack of large and diverse datasets. In this paper, we propose to leverage large language models (LLMs) to close this gap and generate articulated assets at scale. We reduce the problem of generating an articulated 3D asset to that of writing a program that builds it. We then introduce a new agentic system, Articraft, that writes such programs automatically. We design a programmatic interface and harness to help the LLM do so effectively. The LLM writes code against a domain-specific SDK for defining parts, composing geometry, specifying joints, and writing tests to validate the resulting assets. The harness exposes a restricted workspace and interface to the LLM, validates the resulting assets, and returns structured feedback. In this way, the LLM is not distracted by details such as authoring a URDF file or managing a complex software environment. We show that this produces higher-quality assets than both state-of-the-art articulated-asset generators and general-purpose coding agents. Using Articraft, we build Articraft-10K, a curated dataset of over 10K articulated assets spanning 245 categories, and show its utility both for training models of articulated assets and in downstream applications such as robotics simulation and virtual reality.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

UnfoldArt: Zero-Shot Recovery of Full Articulated 3D Objects from Text or Image
cs.CV 2026-06 unverdicted novelty 7.0

UnfoldArt uses multi-agent debate grounded in vision-language and video models to infer articulation parameters and reconstruct full 3D objects including occluded parts from text or image inputs.
P3D-Bench: Benchmarking MLLMs for Parametric 3D Generation and Structural Reasoning
cs.CV 2026-06 unverdicted novelty 7.0

P3D-Bench is a benchmark with three task families that scores MLLMs on generating executable parametric 3D programs, finding failures in precise geometry and part assembly.
3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code
cs.CV 2026-05 unverdicted novelty 7.0

3DCodeBench is a new benchmark evaluating 12 VLMs on translating multimodal prompts into procedural 3D modeling code, paired with 3DCodeArena for human preference rankings.
Sequential Planning via Anchored Robotic Keypoints
cs.RO 2026-06 unverdicted novelty 6.0

SPARK reaches 43.7% success on six LIBERO-PRO cells by LLM-generated typed behavior trees plus multi-prompt perception and recovery, more than doubling CaP-Agent0 and VLA baselines.