pith. sign in

arxiv: 2409.17106 · v1 · pith:QRTKL3RJnew · submitted 2024-09-25 · 💻 cs.CV · cs.GR

Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts

classification 💻 cs.CV cs.GR
keywords modelsframeworkgenerategeneratingtexttext2cadannotationsdataset
0
0 comments X
read the original abstract

Prototyping complex computer-aided design (CAD) models in modern softwares can be very time-consuming. This is due to the lack of intelligent systems that can quickly generate simpler intermediate parts. We propose Text2CAD, the first AI framework for generating text-to-parametric CAD models using designer-friendly instructions for all skill levels. Furthermore, we introduce a data annotation pipeline for generating text prompts based on natural language instructions for the DeepCAD dataset using Mistral and LLaVA-NeXT. The dataset contains $\sim170$K models and $\sim660$K text annotations, from abstract CAD descriptions (e.g., generate two concentric cylinders) to detailed specifications (e.g., draw two circles with center $(x,y)$ and radius $r_{1}$, $r_{2}$, and extrude along the normal by $d$...). Within the Text2CAD framework, we propose an end-to-end transformer-based auto-regressive network to generate parametric CAD models from input texts. We evaluate the performance of our model through a mixture of metrics, including visual quality, parametric precision, and geometrical accuracy. Our proposed framework shows great potential in AI-aided design applications. Our source code and annotations will be publicly available.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Text2CAD-Bench: A Benchmark for LLM-based Text-to-Parametric CAD Generation

    cs.LG 2026-05 unverdicted novelty 7.0

    Text2CAD-Bench supplies 600 dual-prompt examples across four geometric and domain levels to test LLMs on text-to-parametric CAD, finding solid basic performance but sharp drops on complex topology and advanced features.

  2. Physics-in-the-Loop: A Hybrid Agentic Architecture for Validated CAD Engineering Design

    cs.CV 2026-05 unverdicted novelty 6.0

    A hybrid agentic architecture integrates knowledge-based physical verification tools into LLM-driven CAD design loops, producing more complex and functionally valid designs than prior agentic baselines.

  3. Beyond NL2Code: A Structured Survey of Multimodal Code Intelligence

    cs.CL 2026-06 unverdicted novelty 3.0

    A structured survey of multimodal code intelligence that formulates the field by code roles and organizes work into four domains while proposing verification-centered research directions.