Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts

Didier Stricker; Mohammad Sadil Khan; Muhammad Zeshan Afzal; Sankalp Sinha; Sk Aziz Ali; Talha Uddin Sheikh

arxiv: 2409.17106 · v1 · pith:QRTKL3RJnew · submitted 2024-09-25 · 💻 cs.CV · cs.GR

Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts

Mohammad Sadil Khan , Sankalp Sinha , Talha Uddin Sheikh , Didier Stricker , Sk Aziz Ali , Muhammad Zeshan Afzal This is my paper

classification 💻 cs.CV cs.GR

keywords modelsframeworkgenerategeneratingtexttext2cadannotationsdataset

0 comments

read the original abstract

Prototyping complex computer-aided design (CAD) models in modern softwares can be very time-consuming. This is due to the lack of intelligent systems that can quickly generate simpler intermediate parts. We propose Text2CAD, the first AI framework for generating text-to-parametric CAD models using designer-friendly instructions for all skill levels. Furthermore, we introduce a data annotation pipeline for generating text prompts based on natural language instructions for the DeepCAD dataset using Mistral and LLaVA-NeXT. The dataset contains $\sim170$K models and $\sim660$K text annotations, from abstract CAD descriptions (e.g., generate two concentric cylinders) to detailed specifications (e.g., draw two circles with center $(x,y)$ and radius $r_{1}$, $r_{2}$, and extrude along the normal by $d$...). Within the Text2CAD framework, we propose an end-to-end transformer-based auto-regressive network to generate parametric CAD models from input texts. We evaluate the performance of our model through a mixture of metrics, including visual quality, parametric precision, and geometrical accuracy. Our proposed framework shows great potential in AI-aided design applications. Our source code and annotations will be publicly available.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Text2CAD-Bench: A Benchmark for LLM-based Text-to-Parametric CAD Generation
cs.LG 2026-05 unverdicted novelty 7.0

Text2CAD-Bench supplies 600 dual-prompt examples across four geometric and domain levels to test LLMs on text-to-parametric CAD, finding solid basic performance but sharp drops on complex topology and advanced features.
Physics-in-the-Loop: A Hybrid Agentic Architecture for Validated CAD Engineering Design
cs.CV 2026-05 unverdicted novelty 6.0

A hybrid agentic architecture integrates knowledge-based physical verification tools into LLM-driven CAD design loops, producing more complex and functionally valid designs than prior agentic baselines.
Beyond NL2Code: A Structured Survey of Multimodal Code Intelligence
cs.CL 2026-06 unverdicted novelty 3.0

A structured survey of multimodal code intelligence that formulates the field by code roles and organizes work into four domains while proposing verification-centered research directions.