pith. sign in

Art3D: Training-Free 3D Generation from Flat-Colored Illustration

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it
abstract

Large-scale pre-trained image-to-3D generative models have exhibited remarkable capabilities in diverse shape generations. However, most of them struggle to synthesize plausible 3D assets when the reference image is flat-colored like hand drawings due to the lack of 3D illusion, which are often the most user-friendly input modalities in art content creation. To this end, we propose Art3D, a training-free method that can lift flat-colored 2D designs into 3D. By leveraging structural and semantic features with pre-trained 2D image generation models and a VLM-based realism evaluation, Art3D successfully enhances the three-dimensional illusion in reference images, thus simplifying the process of generating 3D from 2D, and proves adaptable to a wide range of painting styles. To benchmark the generalization performance of existing image-to-3D models on flat-colored images without 3D feeling, we collect a new dataset, Flat-2D, with over 100 samples. Experimental results demonstrate the performance and robustness of Art3D, exhibiting superior generalizable capacity and promising practical applicability. Our source code and dataset will be publicly available on our project page: https://joy-jy11.github.io/ .

fields

cs.CV 2

years

2026 1 2025 1

verdicts

UNVERDICTED 2

clear filters

representative citing papers

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Art3D: Training-Free 3D Generation from Flat-Colored Illustration cs.CV · 2025-04-14 · unverdicted · none · ref 48 · internal anchor

    Art3D enhances flat-colored 2D illustrations with 3D illusion using pre-trained 2D model features and VLM realism evaluation, then generates 3D, while introducing the Flat-2D benchmark dataset.