Art3D: Training-Free 3D Generation from Flat-Colored Illustration

Abstract

Large-scale pre-trained image-to-3D generative models have exhibited remarkable capabilities in diverse shape generation. However, most of them struggle to synthesize plausible 3D assets when the reference image is flat-colored, like a hand drawing, because such images lack a 3D illusion, even though they are often the most user-friendly input modality in art content creation. To this end, we propose Art3D, a training-free method that can lift flat-colored 2D designs into 3D. By leveraging structural and semantic features with pre-trained 2D image generation models and a VLM-based realism evaluation, Art3D successfully enhances the three-dimensional illusion in reference images, thus simplifying the process of generating 3D from 2D, and proves adaptable to a wide range of painting styles. To benchmark the generalization performance of existing image-to-3D models on flat-colored images without a 3D feel, we collect a new dataset, Flat-2D, with over 100 samples. Experimental results demonstrate the performance and robustness of Art3D, exhibiting superior generalization capacity and promising practical applicability. Our source code and dataset will be publicly available on our project page: https://joy-jy11.github.io/.
